A Survey on Vision Mamba: Models, Applications and Challenges

ruixxxx/awesome-vision-mamba-models 29 Apr 2024

To help keep pace with the rapid advancements in computer vision, this paper aims to provide a comprehensive review of visual Mamba approaches.

125
0.34 stars / hour

AIOS: LLM Agent Operating System

agiresearch/aios 25 Mar 2024

Inspired by these challenges, this paper presents AIOS, an LLM agent operating system, which embeds a large language model into the operating system (OS) as the brain of the OS, enabling an operating system "with soul" -- an important step towards AGI.

Language Modelling, Large Language Model, +1

2,716
0.33 stars / hour

CLLMs: Consistency Large Language Models

hao-ai-lab/Consistency_LLM 28 Feb 2024

Parallel decoding methods such as Jacobi decoding show promise for more efficient LLM inference, as they break the sequential nature of the LLM decoding process and transform it into parallelizable computation.

270
0.33 stars / hour

decoupleQ: Towards 2-bit Post-Training Uniform Quantization via decoupling Parameters into Integer and Floating Points

bytedance/decoupleq 19 Apr 2024

However, existing quantization schemes suffer from significant accuracy degradation at very low bits, or require additional computational overhead when deployed, making them difficult to apply to large-scale applications in industry.

Quantization

48
0.32 stars / hour

QLoRA: Efficient Finetuning of Quantized LLMs

internlm/xtuner NeurIPS 2023

Our best model family, which we name Guanaco, outperforms all previous openly released models on the Vicuna benchmark, reaching 99.3% of the performance level of ChatGPT while only requiring 24 hours of finetuning on a single GPU.

Chatbot, Instruction Following, +2

2,701
0.31 stars / hour

ImageInWords: Unlocking Hyper-Detailed Image Descriptions

google/imageinwords 5 May 2024

To address these issues, we introduce ImageInWords (IIW), a carefully designed human-in-the-loop annotation framework for curating hyper-detailed image descriptions and a new dataset resulting from this process.

Specificity, Text-to-Image Generation

146
0.30 stars / hour

The Platonic Representation Hypothesis

minyoungg/platonic-rep 13 May 2024

We argue that representations in AI models, particularly deep networks, are converging.

79
0.30 stars / hour

MRSegmentator: Robust Multi-Modality Segmentation of 40 Classes in MRI and CT Sequences

hhaentze/mrsegmentator 10 May 2024

Results: The model showcased high accuracy in segmenting well-defined organs, achieving Dice Similarity Coefficient (DSC) scores of 0.97 for the right and left lungs, and 0.95 for the heart.

Model Optimization, Organ Segmentation, +2

38
0.30 stars / hour
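
For readers unfamiliar with the Dice Similarity Coefficient quoted in the MRSegmentator entry above, the following is a minimal sketch of the standard definition, DSC = 2|A ∩ B| / (|A| + |B|), for two binary masks. The function name `dice_coefficient`, the flat 0/1 list representation, and the choice to return 1.0 when both masks are empty are assumptions made for illustration; this is not taken from the MRSegmentator code.

```python
def dice_coefficient(pred, target):
    """Dice Similarity Coefficient between two binary masks.

    Both masks are flat sequences of 0/1 values of equal length
    (an illustrative representation, not the paper's data format).
    """
    if len(pred) != len(target):
        raise ValueError("masks must have the same length")

    # Voxels labelled foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # Total foreground voxels in each mask, summed.
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)

    if total == 0:
        # Both masks empty: treated here as perfect agreement (assumed convention).
        return 1.0
    return 2.0 * intersection / total


if __name__ == "__main__":
    predicted = [1, 1, 0, 0]
    reference = [1, 0, 0, 0]
    print(dice_coefficient(predicted, reference))
```

A score of 1.0 means the predicted and reference masks overlap exactly, and 0.0 means they share no foreground voxels.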

SceneTracker: Long-term Scene Flow Estimation Network

wwsource/scenetracker 29 Mar 2024

Considering the complementarity of scene flow estimation in the spatial domain's focusing capability and 3D object tracking in the temporal domain's coherence, this study aims to address a comprehensive new task that can simultaneously capture fine-grained and long-term 3D motion in an online manner: long-term scene flow estimation (LSFE).

3D Object Tracking, Object Tracking, +1

55
0.29 stars / hour

A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model

start2020/awesome-timeseries-llm-fm 3 May 2024

There are two main research lines, namely pre-training foundation models from scratch for time series and adapting large language foundation models for time series.

Decision Making, Few-Shot Learning, +4

48
0.28 stars / hour