Trending Research

A Survey on Vision Mamba: Models, Applications and Challenges

ruixxxx/awesome-vision-mamba-models • • 29 Apr 2024

To help keep pace with the rapid advancements in computer vision, this paper aims to provide a comprehensive review of visual Mamba approaches.

125

0.34 stars / hour

Paper
Code

AIOS: LLM Agent Operating System

agiresearch/aios • 25 Mar 2024

Inspired by these challenges, this paper presents AIOS, an LLM agent operating system, which embeds large language model into operating systems (OS) as the brain of the OS, enabling an operating system "with soul" -- an important step towards AGI.

Language Modelling Large Language Model +1

2,716

0.33 stars / hour

Paper
Code

CLLMs: Consistency Large Language Models

hao-ai-lab/Consistency_LLM • • 28 Feb 2024

Parallel decoding methods such as Jacobi decoding show promise for more efficient LLM inference as it breaks the sequential nature of the LLM decoding process and transforms it into parallelizable computation.

270

0.33 stars / hour

Paper
Code

decoupleQ: Towards 2-bit Post-Training Uniform Quantization via decoupling Parameters into Integer and Floating Points

bytedance/decoupleq • • 19 Apr 2024

However, existing quantization schemes suffer from significant accuracy degradation at very low bits, or require some additional computational overhead when deployed, making it difficult to be applied to large-scale applications in industry.

Quantization

0.32 stars / hour

Paper
Code

QLoRA: Efficient Finetuning of Quantized LLMs

internlm/xtuner • • NeurIPS 2023

Our best model family, which we name Guanaco, outperforms all previous openly released models on the Vicuna benchmark, reaching 99. 3% of the performance level of ChatGPT while only requiring 24 hours of finetuning on a single GPU.

Chatbot Instruction Following +2

2,701

0.31 stars / hour

Paper
Code

ImageInWords: Unlocking Hyper-Detailed Image Descriptions

google/imageinwords • 5 May 2024

To address these issues, we introduce ImageInWords (IIW), a carefully designed human-in-the-loop annotation framework for curating hyper-detailed image descriptions and a new dataset resulting from this process.

Specificity Text-to-Image Generation

146

0.30 stars / hour

Paper
Code

The Platonic Representation Hypothesis

minyoungg/platonic-rep • • 13 May 2024

We argue that representations in AI models, particularly deep networks, are converging.

0.30 stars / hour

Paper
Code

MRSegmentator: Robust Multi-Modality Segmentation of 40 Classes in MRI and CT Sequences

hhaentze/mrsegmentator • • 10 May 2024

Results: The model showcased high accuracy in segmenting well-defined organs, achieving Dice Similarity Coefficient (DSC) scores of 0. 97 for the right and left lungs, and 0. 95 for the heart.

Model Optimization Organ Segmentation +2

0.30 stars / hour

Paper
Code

SceneTracker: Long-term Scene Flow Estimation Network

wwsource/scenetracker • • 29 Mar 2024

Considering the complementarity of scene flow estimation in the spatial domain's focusing capability and 3D object tracking in the temporal domain's coherence, this study aims to address a comprehensive new task that can simultaneously capture fine-grained and long-term 3D motion in an online manner: long-term scene flow estimation (LSFE).

3D Object Tracking Object Tracking +1

0.29 stars / hour

Paper
Code

A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model

start2020/awesome-timeseries-llm-fm • • 3 May 2024

There are two main research lines, namely pre-training foundation models from scratch for time series and adapting large language foundation models for time series.

Decision Making Few-Shot Learning +4

0.28 stars / hour

Paper
Code