Trending Research

ChangeMamba: Remote Sensing Change Detection with Spatio-Temporal State Space Model

chenhongruixuan/mambacd • 4 Apr 2024

For the change decoder, which is available in all three architectures, we propose three spatio-temporal relationship modeling mechanisms, which can be naturally combined with the Mamba architecture and fully utilize its attribute to achieve spatio-temporal interaction of multi-temporal features, thereby obtaining accurate change information.

Ranked #1 on 2D Semantic Segmentation on xBD

2D Semantic Segmentation • Attribute • +1

139

0.41 stars / hour

Paper
Code

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

openbmb/minicpm • 9 Apr 2024

For data scaling, we introduce a Warmup-Stable-Decay (WSD) learning rate scheduler (LRS), conducive to continuous training and domain adaptation.

Domain Adaptation

3,634

0.39 stars / hour

Paper
Code
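
The MiniCPM entry above mentions a Warmup-Stable-Decay (WSD) learning rate scheduler. The listing does not give its formula, so the following is only a minimal sketch of the three-phase idea, assuming a linear warmup, a constant plateau, and a linear decay to a floor value. The function name `wsd_lr`, its parameters, and the linear decay shape are illustrative assumptions, not the paper's exact definition.

```python
def wsd_lr(step, peak_lr, warmup_steps, stable_steps, decay_steps, final_lr=0.0):
    """Hypothetical Warmup-Stable-Decay schedule (illustrative sketch only).

    Phase 1 (warmup): learning rate rises linearly up to peak_lr.
    Phase 2 (stable): learning rate is held constant at peak_lr.
    Phase 3 (decay):  learning rate falls to final_lr. A linear decay is
                      assumed here; the actual paper may use another shape.
    """
    if step < warmup_steps:
        # Linear warmup; step 0 already gets a non-zero rate.
        return peak_lr * (step + 1) / warmup_steps
    if step < warmup_steps + stable_steps:
        # Constant plateau.
        return peak_lr
    # Fraction of the decay phase completed, clamped to [0, 1].
    steps_into_decay = step - warmup_steps - stable_steps
    progress = min(1.0, steps_into_decay / max(1, decay_steps))
    return peak_lr + (final_lr - peak_lr) * progress


if __name__ == "__main__":
    # Print the schedule at a few points: 10 warmup, 20 stable, 10 decay steps.
    for s in (0, 9, 15, 30, 35, 40):
        print(s, wsd_lr(s, peak_lr=1e-3, warmup_steps=10, stable_steps=20, decay_steps=10))
```

One practical appeal of a schedule of this shape is that a checkpoint taken during the stable phase can be continued with more data, or decayed on a new domain, without redesigning the whole schedule, which fits the entry's mention of continuous training and domain adaptation.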

Chronos: Learning the Language of Time Series

amazon-science/chronos-forecasting • 12 Mar 2024

We introduce Chronos, a simple yet effective framework for pretrained probabilistic time series models.

Gaussian Processes • Language Modelling • +2

1,542

0.39 stars / hour

Paper
Code

AIOS: LLM Agent Operating System

agiresearch/aios • 25 Mar 2024

Inspired by these challenges, this paper presents AIOS, an LLM agent operating system, which embeds large language model into operating systems (OS) as the brain of the OS, enabling an operating system "with soul" -- an important step towards AGI.

Language Modelling • Large Language Model • +1

2,362

0.38 stars / hour

Paper
Code

JetMoE: Reaching Llama2 Performance with 0.1M Dollars

myshell-ai/jetmoe • 11 Apr 2024

Large Language Models (LLMs) have achieved remarkable results, but their increasing resource demand has become a major obstacle to the development of powerful and accessible super-human intelligence.

866

0.38 stars / hour

Paper
Code

LLoCO: Learning Long Contexts Offline

jeffreysijuntan/lloco • 11 Apr 2024

We introduce LLoCO, a technique that combines context compression, retrieval, and parameter-efficient finetuning using LoRA.

4k • In-Context Learning • +1

0.37 stars / hour

Paper
Code

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

idea-research/groundingdino • 9 Mar 2023

To effectively fuse language and vision modalities, we conceptually divide a closed-set detector into three phases and propose a tight fusion solution, which includes a feature enhancer, a language-guided query selection, and a cross-modality decoder for cross-modality fusion.

Ranked #1 on Zero-Shot Object Detection on MSCOCO

Referring Expression • Referring Expression Comprehension • +2

4,916

0.37 stars / hour

Paper
Code

PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models

3DAgentWorld/Toolkit-for-Prompt-Compression • 26 Mar 2024

Prompt compression is an innovative method for efficiently condensing input prompts while preserving essential information.

Code Completion • Few-Shot Learning • +2

114

0.36 stars / hour

Paper
Code

Less is More: Fewer Interpretable Region via Submodular Subset Selection

ruoyuchen10/smdl-attribution • 14 Feb 2024

For incorrectly predicted samples, our method achieves gains of 81.0% and 18.4% compared to the HSIC-Attribution algorithm in the average highest confidence and Insertion score, respectively.

Interpretability Techniques for Deep Learning

0.34 stars / hour

Paper
Code

HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach

airi-institute/hairfastgan • 1 Apr 2024

Our paper addresses the complex task of transferring a hairstyle from a reference image to an input photo for virtual hair try-on.

173

0.34 stars / hour

Paper
Code