RSMamba: Remote Sensing Image Classification with State Space Model

KyanChen/RSMamba 28 Mar 2024

Remote sensing image classification forms the foundation of various understanding tasks, serving a crucial function in remote sensing image interpretation.

Classification, Image Classification, +2

43
1.38 stars / hour

Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework

ictmcg/make-your-anchor 25 Mar 2024

We adopt a two-stage training strategy for the diffusion model, effectively binding movements with specific appearances.

Denoising

135
1.26 stars / hour

BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion

tencentarc/brushnet 11 Mar 2024

Image inpainting, the process of restoring corrupted images, has seen significant advancements with the advent of diffusion models (DMs).

Image Inpainting

255
1.13 stars / hour

Long-CLIP: Unlocking the Long-Text Capability of CLIP

beichenzbc/long-clip 22 Mar 2024

Contrastive Language-Image Pre-training (CLIP) has been the cornerstone for zero-shot classification, text-image retrieval, and text-image generation by aligning image and text modalities.

Image Retrieval, Language Modelling, +3

161
1.11 stars / hour

Evolutionary Optimization of Model Merging Recipes

sakanaai/evolutionary-model-merge 19 Mar 2024

Surprisingly, our Japanese Math LLM achieved state-of-the-art performance on a variety of established Japanese LLM benchmarks, even surpassing models with significantly more parameters, despite not being explicitly trained for such tasks.

Evolutionary Algorithms, Math

810
1.09 stars / hour

One-Step Image Translation with Text-to-Image Models

gaparmar/img2img-turbo 18 Mar 2024

In this work, we address two limitations of existing conditional diffusion models: their slow inference speed due to the iterative denoising process and their reliance on paired data for model fine-tuning.

Denoising, Translation

773
1.00 stars / hour

FeatUp: A Model-Agnostic Framework for Features at Any Resolution

mhamilton723/FeatUp 15 Mar 2024

Deep features are a cornerstone of computer vision research, capturing image semantics and enabling the community to solve downstream tasks even in the zero- or few-shot regime.

Depth Estimation, Depth Prediction, +5

862
0.96 stars / hour

MegaBlocks: Efficient Sparse Training with Mixture-of-Experts

stanford-futuredata/megablocks 29 Nov 2022

We present MegaBlocks, a system for efficient Mixture-of-Experts (MoE) training on GPUs.

914
0.89 stars / hour

Logit Standardization in Knowledge Distillation

sunshangquan/logit-standardardization-kd 3 Mar 2024

Knowledge distillation involves transferring soft labels from a teacher to a student using a shared temperature-based softmax function.

Knowledge Distillation

81
0.80 stars / hour
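
For readers unfamiliar with the temperature-based softmax mentioned in the knowledge distillation entry above, here is a minimal sketch of the conventional setup, in which teacher and student logits are softened with the same temperature and compared with a KL divergence. This is the standard baseline formulation, not the logit standardization method from the listed repository. The function names and the default temperature of 4.0 are illustrative assumptions.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Turn raw logits into probabilities, softened by a temperature.

    Higher temperatures flatten the distribution; temperature=1.0 gives
    the ordinary softmax.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_kl(teacher_logits, student_logits, temperature=4.0):
    """KL(teacher || student) between soft labels at a shared temperature.

    Hypothetical helper for illustration: both models use the same
    temperature, which is the assumption the entry above describes.
    """
    p = softmax_with_temperature(teacher_logits, temperature)
    q = softmax_with_temperature(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


if __name__ == "__main__":
    teacher = [2.0, 1.0, 0.1]   # made-up logits for a 3-class problem
    student = [1.5, 1.2, 0.3]
    print(softmax_with_temperature(teacher, 1.0))
    print(softmax_with_temperature(teacher, 4.0))
    print(distillation_kl(teacher, student))
```

In practice this loss is often multiplied by the squared temperature so that its gradient magnitude stays comparable across temperature settings.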

LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression

microsoft/LLMLingua 19 Mar 2024

The challenge is that information entropy may be a suboptimal compression metric: (i) it only leverages unidirectional context and may fail to capture all essential information needed for prompt compression; (ii) it is not aligned with the prompt compression objective.

GSM8K, Language Modelling, +3

3,428
0.78 stars / hour