Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

apple/ml-depth-pro 2 Oct 2024

We present a foundation model for zero-shot metric monocular depth estimation.

Monocular Depth Estimation

2,594
6.23 stars / hour

Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration

ohayonguy/PMRF 1 Oct 2024

Photo-realistic image restoration algorithms are typically evaluated by distortion measures (e.g., PSNR, SSIM) and by perceptual quality measures (e.g., FID, NIQE), where the desire is to attain the lowest possible distortion without compromising on perceptual quality.

Ranked #1 on Blind Face Restoration on CelebA-Test (FID metric)

Blind Face Restoration Image Colorization +5

303
1.68 stars / hour

VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models

microsoft/vptq 25 Sep 2024

Due to the redundancy in LLM weights, recent research has focused on pushing weight-only quantization to extremely low-bit (even down to 2 bits).

Quantization

369
1.43 stars / hour

"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models

verazuo/jailbreak_llms 7 Aug 2023

We hope that our study can facilitate the research community and LLM vendors in promoting safer and regulated LLMs.

Community Detection

2,531
1.37 stars / hour

Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers

liruiw/HPT 30 Sep 2024

Previous robot learning methods often collect data to train with one specific embodiment for one task, which is expensive and prone to overfitting.

236
1.05 stars / hour

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

thudm/longwriter 13 Aug 2024

By incorporating this dataset into model training, we successfully scale the output length of existing models to over 10,000 words while maintaining output quality.

1,395
0.98 stars / hour

A Multi-Level Superoptimizer for Tensor Programs

mirage-project/mirage 9 May 2024

We introduce Mirage, the first multi-level superoptimizer for tensor programs.

Navigate

472
0.68 stars / hour

SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

thu-ml/SageAttention 3 Oct 2024

Although quantization has proven to be an effective method for accelerating model inference, existing quantization methods primarily focus on optimizing the linear layer.

Image Generation Quantization +1

148
0.66 stars / hour

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

ictnlp/llama-omni 10 Sep 2024

We build our model based on the latest Llama-3.1-8B-Instruct model.

2,315
0.59 stars / hour

Diffusion Models are Evolutionary Algorithms

Zhangyanbo/diffusion-evolution 3 Oct 2024

In a convergence of machine learning and biology, we reveal that diffusion models are evolutionary algorithms.

Denoising Evolutionary Algorithms

71
0.59 stars / hour