Trending Research

BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation

flagopen/flagembedding • • 5 Feb 2024

It can simultaneously perform the three common retrieval functionalities of embedding model: dense retrieval, multi-vector retrieval, and sparse retrieval, which provides a unified model foundation for real-world IR applications.

Retrieval Self-Knowledge Distillation

4,739

0.35 stars / hour

Paper
Code

Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance

KU-CVLAB/Perturbed-Attention-Guidance • • 26 Mar 2024

These techniques are often not applicable in unconditional generation or in various downstream tasks such as image restoration.

Deblurring Denoising +2

156

0.34 stars / hour

Paper
Code

PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models

3DAgentWorld/Toolkit-for-Prompt-Compression • • 26 Mar 2024

Prompt compression is an innovative method for efficiently condensing input prompts while preserving essential information.

Code Completion Few-Shot Learning +2

130

0.34 stars / hour

Paper
Code

TFB: Towards Comprehensive and Fair Benchmarking of Time Series Forecasting Methods

decisionintelligence/tfb • • 29 Mar 2024

Next, we employ TFB to perform a thorough evaluation of 21 Univariate Time Series Forecasting (UTSF) methods on 8, 068 univariate time series and 14 Multivariate Time Series Forecasting (MTSF) methods on 25 datasets.

Benchmarking Multivariate Time Series Forecasting +2

114

0.33 stars / hour

Paper
Code

SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing

modelscope/swift • • 18 Dec 2023

Image diffusion models have been utilized in various tasks, such as text-to-image generation and controllable image synthesis.

Text-to-Image Generation

1,095

0.33 stars / hour

Paper
Code

QLoRA: Efficient Finetuning of Quantized LLMs

internlm/xtuner • • NeurIPS 2023

Our best model family, which we name Guanaco, outperforms all previous openly released models on the Vicuna benchmark, reaching 99. 3% of the performance level of ChatGPT while only requiring 24 hours of finetuning on a single GPU.

Chatbot Instruction Following +2

1,374

0.32 stars / hour

Paper
Code

DUSt3R: Geometric 3D Vision Made Easy

naver/dust3r • • 21 Dec 2023

Our formulation directly provides a 3D model of the scene as well as depth information, but interestingly, we can seamlessly recover from it, pixel matches, relative and absolute camera.

3D Reconstruction Camera Calibration +2

4,078

0.32 stars / hour

Paper
Code

JetMoE: Reaching Llama2 Performance with 0.1M Dollars

myshell-ai/jetmoe • • 11 Apr 2024

Large Language Models (LLMs) have achieved remarkable results, but their increasing resource demand has become a major obstacle to the development of powerful and accessible super-human intelligence.

890

0.31 stars / hour

Paper
Code

ObjectLab: Automated Diagnosis of Mislabeled Images in Object Detection Data

cleanlab/cleanlab • • 2 Sep 2023

Despite powering sensitive systems like autonomous vehicles, object detection remains fairly brittle in part due to annotation errors that plague most real-world training datasets.

Autonomous Vehicles Object +2

8,597

0.30 stars / hour

Paper
Code

ReFT: Representation Finetuning for Language Models

stanfordnlp/pyreft • • 4 Apr 2024

LoReFT is a drop-in replacement for existing PEFTs and learns interventions that are 10x-50x more parameter-efficient than prior state-of-the-art PEFTs.

Arithmetic Reasoning

549

0.29 stars / hour

Paper
Code