Trending Research

DoRA: Weight-Decomposed Low-Rank Adaptation

NVlabs/DoRA • • 14 Feb 2024

By employing DoRA, we enhance both the learning capacity and training stability of LoRA while avoiding any additional inference overhead.

144

0.69 stars / hour

Paper
Code

MemGPT: Towards LLMs as Operating Systems

cpacker/memgpt • 12 Oct 2023

Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis.

Management

9,747

0.67 stars / hour

Paper
Code

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

FoundationVision/VAR • • 3 Apr 2024

We present Visual AutoRegressive modeling (VAR), a new generation paradigm that redefines the autoregressive learning on images as coarse-to-fine "next-scale prediction" or "next-resolution prediction", diverging from the standard raster-scan "next-token prediction".

Ranked #7 on Image Generation on ImageNet 256x256

Image Generation Language Modelling +2

3,445

0.61 stars / hour

Paper
Code

WavCraft: Audio Editing and Generation with Natural Language Prompts

jinhualiang/wavcraft • • 14 Mar 2024

We introduce WavCraft, a collective system that leverages large language models (LLMs) to connect diverse task-specific models for audio content creation and editing.

In-Context Learning

289

0.60 stars / hour

Paper
Code

TensorIR: An Abstraction for Automatic Tensorized Program Optimization

mlc-ai/web-llm • • 9 Jul 2022

Finally, we build an end-to-end framework on top of our abstraction to automatically optimize deep learning models for given tensor computation primitives.

BIG-bench Machine Learning

9,626

0.55 stars / hour

Paper
Code

SUQL: Conversational Search over Structured and Unstructured Data with Large Language Models

stanford-oval/suql • 16 Nov 2023

This paper presents the first conversational agent that supports the full generality of hybrid data access for large knowledge corpora, through a language we developed called SUQL (Structured and Unstructured Query Language).

Conversational Search In-Context Learning +1

135

0.53 stars / hour

Paper
Code

Lightplane: Highly-Scalable Components for Neural 3D Fields

facebookresearch/lightplane • • 30 Apr 2024

Contemporary 3D research, particularly in reconstruction and generation, heavily relies on 2D images for inputs or supervision.

3D Reconstruction

182

0.52 stars / hour

Paper
Code

Transcending Forgery Specificity with Latent Space Augmentation for Generalizable Deepfake Detection

sclbd/deepfakebench • • 19 Nov 2023

Deepfake detection faces a critical generalization hurdle, with performance deteriorating when there is a mismatch between the distributions of training and testing data.

DeepFake Detection Face Swapping +1