DoRA: Weight-Decomposed Low-Rank Adaptation

NVlabs/DoRA 14 Feb 2024

By employing DoRA, we enhance both the learning capacity and training stability of LoRA while avoiding any additional inference overhead.

127
0.69 stars / hour

MemGPT: Towards LLMs as Operating Systems

cpacker/memgpt 12 Oct 2023

Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis.

Management

9,590
0.67 stars / hour

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

FoundationVision/VAR 3 Apr 2024

We present Visual AutoRegressive modeling (VAR), a new generation paradigm that redefines the autoregressive learning on images as coarse-to-fine "next-scale prediction" or "next-resolution prediction", diverging from the standard raster-scan "next-token prediction".

Image Generation Language Modelling +2

3,398
0.61 stars / hour

WavCraft: Audio Editing and Generation with Natural Language Prompts

jinhualiang/wavcraft 14 Mar 2024

We introduce WavCraft, a collective system that leverages large language models (LLMs) to connect diverse task-specific models for audio content creation and editing.

In-Context Learning

250
0.60 stars / hour

TensorIR: An Abstraction for Automatic Tensorized Program Optimization

mlc-ai/web-llm 9 Jul 2022

Finally, we build an end-to-end framework on top of our abstraction to automatically optimize deep learning models for given tensor computation primitives.

BIG-bench Machine Learning

9,521
0.55 stars / hour

SUQL: Conversational Search over Structured and Unstructured Data with Large Language Models

stanford-oval/suql 16 Nov 2023

This paper presents the first conversational agent that supports the full generality of hybrid data access for large knowledge corpora, through a language we developed called SUQL (Structured and Unstructured Query Language).

Conversational Search In-Context Learning +1

120
0.53 stars / hour

Lightplane: Highly-Scalable Components for Neural 3D Fields

facebookresearch/lightplane 30 Apr 2024

Contemporary 3D research, particularly in reconstruction and generation, heavily relies on 2D images for inputs or supervision.

3D Reconstruction

169
0.52 stars / hour

Transcending Forgery Specificity with Latent Space Augmentation for Generalizable Deepfake Detection

sclbd/deepfakebench 19 Nov 2023

Deepfake detection faces a critical generalization hurdle, with performance deteriorating when there is a mismatch between the distributions of training and testing data.

DeepFake Detection Face Swapping +1

287
0.52 stars / hour

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

FoundationVision/Groma 19 Apr 2024

We introduce Groma, a Multimodal Large Language Model (MLLM) with grounded and fine-grained visual perception ability.

Language Modelling Large Language Model +2

364
0.50 stars / hour

AgentScope: A Flexible yet Robust Multi-Agent Platform

modelscope/agentscope 21 Feb 2024

With the rapid advancement of Large Language Models (LLMs), significant progress has been made in multi-agent applications.

1,226
0.47 stars / hour