Trending Research

Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small

openai/transformer-debugger • 1 Nov 2022

Research in mechanistic interpretability seeks to explain behaviors of machine learning models in terms of their internal components.

Language Modelling

3,853

0.29 stars / hour

DoRA: Weight-Decomposed Low-Rank Adaptation

NVlabs/DoRA • 14 Feb 2024

By employing DoRA, we enhance both the learning capacity and training stability of LoRA while avoiding any additional inference overhead.

211

0.29 stars / hour

SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning

nzolman/sindy-rl • 14 Mar 2024

Deep reinforcement learning (DRL) has shown significant promise for uncovering sophisticated control policies that interact in environments with complicated dynamics, such as stabilizing the magnetohydrodynamics of a tokamak fusion reactor or minimizing the drag force exerted on an object in a fluid flow.

Dictionary Learning, Model-based Reinforcement Learning +1

0.27 stars / hour

MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation

csyxwei/masterweaver • 9 May 2024

In this work, we present MasterWeaver, a test-time tuning-free method designed to generate personalized images with both faithful identity fidelity and flexible editability.

Text-to-Image Generation

0.27 stars / hour

Perseus: Removing Energy Bloat from Large Model Training

ml-energy/zeus • 12 Dec 2023

Training large AI models on numerous GPUs consumes a massive amount of energy.

167

0.27 stars / hour

QLoRA: Efficient Finetuning of Quantized LLMs

internlm/xtuner • NeurIPS 2023

Our best model family, which we name Guanaco, outperforms all previous openly released models on the Vicuna benchmark, reaching 99.3% of the performance level of ChatGPT while only requiring 24 hours of finetuning on a single GPU.

Chatbot, Instruction Following +2