How to Train Long-Context Language Models (Effectively)

princeton-nlp/prolong 3 Oct 2024

We study continued training and supervised fine-tuning (SFT) of a language model (LM) to make effective use of long-context information.

48
0.35 stars / hour

RouteLLM: Learning to Route LLMs with Preference Data

lm-sys/routellm 26 Jun 2024

Large language models (LLMs) exhibit impressive capabilities across a wide range of tasks, yet the choice of which model to use often involves a trade-off between performance and cost.

Data Augmentation, Transfer Learning

3,080
0.31 stars / hour

Diffusion Models are Evolutionary Algorithms

Zhangyanbo/diffusion-evolution 3 Oct 2024

In a convergence of machine learning and biology, we reveal that diffusion models are evolutionary algorithms.

Denoising, Evolutionary Algorithms

58
0.30 stars / hour

PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation

stevenlsw/physgen 27 Sep 2024

We present PhysGen, a novel image-to-video generation method that converts a single image and an input condition (e.g., force and torque applied to an object in the image) to produce a realistic, physically plausible, and temporally consistent video.

Image to Video Generation

129
0.29 stars / hour

Generative Agents: Interactive Simulacra of Human Behavior

joonspk-research/generative_agents 7 Apr 2023

Believable proxies of human behavior can empower interactive applications ranging from immersive environments to rehearsal spaces for interpersonal communication to prototyping tools.

Language Modelling, Large Language Model

16,702
0.29 stars / hour

From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging

YerbaPage/MGDebugger 2 Oct 2024

While large language models have made significant strides in code generation, the pass rate of the generated code is bottlenecked on subtle errors, often requiring human intervention to pass tests, especially for complex problems.

Auto Debugging, Bug fixing, +3

27
0.29 stars / hour

Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia

google-deepmind/concordia 6 Dec 2023

Agent-based modeling has been around for decades and has been applied widely across the social and natural sciences.

Common Sense Reasoning

617
0.28 stars / hour

Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation

hqhqaq/mip-adapter 26 Sep 2024

Personalized text-to-image generation methods, which generate customized images based on reference images, have garnered wide research interest.

Object Personalized Image Generation +1

46
0.26 stars / hour

CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control

hsliu-initial/ctrla 29 May 2024

Retrieval-augmented generation (RAG) has emerged as a promising solution for mitigating hallucinations of large language models (LLMs) with retrieved external knowledge.

RAG, Response Generation, +1

56
0.26 stars / hour

Data Formulator 2: Iteratively Creating Rich Visualizations with AI

microsoft/data-formulator 28 Aug 2024

To create rich visualizations, data analysts often need to iterate back and forth among data processing and chart specification to achieve their goals.

Code Generation, Navigate

95
0.25 stars / hour