LLMs Get Lost In Multi-Turn Conversation

microsoft/lost_in_conversation 9 May 2025

Large Language Models (LLMs) are conversational interfaces.

78
0.61 stars / hour

Zep: A Temporal Knowledge Graph Architecture for Agent Memory

getzep/graphiti 20 Jan 2025

We introduce Zep, a novel memory layer service for AI agents that outperforms the current state-of-the-art system, MemGPT, in the Deep Memory Retrieval (DMR) benchmark.

RAG Retrieval

9,171
0.60 stars / hour

Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents

simular-ai/agent-s 1 Apr 2025

Computer use agents automate digital tasks by directly interacting with graphical user interfaces (GUIs) on computers and mobile devices, offering significant potential to enhance human productivity by completing an open-ended space of user queries.

AI Agent Task Planning

4,953
0.59 stars / hour

LBM: Latent Bridge Matching for Fast Image-to-Image Translation

gojasper/lbm 10 Mar 2025

In this paper, we introduce Latent Bridge Matching (LBM), a new, versatile and scalable method that relies on Bridge Matching in a latent space to achieve fast image-to-image translation.

Depth Estimation Image Relighting +2

495
0.58 stars / hour

SkyReels-V2: Infinite-length Film Generative Model

skyworkai/skyreels-v2 17 Apr 2025

Recent advances in video generation have been driven by diffusion models and autoregressive frameworks, yet critical challenges persist in harmonizing prompt adherence, visual quality, motion dynamics, and duration: compromises in motion dynamics to enhance temporal visual quality, constrained video duration (5-10 seconds) to prioritize resolution, and inadequate shot-aware generation stemming from general-purpose MLLMs' inability to interpret cinematic grammar, such as shot composition, actor expressions, and camera motions.

Large Language Model model +2

2,361
0.56 stars / hour

WebThinker: Empowering Large Reasoning Models with Deep Research Capability

ruc-nlpir/webthinker 30 Apr 2025

Large reasoning models (LRMs), such as OpenAI-o1 and DeepSeek-R1, demonstrate impressive long-horizon reasoning capabilities.

Navigate

844
0.52 stars / hour

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

zhiyuanhubj/meta-ability-alignment 15 May 2025

Large reasoning models (LRMs) already possess a latent capacity for long chain-of-thought reasoning.

Math reinforcement-learning +2

34
0.44 stars / hour

Unified Continuous Generative Models

LINs-Lab/UCGM 12 May 2025

We introduce a unified framework for training, sampling, and analyzing these models.

Image Generation

89
0.44 stars / hour

AlphaNet: Scaling Up Local-frame-based Atomistic Interatomic Potential

zmyybc/alphanet 13 Jan 2025

Molecular dynamics simulations demand an unprecedented combination of accuracy and scalability to tackle grand challenges in catalysis and materials design.

Computational Efficiency

91
0.42 stars / hour

Fast Text-to-Audio Generation with Adversarial Post-Training

stability-ai/stable-audio-tools 13 May 2025

Text-to-audio systems, while increasingly performant, are slow at inference time, thus making their latency unpractical for many creative applications.

ARC Audio Generation

3,196
0.40 stars / hour