Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

lllyasviel/framepack 17 Apr 2025

We present a neural network structure, FramePack, to train next-frame (or next-frame-section) prediction models for video generation.

12,846
0.53 stars / hour

DUET: Dual Clustering Enhanced Multivariate Time Series Forecasting

decisionintelligence/tfb 14 Dec 2024

First, we design a Temporal Clustering Module (TCM) that clusters time series into fine-grained distributions to handle heterogeneous temporal patterns.

Clustering energy management +4

816
0.51 stars / hour

SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing

bytedance/superedit 5 May 2025

This includes rectifying the editing instructions to better align with the original-edited image pairs and using contrastive editing instructions to further enhance their effectiveness.

Triplet

96
0.48 stars / hour

Zep: A Temporal Knowledge Graph Architecture for Agent Memory

getzep/graphiti 20 Jan 2025

We introduce Zep, a novel memory layer service for AI agents that outperforms the current state-of-the-art system, MemGPT, in the Deep Memory Retrieval (DMR) benchmark.

RAG Retrieval

8,587
0.47 stars / hour

VGGT: Visual Geometry Grounded Transformer

facebookresearch/vggt 14 Mar 2025

We present VGGT, a feed-forward neural network that directly infers all key 3D attributes of a scene, including camera parameters, point maps, depth maps, and 3D point tracks, from one, a few, or hundreds of its views.

Depth Estimation Novel View Synthesis +3

6,525
0.40 stars / hour

Step1X-Edit: A Practical Framework for General Image Editing

stepfun-ai/step1x-edit 24 Apr 2025

In recent years, image editing models have witnessed remarkable and rapid development.

Image Manipulation

1,195
0.39 stars / hour

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

foundationagents/awesome-foundation-agents 31 Mar 2025

The advent of large language models (LLMs) has catalyzed a transformative shift in artificial intelligence, paving the way for advanced intelligent agents capable of sophisticated reasoning, robust perception, and versatile action across diverse domains.

 Ranked #1 on Continual Learning on AIDS (using extra training data)

AutoML Continual Learning

1,246
0.34 stars / hour

Multi-Modality Representation Learning for Antibody-Antigen Interactions Prediction

trashtian/mulaaip 22 Mar 2025

While deep learning models play a crucial role in predicting antibody-antigen interactions (AAI), the scarcity of publicly available sequence-structure pairings constrains their generalization.

Graph Attention Prediction +1

37
0.32 stars / hour

RM-R1: Reward Modeling as Reasoning

rm-r1-uiuc/rm-r1 5 May 2025

In this work, we introduce a new class of generative reward models -- Reasoning Reward Models (ReasRMs) -- which formulate reward modeling as a reasoning task.

61
0.31 stars / hour

Qwen2.5 Technical Report

qwenlm/qwen2.5 19 Dec 2024

In addition, for hosted solutions, the proprietary models currently include two mixture-of-experts (MoE) variants: Qwen2. 5-Turbo and Qwen2. 5-Plus, both available from Alibaba Cloud Model Studio.

Ranked #7 on on GPQA

Common Sense Reasoning +5

20,850
0.28 stars / hour