MMaDA: Multimodal Large Diffusion Language Models

gen-verse/mmada 21 May 2025

We introduce MMaDA, a novel class of multimodal diffusion foundation models designed to achieve superior performance across diverse domains such as textual reasoning, multimodal understanding, and text-to-image generation.

Reinforcement Learning (RL), Text-to-Image Generation

342
9.39 stars / hour

AlphaEvolve: A Learning Framework to Discover Novel Alphas in Quantitative Investment

codelion/openevolve 30 Mar 2021

In this paper, we introduce a new class of alphas to model scalar, vector, and matrix features which possess the strengths of these two existing classes.

AutoML, Stock Prediction

659
3.87 stars / hour

Aligning Anime Video Generation with Human Feedback

bilibili/index-anisora 14 Apr 2025

Existing reward models, designed primarily for real-world videos, fail to capture the unique appearance and consistency requirements of anime.

Video Generation

762
3.32 stars / hour

Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting

bytedance/dolphin 20 May 2025

Document image parsing is challenging due to its complexly intertwined elements such as text paragraphs, figures, formulas, and tables.

200
3.21 stars / hour

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

jiuhaichen/blip3o 14 May 2025

Building on our innovative model design, training recipe, and datasets, we develop BLIP3-o, a suite of state-of-the-art unified multimodal models.

Image Generation

806
1.89 stars / hour

Parallel Scaling Law for Language Models

qwenlm/parscale 15 May 2025

We apply $P$ diverse and learnable transformations to the input, execute forward passes of the model in parallel, and dynamically aggregate the $P$ outputs.

292
1.58 stars / hour

Fully Open Source Moxin-7B Technical Report

moxin-org/moxin-llm 8 Dec 2024

Recently, Large Language Models (LLMs) have undergone a significant transformation, marked by a rapid rise in both their popularity and capabilities.

438
1.28 stars / hour

Group-in-Group Policy Optimization for LLM Agent Training

langfengq/verl-agent 16 May 2025

In this work, we propose Group-in-Group Policy Optimization (GiGPO), a novel RL algorithm that achieves fine-grained credit assignment for LLM agents while preserving the appealing properties of group-based RL: critic-free, low memory, and stable convergence.

Mathematical Reasoning, Reinforcement Learning (RL)

145
1.04 stars / hour

Visual Planning: Let's Think Only with Images

yix8/visualplanning 16 May 2025

Recent advancements in Large Language Models (LLMs) and their multimodal extensions (MLLMs) have substantially enhanced machine reasoning across diverse tasks.

Reinforcement Learning

103
1.00 stars / hour

Thinkless: LLM Learns When to Think

vainf/thinkless 19 May 2025

Reasoning Language Models, capable of extended chain-of-thought reasoning, have demonstrated remarkable performance on tasks requiring complex logical inference.

GSM8K, Math

68
0.99 stars / hour