s1: Simple test-time scaling

simplescaling/s1 31 Jan 2025

After supervised finetuning the Qwen2. 5-32B-Instruct language model on s1K and equipping it with budget forcing, our model s1-32B exceeds o1-preview on competition math questions by up to 27% (MATH and AIME24).

Language Modeling Language Modelling +2

4,465
12.74 stars / hour

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

deepseek-ai/deepseek-vl2 13 Dec 2024

We present DeepSeek-VL2, an advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL, through two key major upgrades.

Chart Understanding Optical Character Recognition +4

3,160
2.74 stars / hour

Flaming-hot Initiation with Regular Execution Sampling for Large Language Models

volcengine/verl 28 Oct 2024

Since the release of ChatGPT, large language models (LLMs) have demonstrated remarkable capabilities across various domains.

Diversity Math

2,579
2.07 stars / hour

Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback

pku-alignment/align-anything 20 Dec 2024

In this work, we make the first attempt to fine-tune all-modality models (i. e. input and output with any modality, also named any-to-any models) using human preference data across all modalities (including text, image, audio, and video), ensuring its behavior aligns with human intentions.

Instruction Following

1,555
1.92 stars / hour

ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills

lecar-lab/asap 3 Feb 2025

In the second stage, we deploy the policies in the real world and collect real-world data to train a delta (residual) action model that compensates for the dynamics mismatch.

455
1.82 stars / hour

DeepSeek-V3 Technical Report

deepseek-ai/deepseek-v3 27 Dec 2024

We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.

Language Modeling Language Modelling

80,072
1.34 stars / hour

Data Formulator 2: Iteratively Creating Rich Visualizations with AI

microsoft/data-formulator 28 Aug 2024

To create rich visualizations, data analysts often need to iterate back and forth among data processing and chart specification to achieve their goals.

Code Generation Navigate

1,679
1.03 stars / hour

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

intellabs/ragfoundry 5 Aug 2024

We introduce RAG Foundry, an open-source framework for augmenting large language models for RAG use cases.

676
0.95 stars / hour

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

deepseek-ai/DeepSeek-Coder 25 Jan 2024

The rapid development of large language models has revolutionized code intelligence in software development.

Code Generation Language Modeling +2

18,500
0.90 stars / hour