s1: Simple test-time scaling

simplescaling/s1 31 Jan 2025

After supervised finetuning the Qwen2. 5-32B-Instruct language model on s1K and equipping it with budget forcing, our model s1-32B exceeds o1-preview on competition math questions by up to 27% (MATH and AIME24).

Language Modeling Language Modelling +2

5,015
8.45 stars / hour

Data Formulator 2: Iteratively Creating Rich Visualizations with AI

microsoft/data-formulator 28 Aug 2024

To create rich visualizations, data analysts often need to iterate back and forth among data processing and chart specification to achieve their goals.

Code Generation Navigate

4,190
4.19 stars / hour

LLM4Decompile: Decompiling Binary Code with Large Language Models

albertan017/LLM4Decompile 8 Mar 2024

Decompilation aims to convert binary code to high-level source code, but traditional tools like Ghidra often produce results that are difficult to read and execute.

HumanEval

4,218
2.93 stars / hour

Cut Your Losses in Large-Vocabulary Language Models

unslothai/unsloth 13 Nov 2024

We implement a custom kernel that performs the matrix multiplications and the log-sum-exp reduction over the vocabulary in flash memory, making global memory consumption for the cross-entropy computation negligible.

26,700
1.92 stars / hour

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

deepseek-ai/deepseek-vl2 13 Dec 2024

We present DeepSeek-VL2, an advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL, through two key major upgrades.

Chart Understanding Optical Character Recognition +4

3,438
1.71 stars / hour

Flaming-hot Initiation with Regular Execution Sampling for Large Language Models

volcengine/verl 28 Oct 2024

Since the release of ChatGPT, large language models (LLMs) have demonstrated remarkable capabilities across various domains.

Diversity Math

2,921
1.38 stars / hour

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

hkust-nlp/simplerl-reason 8 Jan 2025

We present rStar-Math to demonstrate that small language models (SLMs) can rival or even surpass the math reasoning capability of OpenAI o1, without distillation from superior models.

Math

2,538
1.36 stars / hour

Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback

pku-alignment/align-anything 20 Dec 2024

In this work, we make the first attempt to fine-tune all-modality models (i. e. input and output with any modality, also named any-to-any models) using human preference data across all modalities (including text, image, audio, and video), ensuring its behavior aligns with human intentions.

Instruction Following

1,733
1.04 stars / hour

FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration

fireredteam/fireredasr 24 Jan 2025

We present FireRedASR, a family of large-scale automatic speech recognition (ASR) models for Mandarin, designed to meet diverse requirements in superior performance and optimal efficiency across various applications.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

374
0.99 stars / hour