Data Formulator 2: Iteratively Creating Rich Visualizations with AI

microsoft/data-formulator 28 Aug 2024

To create rich visualizations, data analysts often need to iterate back and forth among data processing and chart specification to achieve their goals.

Code Generation Navigate

6,422
6.01 stars / hour

LLM4Decompile: Decompiling Binary Code with Large Language Models

albertan017/LLM4Decompile 8 Mar 2024

Decompilation aims to convert binary code to high-level source code, but traditional tools like Ghidra often produce results that are difficult to read and execute.

HumanEval

4,922
3.82 stars / hour

Cut Your Losses in Large-Vocabulary Language Models

unslothai/unsloth 13 Nov 2024

We implement a custom kernel that performs the matrix multiplications and the log-sum-exp reduction over the vocabulary in flash memory, making global memory consumption for the cross-entropy computation negligible.

29,112
2.33 stars / hour

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

seal-rg/recurrent-pretraining 7 Feb 2025

We scale a proof-of-concept model to 3. 5 billion parameters and 800 billion tokens.

Language Modeling Language Modelling

474
2.29 stars / hour

FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration

fireredteam/fireredasr 24 Jan 2025

We present FireRedASR, a family of large-scale automatic speech recognition (ASR) models for Mandarin, designed to meet diverse requirements in superior performance and optimal efficiency across various applications.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

497
2.23 stars / hour

Flaming-hot Initiation with Regular Execution Sampling for Large Language Models

volcengine/verl 28 Oct 2024

Since the release of ChatGPT, large language models (LLMs) have demonstrated remarkable capabilities across various domains.

Diversity Math

3,176
1.57 stars / hour

s1: Simple test-time scaling

simplescaling/s1 31 Jan 2025

After supervised finetuning the Qwen2. 5-32B-Instruct language model on s1K and equipping it with budget forcing, our model s1-32B exceeds o1-preview on competition math questions by up to 27% (MATH and AIME24).

Language Modeling Language Modelling +2

5,303
1.23 stars / hour

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

hkust-nlp/simplerl-reason 8 Jan 2025

We present rStar-Math to demonstrate that small language models (SLMs) can rival or even surpass the math reasoning capability of OpenAI o1, without distillation from superior models.

Math

2,661
1.21 stars / hour

Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

facebookresearch/audiobox-aesthetics 7 Feb 2025

The quantification of audio aesthetics remains a complex challenge in audio processing, primarily due to its subjective nature, which is influenced by human perception and cultural context.

Benchmarking

292
1.17 stars / hour

MedRAX: Medical Reasoning Agent for Chest X-ray

bowang-lab/medrax 4 Feb 2025

Chest X-rays (CXRs) play an integral role in driving critical decisions in disease management and patient care.

AI Agent Management

351
1.11 stars / hour