Continuous Thought Machines

SakanaAI/continuous-thought-machines 8 May 2025

The CTM has two core innovations: (1) neuron-level temporal processing, where each neuron uses unique weight parameters to process a history of incoming signals; and (2) neural synchronization employed as a latent representation.

Computational Efficiency Question Answering

841
0.62 stars / hour

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

going-doer/paper2code 24 Apr 2025

Despite the rapid growth of machine learning research, corresponding code implementations are often unavailable, making it slow and labor-intensive for researchers to reproduce results and build upon prior work.

Code Generation

2,213
0.61 stars / hour

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

zhaochen0110/openthinkimg 13 May 2025

We hope OpenThinkIMG can serve as a foundational framework for advancing dynamic, tool-augmented visual reasoning, helping the community develop AI agents that can genuinely "think with images".

Reinforcement Learning (RL) Visual Reasoning

175
0.60 stars / hour

Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models

1zhou-Wang/MemVR 4 Oct 2024

Despite their impressive capabilities, multimodal large language models (MLLMs) are prone to hallucinations, i. e., the generated content that is nonsensical or unfaithful to input sources.

Decoder Hallucination

111
0.57 stars / hour

Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs

wqzustc/high-performance-tensor-processing-engines 8 Mar 2025

Based on this notation and its transformations, we propose four optimization techniques that improve timing, area, and power consumption.

75
0.57 stars / hour

WorldPM: Scaling Human Preference Modeling

qwenlm/worldpm 15 May 2025

Motivated by scaling laws in language modeling that demonstrate how test loss scales as a power law with model and dataset sizes, we find that similar laws exist in preference modeling.

Language Modeling Language Modelling

72
0.56 stars / hour

Hyperspectral Image Land Cover Captioning Dataset for Vision Language Models

arya-domain/hypercap 18 May 2025

We introduce HyperCap, the first large-scale hyperspectral captioning dataset designed to enhance model performance and effectiveness in remote sensing applications.

Classification

24
0.54 stars / hour

Dynamic Early Exit in Reasoning Models

iie-ycx/deer 22 Apr 2025

Recent advances in large reasoning language models (LRLMs) rely on test-time scaling, which extends long chain-of-thought (CoT) generation to solve complex tasks.

GSM8K Math

32
0.53 stars / hour

SkyReels-V2: Infinite-length Film Generative Model

skyworkai/skyreels-v2 17 Apr 2025

Recent advances in video generation have been driven by diffusion models and autoregressive frameworks, yet critical challenges persist in harmonizing prompt adherence, visual quality, motion dynamics, and duration: compromises in motion dynamics to enhance temporal visual quality, constrained video duration (5-10 seconds) to prioritize resolution, and inadequate shot-aware generation stemming from general-purpose MLLMs' inability to interpret cinematic grammar, such as shot composition, actor expressions, and camera motions.

Large Language Model model +2

2,494
0.53 stars / hour

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

sail-sg/anytimereasoner 19 May 2025

However, such methods optimize only the final performance under a large and fixed token budget, which hinders efficiency in both training and deployment.

Mathematical Reasoning Reinforcement Learning (RL)

30
0.53 stars / hour