Efficient Reasoning Models: A Survey

fscdc/awesome-efficient-reasoning-models 15 Apr 2025

Reasoning models have demonstrated remarkable progress in solving complex and logic-intensive tasks by generating extended Chain-of-Thoughts (CoTs) prior to arriving at a final answer.

Knowledge Distillation, Model Compression, +1

93
0.33 stars / hour

MinerU: An Open-Source Solution for Precise Document Content Extraction

opendatalab/mineru 27 Sep 2024

Document content analysis has been a crucial research area in computer vision.

Diversity, Optical Character Recognition (OCR)

31,365
0.33 stars / hour

Olympus: A Universal Task Router for Computer Vision Tasks

yuanze-lin/Olympus 12 Dec 2024

We introduce Olympus, a new approach that transforms Multimodal Large Language Models (MLLMs) into a unified framework capable of handling a wide array of computer vision tasks.

309
0.32 stars / hour

MetaSpatial: Reinforcing 3D Spatial Reasoning in VLMs for the Metaverse

hiyouga/easyr1 24 Mar 2025

We present MetaSpatial, the first reinforcement learning (RL)-based framework designed to enhance 3D spatial reasoning in vision-language models (VLMs), enabling real-time 3D scene generation without the need for hard-coded optimizations.

Layout Generation, Reinforcement Learning (RL), +2

2,066
0.32 stars / hour

InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

bytedance/infiniteyou 20 Mar 2025

Achieving flexible and high-fidelity identity-preserved image generation remains formidable, particularly with advanced Diffusion Transformers (DiTs) like FLUX.

Image Generation

2,003
0.32 stars / hour

DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments

gair-nlp/deepresearcher 4 Apr 2025

In this paper, we introduce DeepResearcher, the first comprehensive framework for end-to-end training of LLM-based deep research agents through scaling reinforcement learning (RL) in real-world environments with authentic web search interactions.

Navigate, Prompt Engineering, +2

242
0.32 stars / hour

Holistic Fusion: Task- and Setup-Agnostic Robot Localization and State Estimation with Factor Graphs

leggedrobotics/holistic_fusion 8 Apr 2025

Seamless operation of mobile robots in challenging environments requires low-latency local motion estimation (e.g., dynamic maneuvers) and accurate global localization (e.g., wayfinding).

Motion Estimation, Sensor Fusion

76
0.31 stars / hour

Taccel: Scaling Up Vision-based Tactile Robotics via High-performance GPU Simulation

Taccel-Simulator/Taccel 17 Apr 2025

Tactile sensing is crucial for achieving human-level robotic capabilities in manipulation tasks.

Object Recognition, Robotic Grasping

48
0.31 stars / hour

TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second

automl/tabpfn 5 Jul 2022

We present TabPFN, a trained Transformer that can do supervised classification for small tabular datasets in less than a second, needs no hyperparameter tuning and is competitive with state-of-the-art classification methods.

AutoML, Bayesian Inference, +5

3,355
0.30 stars / hour

TorchFX: A modern approach to Audio DSP with PyTorch and GPU acceleration

matteospanio/torchfx 11 Apr 2025

We introduce TorchFX: a GPU-accelerated Python library for DSP, specifically engineered to facilitate sophisticated audio signal processing.

Audio Signal Processing, Benchmarking

64
0.30 stars / hour