Comet: Fine-grained Computation-communication Overlapping for Mixture-of-Experts

bytedance/flux 27 Feb 2025

The inter-device communication of a MoE layer can occupy 47% of the entire model execution time with popular models and frameworks.

Computational Efficiency

749
0.71 stars / hour

Self-rewarding correction for mathematical reasoning

volcengine/verl 26 Feb 2025

We study self-rewarding reasoning large language models (LLMs), which can simultaneously generate step-by-step reasoning and evaluate the correctness of their outputs during inference, without external feedback.

Mathematical Reasoning

4,853
0.70 stars / hour

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

petergriffinjin/search-r1 12 Mar 2025

Efficiently acquiring external knowledge and up-to-date information is essential for effective reasoning and text generation in large language models (LLMs).

Question Answering, Reinforcement Learning (RL), +2

1,058
0.68 stars / hour

Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models

emcie-co/parlant 5 Mar 2025

We present Attentive Reasoning Queries (ARQs), a novel structured reasoning approach that significantly improves instruction-following in Large Language Models through domain-specialized reasoning blueprints.

Hallucination, Instruction Following, +1

1,835
0.63 stars / hour

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

tidedra/lmm-r1 10 Mar 2025

Enhancing reasoning in Large Multimodal Models (LMMs) faces unique challenges from the complex interplay between visual perception and logical reasoning, particularly in compact 3B-parameter architectures where architectural constraints limit reasoning capacity and modality alignment.

Logical Reasoning, Multimodal Reasoning, +1

550
0.62 stars / hour

ELIZA Reanimated: The world's first chatbot restored on the world's first time sharing system

rupertl/eliza-ctss 12 Jan 2025

The entire stack is open source, so that any user of a Unix-like OS can run the world's first chatbot on the world's first time-sharing system.

Chatbot

219
0.62 stars / hour

HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation

dcdmllm/healthgpt 14 Feb 2025

To train HealthGPT effectively, we devise a comprehensive medical domain-specific comprehension and generation dataset called VL-Health.

Language Modeling, Language Modelling, +1

513
0.62 stars / hour

2 OLMo 2 Furious

allenai/OLMo-core 31 Dec 2024

Our modified model architecture and training recipe achieve both better training stability and improved per-token efficiency.

132
0.60 stars / hour

GENERator: A Long-Context Generative Genomic Foundation Model

generteam/generator 11 Feb 2025

Recent developments in genomic language models have underscored the potential of LLMs in deciphering DNA sequences.

model

351
0.60 stars / hour

VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control

TencentARC/VideoPainter 7 Mar 2025

Video inpainting, which aims to restore corrupted video content, has experienced substantial progress.

Image Inpainting, Optical Flow Estimation, +3

214
0.58 stars / hour