Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning

volcengine/verl 31 Mar 2025

We propose Rec-R1, a general reinforcement learning framework that bridges large language models (LLMs) with recommendation systems through closed-loop optimization.

General Reinforcement Learning Instruction Following +1

6,774
0.34 stars / hour

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

petergriffinjin/search-r1 12 Mar 2025

Efficiently acquiring external knowledge and up-to-date information is essential for effective reasoning and text generation in large language models (LLMs).

Question Answering Reinforcement Learning (RL) +2

1,911
0.34 stars / hour

VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning

vargpt-family/vargpt-v1.1 3 Apr 2025

Notably, through visual instruction tuning, the model acquires image editing functionality while maintaining architectural consistency with its predecessor, revealing the potential for unified visual understanding, generation, and editing.

Image Generation Instruction Following

193
0.34 stars / hour

MedSAM2: Segment Anything in 3D Medical Images and Videos

bowang-lab/medsam2 4 Apr 2025

Medical image and video segmentation is a critical task for precision medicine, which has witnessed considerable progress in developing task or modality-specific and generalist models for 2D images.

Segmentation Video Segmentation +1

104
0.33 stars / hour

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

multi-swe-bench/multi-swe-bench 3 Apr 2025

The task of issue resolving is to modify a codebase to generate a patch that addresses a given issue.

Reinforcement Learning (RL)

115
0.32 stars / hour

VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

om-ai-lab/vlm-r1 10 Apr 2025

Motivated by this observation, we investigate the extension of R1-style reinforcement learning to Vision-Language Models (VLMs), aiming to enhance their visual reasoning capabilities.

Language Modeling Language Modelling +7

4,675
0.31 stars / hour

Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning

ding523/Curr_REFT 10 Mar 2025

While state-of-the-art vision-language models (VLMs) have demonstrated remarkable capabilities in complex visual-text tasks, their success heavily relies on massive model scaling, limiting their practical deployment.

46
0.31 stars / hour

Perception-R1: Pioneering Perception Policy with Reinforcement Learning

linkangheng/pr1 10 Apr 2025

In this work, we return to the fundamentals and explore the effects of RL on different perception tasks.

reinforcement-learning Reinforcement Learning +1

93
0.30 stars / hour

Auto-configuring Exploration-Exploitation Tradeoff in Evolutionary Computation via Deep Reinforcement Learning

GMC-DRL/MetaBox 12 Apr 2024

Evolutionary computation (EC) algorithms, renowned as powerful black-box optimizers, leverage a group of individuals to cooperatively search for the optimum.

Deep Reinforcement Learning

104
0.29 stars / hour

HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation

dcdmllm/healthgpt 14 Feb 2025

To effectively learn the HealthGPT, we devise a comprehensive medical domain-specific comprehension and generation dataset called VL-Health.

Language Modeling Language Modelling +1

1,044
0.28 stars / hour