Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning

volcengine/verl 31 Mar 2025

We propose Rec-R1, a general reinforcement learning framework that bridges large language models (LLMs) with recommendation systems through closed-loop optimization.

General Reinforcement Learning Instruction Following +1

6,716
0.25 stars / hour

Relevance Isn't All You Need: Scaling RAG Systems With Inference-Time Compute Via Multi-Criteria Reranking

microsoft/rebel 14 Mar 2025

In response, we show evaluations of existing RAG methods which account for both context relevance and answer quality.

All RAG +2

25
0.25 stars / hour

Perception-R1: Pioneering Perception Policy with Reinforcement Learning

linkangheng/pr1 10 Apr 2025

In this work, we return to the fundamentals and explore the effects of RL on different perception tasks.

reinforcement-learning Reinforcement Learning +1

81
0.25 stars / hour

Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster

guinmoon/llmfarm 6 Apr 2023

We study recent research advances that improve large language models through efficient pre-training and scaling, and open datasets and tools.

1,728
0.24 stars / hour

3DGUT: Enabling Distorted Cameras and Secondary Rays in Gaussian Splatting

nv-tlabs/3dgrut 17 Dec 2024

3D Gaussian Splatting (3DGS) enables efficient reconstruction and high-fidelity real-time rendering of complex scenes on consumer hardware.

3DGS

602
0.24 stars / hour

AssistanceZero: Scalably Solving Assistance Games

cassidylaidlaw/minecraft-building-assistance-game 9 Apr 2025

We present the first scalable approach to solving assistance games and apply it to a new, challenging Minecraft-based assistance game with over $10^{400}$ possible goals.

Imitation Learning Minecraft

123
0.24 stars / hour

Docling Technical Report

DS4SD/docling 19 Aug 2024

This technical report introduces Docling, an easy to use, self-contained, MIT-licensed open-source package for PDF document conversion.

27,426
0.24 stars / hour

Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs

stanfordnlp/dspy 17 Jun 2024

To make this tractable, we factorize our problem into optimizing the free-form instructions and few-shot demonstrations of every module and introduce several strategies to craft task-grounded instructions and navigate credit assignment across modules.

Language Modeling Language Modelling +1

23,541
0.24 stars / hour

VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

om-ai-lab/vlm-r1 10 Apr 2025

Motivated by this observation, we investigate the extension of R1-style reinforcement learning to Vision-Language Models (VLMs), aiming to enhance their visual reasoning capabilities.

Language Modeling Language Modelling +7

4,650
0.24 stars / hour

Do Large Language Models Need a Content Delivery Network?

lmcache/lmcache 16 Sep 2024

As the use of large language models (LLMs) expands rapidly, so does the range of knowledge needed to supplement various LLM queries.

In-Context Learning

805
0.24 stars / hour