Search Results for author: Shu Yan

Found 2 papers, 1 papers with code

How Far Are We from Optimal Reasoning Efficiency?

1 code implementation8 Jun 2025 Jiaxuan Gao, Shu Yan, Qixin Tan, Lu Yang, Shusheng Xu, Wei Fu, Zhiyu Mei, Kaifeng Lyu, Yi Wu

To reduce the efficiency gap, we propose REO-RL, a class of Reinforcement Learning algorithms that minimizes REG by targeting a sparse set of token budgets.

16k Benchmarking +1

Cannot find the paper you are looking for? You can Submit a new open access paper.