Search Results for author: Yeyang Zhou

Found 2 papers, 1 papers with code

R-KV: Redundancy-aware KV Cache Compression for Training-Free Reasoning Models Acceleration

1 code implementation30 May 2025 Zefan Cai, Wen Xiao, Hanshi Sun, Cheng Luo, Yikai Zhang, Ke Wan, Yucheng Li, Yeyang Zhou, Li-Wen Chang, Jiuxiang Gu, Zhen Dong, Anima Anandkumar, Abedelkadir Asi, Junjie Hu

To address this, we propose Redundancy-aware KV Cache Compression for Reasoning models (R-KV), a novel method specifically targeting redundant tokens in reasoning models.

Mathematical Reasoning

Entropy-based Exploration Conduction for Multi-step Reasoning

no code implementations20 Mar 2025 Jinghan Zhang, Xiting Wang, Fengran Mo, Yeyang Zhou, Wanfu Gao, Kunpeng Liu

In large language model (LLM) reasoning, multi-step processes have proven effective for solving complex tasks.

Language Modeling Language Modelling +1

Cannot find the paper you are looking for? You can Submit a new open access paper.