Search Results for author: Jianfei Wang

Found 1 papers, 0 papers with code

Scaling Laws for Speculative Decoding

no code implementations8 May 2025 Siyuan Yan, Mo Zhu, Guo-qing Jiang, Jianfei Wang, Jiaxing Chen, Wentai Zhang, Xiang Liao, Xiao Cui, Chen Zhang, Zhuoran Song, Ran Zhu

The escalating demand for efficient decoding in large language models (LLMs) is particularly critical for reasoning-intensive architectures like OpenAI-o3 and DeepSeek-R1, which depend on extended chain-of-thought reasoning.

Cannot find the paper you are looking for? You can Submit a new open access paper.