Search Results for author: Zhuorui Liu

Found 1 paper, 0 papers with code

Beyond the Speculative Game: A Survey of Speculative Execution in Large Language Models

no code implementations | 23 Apr 2024 | Chen Zhang, Zhuorui Liu, Dawei Song

The bottleneck is mainly due to the autoregressive innateness of LLMs, where tokens can only be generated sequentially during decoding.
