Search Results for author: Yueming Chen

Found 1 papers, 1 papers with code

Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding

2 code implementations10 Apr 2024 Jie Ou, Yueming Chen, Wenhong Tian

While Large Language Models (LLMs) have shown remarkable abilities, they are hindered by significant resource consumption and considerable latency due to autoregressive processing.

Language Modelling Large Language Model

Cannot find the paper you are looking for? You can Submit a new open access paper.