Search Results for author: Yueming Chen

Found 1 papers, 1 papers with code

Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding

2 code implementations • 10 Apr 2024 • Jie Ou, Yueming Chen, Wenhong Tian

While Large Language Models (LLMs) have shown remarkable abilities, they are hindered by significant resource consumption and considerable latency due to autoregressive processing.

Language Modelling Large Language Model

1,087

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.