1 code implementation • 24 Feb 2024 • Ziqian Zeng, Jiahong Yu, Qianshi Pang, ZiHao Wang, Huiping Zhuang, HongEn Shao, Xiaofeng Zou
Within this framework, we introduce a lightweight draft model that effectively utilizes previously generated tokens to predict subsequent words.