Search Results for author: Zhuowei Huang

Found 1 paper, 0 papers with code

Sequence can Secretly Tell You What to Discard

No code implementations • 24 Apr 2024 • Jincheng Dai, Zhuowei Huang, Haiyun Jiang, Chen Chen, Deng Cai, Wei Bi, Shuming Shi

Large Language Models (LLMs), despite their impressive performance on a wide range of tasks, require significant GPU memory and consume substantial computational resources.
