Search Results for author: Yuliang Yan

Found 3 papers, 1 paper with code

SlimGPT: Layer-wise Structured Pruning for Large Language Models

no code implementations • 24 Dec 2024 • Gui Ling, Ziyang Wang, Yuliang Yan, Qingwen Liu

Large language models (LLMs) have garnered significant attention for their remarkable capabilities across various domains, but their vast parameter scales present challenges for practical deployment.
