Search Results for author: Shiyun Wei

Found 2 papers, 0 papers with code

EdgeMoE: Fast On-Device Inference of MoE-based Large Language Models

no code implementations • 28 Aug 2023 • Rongjie Yi, Liwei Guo, Shiyun Wei, Ao Zhou, Shangguang Wang, Mengwei Xu

Large Language Models (LLMs) such as GPTs and LLaMa have ushered in a revolution in machine intelligence, owing to their exceptional capabilities in a wide range of machine learning tasks.

Computational Efficiency
