Search Results for author: Pei Mu

Found 1 papers, 0 papers with code

WaferLLM: A Wafer-Scale LLM Inference System

no code implementations6 Feb 2025 Congjie He, Yeqi Huang, Pei Mu, Ziming Miao, Jilong Xue, Lingxiao Ma, Fan Yang, Luo Mai

Leveraging this model, WaferLLM pioneers wafer-scale LLM parallelism, optimizing the utilization of hundreds of thousands of on-chip cores.

Cannot find the paper you are looking for? You can Submit a new open access paper.