Search Results for author: Yimin Jiang

Found 1 papers, 0 papers with code

Adaptive Gating in Mixture-of-Experts based Language Models

no code implementations11 Oct 2023 Jiamin Li, Qiang Su, Yitao Yang, Yimin Jiang, Cong Wang, Hong Xu

Existing MoE model adopts a fixed gating network where each token is computed by the same number of experts.

Cannot find the paper you are looking for? You can Submit a new open access paper.