no code implementations • 19 Oct 2023 • Jianwei Li, Weizhi Gao, Qi Lei, Dongkuan Xu
It is widely acknowledged that large and sparse models have higher accuracy than small and dense models under the same model size constraints.