no code implementations • 13 Mar 2024 • Jianlin Chen
Since the breakthrough of ChatGPT, large language models (LLMs) have garnered significant attention in the research community.
1 code implementation • 30 Nov 2022 • Rui Pan, Shizhe Diao, Jianlin Chen, Tong Zhang
In this paper, we present ExtremeBERT, a toolkit for accelerating and customizing BERT pretraining.