1 code implementation • 10 Oct 2022 • Haw-Shiuan Chang, Ruei-Yao Sun, Kathryn Ricci, Andrew McCallum
Ensembling BERT models often significantly improves accuracy, but at the cost of significantly more computation and memory footprint.