1 code implementation • 14 Nov 2023 • Vishwajit Kumar Vishnu, C. Chandra Sekhar
Training Memory-based transformers can require a large amount of memory and can be quite inefficient.
Ranked #1 on Paraphrase Identification on Quora Question Pairs Dev (Val F1 Score metric)