1 code implementation • 28 Feb 2024 • Simran Arora, Sabri Eyuboglu, Michael Zhang, Aman Timalsina, Silas Alberti, Dylan Zinsley, James Zou, Atri Rudra, Christopher Ré
In this work, we explore whether we can improve language model efficiency (e. g. by reducing memory consumption) without compromising on recall.