1 code implementation • 4 Jun 2023 • Daniel Rotem, Michael Hassid, Jonathan Mamou, Roy Schwartz
Adaptive inference is a simple method for reducing inference costs.
1 code implementation • 7 Nov 2022 • Michael Hassid, Hao Peng, Daniel Rotem, Jungo Kasai, Ivan Montero, Noah A. Smith, Roy Schwartz
Our results motivate research on simpler alternatives to input-dependent attention, as well as on methods for better utilization of this mechanism in the Transformer architecture.