no code implementations • 15 May 2019 • Simeon E. Spasov, Pietro Lio
Existing methods for reducing the computational burden of neural networks at run-time, such as parameter pruning or dynamic computational path selection, focus solely on improving computational efficiency during inference.