no code implementations • 22 Apr 2024 • Bailey J. Eccles, Leon Wong, Blesson Varghese
We develop Reconvene, a system for rapidly generating pruned models suited for edge deployments using structured pruning at initialization (PaI).
1 code implementation • 13 Sep 2023 • Bailey J. Eccles, Philip Rodgers, Peter Kilpatrick, Ivor Spence, Blesson Varghese
Compared to sparse models, the pruned model variants are up to 5.14x smaller and have a 1.67x inference latency speedup, with no compromise to sparse model accuracy.