1 code implementation • 8 Nov 2023 • Haim Barad, Ekaterina Aidova, Yury Gorbachev
Inference optimizations are critical for improving user experience and reducing infrastructure costs and power consumption.
2 code implementations • 20 Feb 2020 • Alexander Kozlov, Ivan Lazarevich, Vasily Shamporov, Nikolay Lyalyushkin, Yury Gorbachev
In this work we present a new framework for neural networks compression with fine-tuning, which we called Neural Network Compression Framework (NNCF).
Ranked #3 on Binarization on ImageNet