no code implementations • ICLR 2018 • Jan Achterhold, Jan Mathias Koehler, Anke Schmeink, Tim Genewein
In this paper, the preparation of a neural network for pruning and few-bit quantization is formulated as a variational inference problem.
Quantization Variational Inference