Fast Adjustable Threshold For Uniform Neural Network Quantization (Winning solution of LPIRC-II)

19 Dec 2018Alexander GoncharenkoAndrey DenisovSergey AlyamkinEvgeny Terentev

Neural network quantization procedure is the necessary step for porting of neural networks to mobile devices. Quantization allows accelerating the inference, reducing memory consumption and model size... (read more)

PDF Abstract

Evaluation results from the paper


  Submit results from this paper to get state-of-the-art GitHub badges and help community compare results to other papers.