Low-bit Quantization of Neural Networks for Efficient Inference

18 Feb 2019 · Yoni Choukroun, Eli Kravchik, Fan Yang, Pavel Kisilev

Recent machine learning methods use increasingly large deep neural networks to achieve state-of-the-art results in various tasks. The gains in performance come at the cost of a substantial increase in computation and storage requirements...


