Efficient Bitwidth Search for Practical Mixed Precision Neural Network

17 Mar 2020 · Yuhang Li, Wei Wang, Haoli Bai, Ruihao Gong, Xin Dong, Fengwei Yu

Network quantization has rapidly become one of the most widely used methods to compress and accelerate deep neural networks. Recent efforts propose to quantize weights and activations from different layers with different precision to improve the overall performance...
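As a rough illustration of what quantizing different layers with different precision can look like, the sketch below applies symmetric uniform quantization to a few weight tensors, each with its own bitwidth. It is not the paper's bitwidth search method. The function name `quantize_uniform`, the layer names, and the bitwidth assignment are all hypothetical, chosen only for the example.

```python
import numpy as np

def quantize_uniform(x, bits):
    """Symmetric uniform quantization to a signed `bits`-bit grid.

    Returns "fake-quantized" floats: values snapped to the grid but
    kept in floating point. This is an illustrative scheme only.
    """
    qmax = 2 ** (bits - 1) - 1          # e.g. 127 for 8 bits, 1 for 2 bits
    max_abs = np.max(np.abs(x))
    if max_abs == 0:
        return x.copy()                 # nothing to scale
    scale = max_abs / qmax              # step size of the grid
    q = np.clip(np.round(x / scale), -qmax, qmax)
    return q * scale

# Hypothetical per-layer bitwidth assignment (assumed, not from the paper).
layer_bits = {"conv1": 8, "conv2": 4, "fc": 2}

rng = np.random.default_rng(0)
weights = {name: rng.standard_normal((16, 16)) for name in layer_bits}

# Quantize each layer with its own precision.
quantized = {
    name: quantize_uniform(w, layer_bits[name]) for name, w in weights.items()
}

# Mean squared quantization error per layer.
errors = {
    name: float(np.mean((weights[name] - quantized[name]) ** 2))
    for name in layer_bits
}

for name in layer_bits:
    print(name, layer_bits[name], "bits, MSE:", errors[name])
```

In this toy setup, lower bitwidths leave fewer representable levels and therefore a larger quantization error, which is the trade-off a mixed-precision scheme has to balance per layer.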
