no code implementations • ICLR 2020 • Xiandong Zhao, Ying Wang, Xuyi Cai, Cheng Liu, Lei Zhang
With the proliferation of specialized neural network processors that operate on low-precision integers, the performance of Deep Neural Network inference becomes increasingly dependent on the result of quantization.