no code implementations • 18 Dec 2019 • Tianyu Zhang, Lei Zhu, Qian Zhao, Kilho Shin
Quantization of weights of deep neural networks (DNN) has proven to be an effective solution for the purpose of implementing DNNs on edge devices such as mobiles, ASICs and FPGAs, because they have no sufficient resources to support computation involving millions of high precision weights and multiply-accumulate operations.