no code implementations • 19 Jan 2022 • Shien Zhu, Luan H. K. Duong, Hui Chen, Di Liu, Weichen Liu
Quantization is applied to reduce the latency and storage cost of CNNs.
no code implementations • 18 May 2020 • Fuyuan Lyu, Shien Zhu, Weichen Liu
However, these filter-wise quantification methods exist a natural upper limit, caused by the size of the kernel.