no code implementations • 20 Aug 2024 • Guanchen Li, Xiandong Zhao, Lian Liu, Zeping Li, Dong Li, Lu Tian, Jie He, Ashish Sirasao, Emad Barsoum
Next, we reconstruct a dense model featuring a pruning-friendly weight distribution by reactivating pruned connections with sparse regularization.
no code implementations • ICLR 2020 • Xiandong Zhao, Ying Wang, Xuyi Cai, Cheng Liu, Lei Zhang
With the proliferation of specialized neural network processors that operate on low-precision integers, the performance of Deep Neural Network inference becomes increasingly dependent on the result of quantization.