# Automatic Pruning for Quantized Neural Networks

3 Feb 2020Luis GuerraBohan ZhuangIan ReidTom Drummond

Neural network quantization and pruning are two techniques commonly used to reduce the computational complexity and memory footprint of these models for deployment. However, most existing pruning strategies operate on full-precision and cannot be directly applied to discrete parameter distributions after quantization... (read more)

PDF Abstract