Search Results for author: Steve Dai

Found 3 papers, 0 papers with code

Optimal Clipping and Magnitude-aware Differentiation for Improved Quantization-aware Training

no code implementations • 13 Jun 2022 • Charbel Sakr, Steve Dai, Rangharajan Venkatesan, Brian Zimmer, William J. Dally, Brucek Khailany

Data clipping is crucial in reducing noise in quantization operations and improving the achievable accuracy of quantization-aware training (QAT).

Quantization

VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference

no code implementations • 8 Feb 2021 • Steve Dai, Rangharajan Venkatesan, Haoxing Ren, Brian Zimmer, William J. Dally, Brucek Khailany

4-bit weights and 8-bit activations achieve near-full-precision accuracy for both BERT-base and BERT-large on SQuAD while reducing area by 26% compared to an 8-bit baseline.

Math, Quantization
