Search Results for author: Summer Deng

Found 4 papers, 3 papers with code

FBGEMM: Enabling High-Performance Low-Precision Deep Learning Inference

1 code implementation13 Jan 2021 Daya Khudia, Jianyu Huang, Protonu Basu, Summer Deng, Haixin Liu, Jongsoo Park, Mikhail Smelyanskiy

Deep learning models typically use single-precision (FP32) floating point data types for representing activations and weights, but a slew of recent research work has shown that computations with reduced-precision data types (FP16, 16-bit integers, 8-bit integers or even 4- or 2-bit integers) are enough to achieve same accuracy as FP32 and are much more efficient.

Code Generation Quantization +1

Cannot find the paper you are looking for? You can Submit a new open access paper.