Search Results for author: Zachary Yedidia

Found 1 paper, 0 papers with code

Quantized Neural Network Inference with Precision Batching

No code implementations • 26 Feb 2020 • Maximilian Lam, Zachary Yedidia, Colby Banbury, Vijay Janapa Reddi

We present PrecisionBatching, a quantized inference algorithm for speeding up neural network execution on traditional hardware platforms at low bitwidths without the need for retraining or recalibration.

Language Modelling, Natural Language Inference +1
