Search Results for author: Jerry Quinn

Found 3 papers, 1 papers with code

Zero-Shot Dynamic Quantization for Transformer Inference

4 code implementations • 17 Nov 2022 • Yousef El-Kurdi, Jerry Quinn, Avirup Sil

We introduce a novel run-time method for significantly reducing the accuracy loss associated with quantizing BERT-like models to 8-bit integers.

Quantization

Paper
Code

Optimal Mini-Batch Size Selection for Fast Gradient Descent

no code implementations • 15 Nov 2019 • Michael P. Perrone, Haidar Khan, Changhoan Kim, Anastasios Kyrillidis, Jerry Quinn, Valentina Salapura

This paper presents a methodology for selecting the mini-batch size that minimizes Stochastic Gradient Descent (SGD) learning time for single and multiple learner problems.

Machine Translation Translation

Paper
Add Code

Pieces of Eight: 8-bit Neural Machine Translation

no code implementations • NAACL 2018 • Jerry Quinn, Miguel Ballesteros

Neural machine translation has achieved levels of fluency and adequacy that would have been surprising a short time ago.

Machine Translation Quantization +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.