Search Results for author: Jack Kosaian

Found 5 papers, 2 papers with code

A Study on the Intersection of GPU Utilization and CNN Inference

no code implementations15 Dec 2022 Jack Kosaian, Amar Phanishayee

Achieving high GPU utilization is critical to increasing application-level throughput and ensuring a good return on investment for deploying GPUs.

Neural Architecture Search

Arithmetic-Intensity-Guided Fault Tolerance for Neural Network Inference on GPUs

1 code implementation19 Apr 2021 Jack Kosaian, K. V. Rashmi

Algorithm-based fault tolerance (ABFT) is emerging as an efficient approach for fault tolerance in NNs.

ECRM: Efficient Fault Tolerance for Recommendation Model Training via Erasure Coding

no code implementations5 Apr 2021 Kaige Liu, Jack Kosaian, K. V. Rashmi

We present ECRM, a DLRM training system that achieves efficient fault tolerance using erasure coding.

Parity Models: A General Framework for Coding-Based Resilience in ML Inference

no code implementations2 May 2019 Jack Kosaian, K. V. Rashmi, Shivaram Venkataraman

In order to scale to high query rates, prediction serving systems are run on many machines in cluster settings, and thus are prone to slowdowns and failures that inflate tail latency and cause violations of strict latency targets.

BIG-bench Machine Learning Image Classification +3

Learning a Code: Machine Learning for Approximate Non-Linear Coded Computation

3 code implementations4 Jun 2018 Jack Kosaian, K. V. Rashmi, Shivaram Venkataraman

To the best of our knowledge, this work proposes the first learning-based approach for designing codes, and also presents the first coding-theoretic solution that can provide resilience for any non-linear (differentiable) computation.

BIG-bench Machine Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.