Search Results for author: Matthew D. Sinclair

Found 4 papers, 1 papers with code

T3: Transparent Tracking & Triggering for Fine-grained Overlap of Compute & Collectives

no code implementations30 Jan 2024 Suchita Pati, Shaizeen Aga, Mahzabeen Islam, Nuwan Jayasena, Matthew D. Sinclair

One approach to hide this serialized communication is to interleave it with the producer operation (of the communicated data) in a fine-grained manner.

Demystifying BERT: Implications for Accelerator Design

no code implementations14 Apr 2021 Suchita Pati, Shaizeen Aga, Nuwan Jayasena, Matthew D. Sinclair

Further, we also identify heterogeneity in compute-intensive BERT computations and discuss software and possible hardware mechanisms to further optimize these computations.

Transfer Learning

Analyzing Machine Learning Workloads Using a Detailed GPU Simulator

13 code implementations18 Nov 2018 Jonathan Lew, Deval Shah, Suchita Pati, Shaylin Cattell, Mengchi Zhang, Amruth Sandhupatla, Christopher Ng, Negar Goli, Matthew D. Sinclair, Timothy G. Rogers, Tor Aamodt

Most deep neural networks deployed today are trained using GPUs via high-level frameworks such as TensorFlow and PyTorch.

Distributed, Parallel, and Cluster Computing

Cannot find the paper you are looking for? You can Submit a new open access paper.