Search Results for author: Niladrish Chatterjee

Found 3 papers, 0 papers with code

GPU Domain Specialization via Composable On-Package Architecture

no code implementations • 5 Apr 2021 • Yaosheng Fu, Evgeny Bolotin, Niladrish Chatterjee, David Nellans, Stephen W. Keckler

As GPUs scale their low precision matrix math throughput to boost deep learning (DL) performance, they upset the balance between math throughput and memory system capabilities.

Math

Paper
Add Code

DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis

no code implementations • 2 Apr 2019 • Sangkug Lym, Donghyuk Lee, Mike O'Connor, Niladrish Chatterjee, Mattan Erez

Training convolutional neural networks (CNNs) requires intense compute throughput and high memory bandwidth.

Paper
Add Code

Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks

no code implementations • 3 May 2017 • Minsoo Rhu, Mike O'Connor, Niladrish Chatterjee, Jeff Pool, Stephen W. Keckler

Popular deep learning frameworks require users to fine-tune their memory usage so that the training data of a deep neural network (DNN) fits within the GPU physical memory.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.