Search Results for author: Aakash Sharma

Found 2 papers, 0 papers with code

GPU Cluster Scheduling for Network-Sensitive Deep Learning

no code implementations29 Jan 2024 Aakash Sharma, Vivek M. Bhasi, Sonali Singh, George Kesidis, Mahmut T. Kandemir, Chita R. Das

We propose a novel GPU-cluster scheduler for distributed DL (DDL) workloads that enables proximity based consolidation of GPU resources based on the DDL jobs' sensitivities to the anticipated communication-network delays.

Scheduling

Analysis of Distributed Deep Learning in the Cloud

no code implementations30 Aug 2022 Aakash Sharma, Vivek M. Bhasi, Sonali Singh, Rishabh Jain, Jashwant Raj Gunasekaran, Subrata Mitra, Mahmut Taylan Kandemir, George Kesidis, Chita R. Das

We aim to resolve this problem by introducing a comprehensive distributed deep learning (DDL) profiler, which can determine the various execution "stalls" that DDL suffers from while running on a public cloud.

Cannot find the paper you are looking for? You can Submit a new open access paper.