Search Results for author: Abhinav Bhatele

Found 10 papers, 2 papers with code

Jorge: Approximate Preconditioning for GPU-efficient Second-order Optimization

no code implementations • 18 Oct 2023 • Siddharth Singh, Zachary Sating, Abhinav Bhatele

The primary efficiency bottleneck in such optimizers is matrix inverse calculations in the preconditioning step, which are expensive to compute on GPUs.

Computational Efficiency • Second-order methods
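
The snippet above identifies explicit matrix inversion in the preconditioning step as the GPU bottleneck. As a rough, illustrative sketch only (not the Jorge algorithm; the function name, iteration count, and damping value are assumptions), one common way to sidestep an explicit inverse is a few Newton-Schulz iterations, which use nothing but matrix multiplications and therefore map well onto GPUs:

import torch

def approx_inverse_spd(A: torch.Tensor, num_iters: int = 8) -> torch.Tensor:
    # Approximate the inverse of a symmetric positive-definite matrix with
    # Newton-Schulz iterations: X <- X @ (2I - A @ X). Matmuls only, so the
    # work maps well onto GPUs, unlike an explicit torch.linalg.inv call.
    n = A.shape[0]
    I = torch.eye(n, dtype=A.dtype, device=A.device)
    # Scale the initial guess so that ||I - A @ X0|| < 1 and the iteration converges.
    X = I / A.abs().sum(dim=1).max()
    for _ in range(num_iters):
        X = X @ (2.0 * I - A @ X)
    return X

# Hypothetical usage: precondition a gradient with the approximate inverse of a
# damped gradient-statistics matrix, as a second-order optimizer might.
G = torch.randn(256, 64)                      # per-layer gradient (illustrative shape)
A = G.T @ G + 1e-3 * torch.eye(64)            # SPD statistics matrix with damping
preconditioned = G @ approx_inverse_spd(A)    # avoids torch.linalg.inv(A)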

Modeling Parallel Programs using Large Language Models

no code implementations • 29 Jun 2023 • Daniel Nichols, Aniruddha Marathe, Harshitha Menon, Todd Gamblin, Abhinav Bhatele

In this paper, we show how large language models (LLMs) can be applied to tasks specific to high performance and scientific codes.

Language Modelling

A 4D Hybrid Algorithm to Scale Parallel Training to Thousands of GPUs

no code implementations • 22 May 2023 • Siddharth Singh, Prajwal Singhania, Aditya K. Ranjan, Zack Sating, Abhinav Bhatele

Large communication costs are a critical bottleneck in training state-of-the-art neural networks on distributed systems.

A Hybrid Tensor-Expert-Data Parallelism Approach to Optimize Mixture-of-Experts Training

1 code implementation • 11 Mar 2023 • Siddharth Singh, Olatunji Ruwase, Ammar Ahmad Awan, Samyam Rajbhandari, Yuxiong He, Abhinav Bhatele

Mixture-of-Experts (MoE) is a neural network architecture that adds sparsely activated expert blocks to a base model, increasing the number of parameters without impacting computational costs.
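
Since the snippet describes how an MoE layer works, here is a minimal, illustrative top-1 gated MoE layer (a sketch with assumed class names and sizes, not the paper's implementation): parameter count grows with the number of experts, while each token only runs through a single expert block.

import torch
import torch.nn as nn
import torch.nn.functional as F

class Top1MoELayer(nn.Module):
    # Minimal top-1 gated mixture-of-experts layer: every token is routed to
    # exactly one expert MLP, so parameter count scales with num_experts while
    # per-token compute stays roughly that of a single expert.
    def __init__(self, d_model: int, d_hidden: int, num_experts: int):
        super().__init__()
        self.gate = nn.Linear(d_model, num_experts)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(),
                          nn.Linear(d_hidden, d_model))
            for _ in range(num_experts)
        ])

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (num_tokens, d_model)
        probs = F.softmax(self.gate(x), dim=-1)        # routing probabilities
        weight, expert_idx = probs.max(dim=-1)         # top-1 expert per token
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = expert_idx == e
            if mask.any():
                out[mask] = weight[mask].unsqueeze(1) * expert(x[mask])
        return out

layer = Top1MoELayer(d_model=64, d_hidden=256, num_experts=4)
tokens = torch.randn(32, 64)
print(layer(tokens).shape)   # torch.Size([32, 64])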

Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training

no code implementations • 10 Feb 2023 • Siddharth Singh, Abhinav Bhatele

Parallel training of neural networks at scale is challenging due to significant overheads arising from communication.

A Survey and Empirical Evaluation of Parallel Deep Learning Frameworks

no code implementations • 9 Nov 2021 • Daniel Nichols, Siddharth Singh, Shu-Huai Lin, Abhinav Bhatele

This phenomenon has spurred the development of algorithms for distributed training of neural networks over a larger number of hardware accelerators.

AxoNN: An asynchronous, message-driven parallel framework for extreme-scale deep learning

no code implementations • 25 Oct 2021 • Siddharth Singh, Abhinav Bhatele

This has necessitated the development of efficient algorithms to train these neural networks in parallel on large-scale GPU-based clusters.

Analytics of Longitudinal System Monitoring Data for Performance Prediction

no code implementations • 7 Jul 2020 • Ian J. Costello, Abhinav Bhatele

In recent years, several HPC facilities have started continuous monitoring of their systems and jobs to collect performance-related data for understanding performance and operational efficiency.

Scalable Comparative Visualization of Ensembles of Call Graphs

1 code implementation • 1 Jul 2020 • Suraj P. Kesavan, Harsh Bhatia, Abhinav Bhatele, Todd Gamblin, Peer-Timo Bremer, Kwan-Liu Ma

Optimizing the performance of large-scale parallel codes is critical for efficient utilization of computing resources.

Distributed, Parallel, and Cluster Computing • Performance
