Search Results for author: Rahul Huilgol

Found 1 papers, 0 papers with code

Amazon SageMaker Model Parallelism: A General and Flexible Framework for Large Model Training

no code implementations10 Nov 2021 Can Karakus, Rahul Huilgol, Fei Wu, Anirudh Subramanian, Cade Daniel, Derya Cavdar, Teng Xu, Haohan Chen, Arash Rahnama, Luis Quintela

In contrast to existing solutions, the implementation of the SageMaker library is much more generic and flexible, in that it can automatically partition and run pipeline parallelism over arbitrary model architectures with minimal code change, and also offers a general and extensible framework for tensor parallelism, which supports a wider range of use cases, and is modular enough to be easily applied to new training scripts.

Collaborative Filtering

Cannot find the paper you are looking for? You can Submit a new open access paper.