Search Results for author: Guangtai Huang

HLAT: High-quality Large Language Model Pre-trained on AWS Trainium

In this paper, we showcase HLAT: a 7 billion parameter decoder-only LLM pre-trained using trn1 instances over 1. 8 trillion tokens.

Paper
Add Code

In this paper, we present RAF, a deep learning compiler for training.

134

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.