no code implementations • 16 Apr 2024 • Haozheng Fan, Hao Zhou, Guangtai Huang, Parameswaran Raman, Xinwei Fu, Gaurav Gupta, Dhananjay Ram, Yida Wang, Jun Huan
In this paper, we showcase HLAT: a 7 billion parameter decoder-only LLM pre-trained using trn1 instances over 1.8 trillion tokens.
no code implementations • 30 Nov 2023 • Dan Song, Xinwei Fu, Weizhi Nie, Wenhui Li, Lanjun Wang, You Yang, AnAn Liu
Consequently, this paper aims to improve confidence through view selection and hierarchical prompts.