Search Results for author: Kieran Bicheno

Found 1 paper, 1 paper with code

Datasheet for the Pile

2 code implementations · 13 Jan 2022 · Stella Biderman, Kieran Bicheno, Leo Gao

This datasheet describes the Pile, an 825 GiB dataset of human-authored text compiled by EleutherAI for use in large-scale language modeling.

Language Modelling
