no code implementations • 30 Oct 2022 • Alexander Maloney, Daniel A. Roberts, James Sully
Large language models with a huge number of parameters, when trained on a near-internet-sized number of tokens, have been empirically shown to obey neural scaling laws: specifically, their performance behaves predictably as a power law in either parameters or dataset size until bottlenecked by the other resource.
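To make the power-law-until-bottlenecked behavior concrete, here is a minimal sketch assuming a Chinchilla-style parametric form L(N, D) = E + A·N^(-α) + B·D^(-β); the function name `scaling_law_loss` and all constants are illustrative placeholders, not values or a model taken from this paper.

```python
# Illustrative sketch of a neural scaling law in the Chinchilla-style form
#   L(N, D) = E + A * N**(-alpha) + B * D**(-beta)
# All constants are hypothetical placeholders, not fitted values from the paper.

def scaling_law_loss(n_params: float, n_tokens: float,
                     E: float = 1.7, A: float = 400.0, B: float = 410.0,
                     alpha: float = 0.34, beta: float = 0.28) -> float:
    """Predicted loss as a function of parameter count N and dataset size D (tokens)."""
    return E + A * n_params ** (-alpha) + B * n_tokens ** (-beta)

# Growing parameters at a fixed dataset size: loss falls as a power law in N
# until the data term B * D**(-beta) dominates, i.e. until the model is
# bottlenecked by the other resource (data).
fixed_tokens = 1e9
for n in [1e6, 1e7, 1e8, 1e9, 1e10]:
    print(f"N={n:.0e}, D={fixed_tokens:.0e} -> loss={scaling_law_loss(n, fixed_tokens):.3f}")
```

Running the loop shows the loss improvements shrinking as N grows while D is held fixed, which is the "bottlenecked by the other resource" regime described in the abstract.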