Search Results for author: Shree Thatte

Found 1 paper, 1 paper with code

Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale

1 code implementation • 26 May 2023 • Vijeta Deshpande, Dan Pechi, Shree Thatte, Vladislav Lialin, Anna Rumshisky

The majority of recent scaling-law studies have focused on high-compute, high-parameter-count settings, leaving the question of when these abilities begin to emerge largely unanswered.

Language Modelling • Masked Language Modeling
