Search Results for author: Charles Foster

Found 2 papers, 1 papers with code

The Pile: An 800GB Dataset of Diverse Text for Language Modeling

20 code implementations31 Dec 2020 Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, Charles Foster, Jason Phang, Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, Connor Leahy

Recent work has demonstrated that increased training dataset diversity improves general cross-domain knowledge and downstream generalization capability for large-scale language models.

Language Modelling

Sampled Image Tagging and Retrieval Methods on User Generated Content

no code implementations21 Nov 2016 Karl Ni, Kyle Zaragoza, Charles Foster, Carmen Carrano, Barry Chen, Yonas Tesfaye, Alex Gude

Specifically, we train a deep learning image tagging and retrieval system on large scale, user generated content (UGC) using sampling methods and joint optimization of word embeddings.

Retrieval TAG +1

Cannot find the paper you are looking for? You can Submit a new open access paper.