Search Results for author: Joseph Viviano

Found 2 papers, 1 papers with code

What's in the Box? An Analysis of Undesirable Content in the Common Crawl Corpus

no code implementations ACL 2021 Alexandra Luccioni, Joseph Viviano

Whereas much of the success of the current generation of neural language models has been driven by increasingly large training corpora, relatively little research has been dedicated to analyzing these massive sources of textual data.

Cannot find the paper you are looking for? You can Submit a new open access paper.