Composed by 2.7 billion tokens, and has been annotated with tagging and parsing information.

Source: The brWaC Corpus: A New Open Resource for Brazilian Portuguese

Papers


Paper Code Results Date Stars

Tasks


Similar Datasets


License


  • Unknown

Modalities


Languages