Search Results for author: Pedro Ortiz Suarez

Found 4 papers, 0 papers with code

From FreEM to D'AlemBERT: a Large Corpus and a Language Model for Early Modern French

no code implementations18 Feb 2022 Simon Gabay, Pedro Ortiz Suarez, Alexandre Bartz, Alix Chagué, Rachel Bawden, Philippe Gambette, Benoît Sagot

Because these historical states are at the same time more complex to process and more scarce in the corpora available, specific efforts are necessary to train natural language processing (NLP) tools adapted to the data.

Language Modelling Part-Of-Speech Tagging +1

Towards a Cleaner Document-Oriented Multilingual Crawled Corpus

no code implementations17 Jan 2022 Julien Abadji, Pedro Ortiz Suarez, Laurent Romary, Benoît Sagot

The need for raw large raw corpora has dramatically increased in recent years with the introduction of transfer learning and semi-supervised learning methods to Natural Language Processing.

Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.