no code implementations • LREC 2020 • Sara Grilo, M{\'a}rcia Bolrinha, Jo{\~a}o Silva, Rui Vaz, Ant{\'o}nio Branco
This paper presents the BDCam{\~o}es Collection of Portuguese Literary Documents, a new corpus of literary texts written in Portuguese that in its inaugural version includes close to 4 million words from over 200 complete documents from 83 authors in 14 genres, covering a time span from the 16th to the 21st century, and adhering to different orthographic conventions.
no code implementations • LREC 2020 • Ant{\'o}nio Branco, Sara Grilo, M{\'a}rcia Bolrinha, Chakaveh Saedi, Ruben Branco, Jo{\~a}o Silva, Andreia Querido, Rita de Carvalho, Rosa Gaudio, Mariana Avel{\~a}s, Clara Pinto
The objective of the present paper is twofold, to present the MWN. PT WordNet and to report on its construction and on the lessons learned with it.