Search Results for author: João Silva

Found 20 papers, 2 papers with code

The BDCamões Collection of Portuguese Literary Documents: a Research Resource for Digital Humanities and Language Technology

no code implementations LREC 2020 Sara Grilo, Márcia Bolrinha, João Silva, Rui Vaz, António Branco

This paper presents the BDCamões Collection of Portuguese Literary Documents, a new corpus of literary texts written in Portuguese that in its inaugural version includes close to 4 million words from over 200 complete documents from 83 authors in 14 genres, covering a time span from the 16th to the 21st century, and adhering to different orthographic conventions.

Genre classification

Reproduction and Revival of the Argument Reasoning Comprehension Task

no code implementations LREC 2020 João António Rodrigues, Ruben Branco, João Silva, António Branco

Given a recent publication that pointed out spurious statistical cues in the data set used in the shared task, and that produced a revised version of it, we also evaluated the reproduced systems with this new data set.

Infrastructure for the Science and Technology of Language PORTULAN CLARIN

no code implementations LREC 2020 António Branco, Amália Mendes, Paulo Quaresma, Luís Gomes, João Silva, Andrea Teixeira

This paper presents the PORTULAN CLARIN Research Infrastructure for the Science and Technology of Language, which is part of the European research infrastructure CLARIN ERIC as its Portuguese national node, and belongs to the Portuguese National Roadmap of Research Infrastructures of Strategic Relevance.

Predicting Brain Activation with WordNet Embeddings

1 code implementation WS 2018 João António Rodrigues, Ruben Branco, João Silva, Chakaveh Saedi, António Branco

The task of taking a semantic representation of a noun and predicting the brain activity triggered by it in terms of fMRI spatial patterns was pioneered by Mitchell et al. 2008.

Word Embeddings

Ways of Asking and Replying in Duplicate Question Detection

no code implementations SEMEVAL 2017 João António Rodrigues, Chakaveh Saedi, Vladislav Maraev, João Silva, António Branco

This paper presents the results of systematic experimentation on the impact in duplicate question detection of different types of questions across both a number of established approaches and a novel, superior one used to address this language processing task.

Machine Translation, Question Answering +1

QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages

no code implementations LREC 2016 Arantxa Otegi, Nora Aranberri, Antonio Branco, Jan Hajič, Martin Popel, Kiril Simov, Eneko Agirre, Petya Osenova, Rita Pereira, João Silva, Steven Neale

This work presents parallel corpora automatically annotated with several NLP tools, including lemma and part-of-speech tagging, named-entity recognition and classification, named-entity disambiguation, word-sense disambiguation, and coreference.

Cross-Lingual Transfer, Entity Disambiguation +9

A PropBank for Portuguese: the CINTIL-PropBank

no code implementations LREC 2012 António Branco, Catarina Carvalheiro, Sílvia Pereira, Sara Silveira, João Silva, Sérgio Castro, João Graça

With the CINTIL-International Corpus of Portuguese, an ongoing corpus annotated with fully fledged grammatical representation, sentences get not only a high level of lexical, morphological and syntactic annotation but also a semantic analysis that prepares the data for a manual specification step and thus opens the way for a number of tools and resources for which there is a great research focus at present.

Dealing with unknown words in statistical machine translation

no code implementations LREC 2012 João Silva, Luísa Coheur, Ângela Costa, Isabel Trancoso

In Statistical Machine Translation, words that were not seen during training are unknown words, that is, words that the system will not know how to translate.

Translation, Transliteration
