no code implementations • LREC 2020 • Malgorzata Anna Ulasik, Manuela H{\"u}rlimann, Fabian Germann, Esin Gedik, Fern Benites, o, Mark Cieliebak
In this paper, we present CEASR, a Corpus for Evaluating the quality of Automatic Speech Recognition (ASR).
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
no code implementations • LREC 2020 • Manuel Fiorelli, Arm Stellato, o
A practical alignment service should be flexible enough to handle the varied alignment scenarios that arise in the real world, while minimizing the need for manual configuration.
no code implementations • LREC 2020 • Fern Benites, o, Gilbert Fran{\c{c}}ois Duivesteijn, Pius von D{\"a}niken, Mark Cieliebak
Transliteration is the process of expressing a proper name from a source language in the characters of a target language (e. g. from Cyrillic to Latin characters).
no code implementations • LREC 2020 • Mercedes Garc{\'\i}a-Mart{\'\i}nez, Manuel Herranz, Am Estela, o, {\'A}ngela Franco, Laurent Bi{\'e}
Eco is Pangeanic{'}s customer portal for generic or specialized translation services (machine translation and post-editing, generic API MT and custom API MT).
no code implementations • LREC 2020 • Manuel Fiorelli, Arm Stellato, o, Tiziano Lorenzetti, Andrea Turbati, Peter Schmitz, Enrico Francesconi, Najeh Hajlaoui, Brahim Batouche
OntoLex-Lemon is a collection of RDF vocabularies for specifying the verbalization of ontologies in natural language.
no code implementations • CL 2020 • Fern Alva-Manchego, o, Carolina Scarton, Lucia Specia
Sentence Simplification (SS) aims to modify a sentence in order to make it easier to read and understand.
no code implementations • WS 2019 • Jennifer D{'}Souza, Isaiah On Mulang{'}, o, S{\"o}ren Auer
The TextGraphs 2019 Shared Task on Multi-Hop Inference for Explanation Regeneration (MIER-19) tackles explanation generation for answers to elementary science questions.
no code implementations • WS 2019 • Fern Alva-Manchego, o, Carolina Scarton, Lucia Specia
Current approaches to Text Simplification focus on simplifying sentences individually.
no code implementations • WS 2019 • Fern Benites, o, Pius von D{\"a}niken, Mark Cieliebak
The goal was to identify dialects of Swiss German in GDI and Sumerian and Akkadian in CLI.
no code implementations • COLING 2018 • Anabela Barreiro, Fern Batista, o
On the other hand, contractions tend to be maintained when they occur in the beginning or in the middle of the multiword unit, i. e., in the frozen part of the multiword (e. g., [no que diz respeito a] | [with regard to] or [al{\'e}m disso] [in addition].
no code implementations • COLING 2018 • Fern Benites, o, Ralf Grubenmann, Pius von D{\"a}niken, Dirk von Gr{\"u}nigen, Jan Deriu, Mark Cieliebak
We describe our approaches used in the German Dialect Identification (GDI) task at the VarDial Evaluation Campaign 2018.
no code implementations • WS 2018 • Jiewen Wu, Rafael E. Banchs, Luis Fern D{'}Haro, o, Pavitra Krishnaswamy, Nancy Chen
The problem of sequence labelling in language understanding would benefit from approaches inspired by semantic priming phenomena.
no code implementations • SEMEVAL 2018 • Delia Iraz{\'u} Hern{\'a}ndez Far{\'\i}as, Fern S{\'a}nchez-Vega, o, Manuel Montes-y-G{\'o}mez, Paolo Rosso
This paper describes an ensemble approach to the SemEval-2018 Task 3.
1 code implementation • IJCNLP 2017 • Fern Alva-Manchego, o, Joachim Bingel, Gustavo Paetzold, Carolina Scarton, Lucia Specia
Current research in text simplification has been hampered by two central problems: (i) the small amount of high-quality parallel simplification data available, and (ii) the lack of explicit annotations of simplification operations, such as deletions or substitutions, on existing data.
Ranked #8 on
Text Simplification
on PWKP / WikiSmall
(SARI metric)
no code implementations • IJCNLP 2017 • Gustavo Paetzold, Fern Alva-Manchego, o, Lucia Specia
We introduce MASSAlign: a Python library for the alignment and annotation of monolingual comparable documents.
no code implementations • RANLP 2017 • Javi Fern{\'a}ndez, Fern Llopis, o, Yoan Guti{\'e}rrez, Patricio Mart{\'\i}nez-Barco, {\'A}lvaro D{\'\i}ez
The recent failures of traditional poll models, like the predictions in United Kingdom with the Brexit, or in United States presidential elections with the victory of Donald Trump, have been noteworthy.
no code implementations • EACL 2017 • Diego Fern Vallejo Huanga, o, Paulina Adriana Morillo Alc{\'\i}var, C{\`e}sar Ferri Ram{\'\i}rez
Distributing papers into sessions in scientific conferences is a task consisting in grouping papers with common topics and considering the size restrictions imposed by the conference schedule.
1 code implementation • LREC 2016 • Andre Quispesaravia, Walter Perez, Marco Sobrevilla Cabezudo, Fern Alva-Manchego, o
Text Complexity Analysis is an useful task in Education.
no code implementations • LREC 2016 • Jorge Proen{\c{c}}a, Dirce Celorico, C, Sara eias, Carla Lopes, Fern Perdig{\~a}o, o
The motivation for the creation of this corpus stems from the inexistence of databases with recordings of reading tasks of Portuguese children with different performance levels and including all the common reading aloud disfluencies.
no code implementations • LREC 2016 • Fern Batista, o, Pedro Curto, Isabel Trancoso, Alberto Abad, Jaime Ferreira, Eug{\'e}nio Ribeiro, Helena Moniz, David Martins de Matos, Ricardo Ribeiro
This paper presents SPA, a web-based Speech Analytics platform that integrates several speech processing modules and that makes it possible to use them through the web.
no code implementations • WS 2015 • Roque L{\'o}pez, Thiago Pardo, Lucas Avan{\c{c}}o, Pedro Filho, Aless Bokan, ro, Paula Cardoso, M{\'a}rcio Dias, Fern N{\'o}brega, o, Marco Cabezudo, Jackson Souza, Andressa Zacarias, Eloize Seno, Ariani Di Felippo
no code implementations • TACL 2015 • Nevena Lazic, Amarnag Subramanya, Michael Ringgaard, Fern Pereira, o
We present Plato, a probabilistic model for entity resolution that includes a novel approach for handling noisy or uninformative features, and supplements labeled training data derived from Wikipedia with a very large unlabeled text corpus.
no code implementations • SEMEVAL 2014 • Yenier Casta{\~n}eda, Arm Collazo, o, Elvis Crego, Jorge L. Garcia, Yoan Guti{\'e}rrez, David Tom{\'a}s, Andr{\'e}s Montoyo, Rafael Mu{\~n}oz
Aspect-Based Sentiment Analysis
Aspect-Based Sentiment Analysis (ABSA)
+2
no code implementations • LREC 2014 • Anabela Barreiro, Fern Batista, o, Ricardo Ribeiro, Helena Moniz, Isabel Trancoso
This paper presents 3 sets of OpenLogos resources, namely the English-German, the English-French, and the English-Italian bilingual dictionaries.
no code implementations • LREC 2014 • Ana Isabel Mata, Helena Moniz, Telmo M{\'o}ia, Anabela Gon{\c{c}}alves, F{\'a}tima Silva, Fern Batista, o, In{\^e}s Duarte, F{\'a}tima Oliveira, Isabel Fal{\'e}
This paper presents the annotation guidelines applied to naturally occurring speech, aiming at an integrated account of contrast and parallel structures in European Portuguese.
no code implementations • LREC 2014 • Manuel Fiorelli, Maria Teresa Pazienza, Arm Stellato, o
Ontology mediators often demand extensive configuration, or even the adaptation of the input ontologies for remedying unsupported modeling patterns.
no code implementations • LREC 2014 • Vera Cabarr{\~a}o, Helena Moniz, Fern Batista, o, Ricardo Ribeiro, Nuno Mamede, Hugo Meinedo, Isabel Trancoso, Ana Isabel Mata, David Martins de Matos
This paper presents a linguistic revision process of a speech corpus of Portuguese broadcast news focusing on metadata annotation for rich transcription, and reports on the impact of the new data on the performance for several modules.
no code implementations • LREC 2014 • Ana Isabel Mata, Helena Moniz, Fern Batista, o, Julia Hirschberg
We present a corpus of European Portuguese spoken by teenagers and adults in school context, CPE-FACES, with an overview of the differential characteristics of high school oral presentations and the challenges this data poses to automatic speech processing.
no code implementations • LREC 2014 • C, Sara eias, Dirce Celorico, Jorge Proen{\c{c}}a, Arlindo Veiga, Carla Lopes, Fern Perdig{\~a}o, o
Hesitations, so-called disfluencies, are a characteristic of spontaneous speech, playing a primary role in its structure, reflecting aspects of the language production and the management of inter-communication.
no code implementations • LREC 2014 • Anabela Barreiro, Johanna Monti, Brigitte Orliac, Susanne Preu{\ss}, Kutz Arrieta, Wang Ling, Fern Batista, o, Isabel Trancoso
This paper presents a systematic human evaluation of translations of English support verb constructions produced by a rule-based machine translation (RBMT) system (OpenLogos) and a statistical machine translation (SMT) system (Google Translate) for five languages: French, German, Italian, Portuguese and Spanish.
no code implementations • LREC 2014 • Tjerk Hagemeijer, Michel G{\'e}n{\'e}reux, Iris Hendrickx, Am{\'a}lia Mendes, Abigail Tiny, Arm Zamora, o
We present the process of building linguistic corpora of the Portuguese-related Gulf of Guinea creoles, a cluster of four historically related languages: Santome, Angolar, Principense and Fa dÂ’Amb{\^o}.
no code implementations • SEMEVAL 2013 • Arm Collazo, o, Alberto Ceballo, Dennys D. Puig, Yoan Guti{\'e}rrez, Jos{\'e} I. Abreu, Roger P{\'e}rez, Antonio Fern{\'a}ndez Orqu{\'\i}n, Andr{\'e}s Montoyo, Rafael Mu{\~n}oz, Franc Camara
no code implementations • LREC 2012 • Maria Teresa Pazienza, Noemi Scarpato, Arm Stellato, o
Submission of this graph is however not trivial as while a list of keywords of interest can be provided by any user, the formulation of semantic queries is not easy as well.
no code implementations • LREC 2012 • Maria Teresa Pazienza, Arm Stellato, o, Andrea Turbati
In this paper we present a language, PEARL, for projecting annotations based on the Unstructured Information Management Architecture (UIMA) over RDF triples.
no code implementations • LREC 2012 • Fern Castilho, o, Roger Granada, Breno Meneghetti, Leonardo Carvalho, Renata Vieira
For these seeds similar terms are extracted from the corpus using known thesaurus generation methods.
no code implementations • LREC 2012 • Llu{\'\i}s-F. Hurtado, Fern Garc{\'\i}a, o, Emilio Sanchis, Encarna Segarra
In this paper, we present the acquisition and labeling processes of the EDECAN-SPORTS corpus, which is a corpus that is oriented to the development of multimodal dialog systems acquired in Spanish and Catalan.