Search Results for author: o

Found 58 papers, 2 papers with code

TRANSLIT: A Large-scale Name Transliteration Resource

no code implementations LREC 2020 Fern Benites, o, Gilbert Fran{\c{c}}ois Duivesteijn, Pius von D{\"a}niken, Mark Cieliebak

Transliteration is the process of expressing a proper name from a source language in the characters of a target language (e. g. from Cyrillic to Latin characters).

Transliteration

Eco.pangeamt: Industrializing Neural MT

no code implementations LREC 2020 Mercedes Garc{\'\i}a-Mart{\'\i}nez, Manuel Herranz, Am Estela, o, {\'A}ngela Franco, Laurent Bi{\'e}

Eco is Pangeanic{'}s customer portal for generic or specialized translation services (machine translation and post-editing, generic API MT and custom API MT).

Machine Translation Translation

Editing OntoLex-Lemon in VocBench 3

no code implementations LREC 2020 Manuel Fiorelli, Arm Stellato, o, Tiziano Lorenzetti, Andrea Turbati, Peter Schmitz, Enrico Francesconi, Najeh Hajlaoui, Brahim Batouche

OntoLex-Lemon is a collection of RDF vocabularies for specifying the verbalization of ontologies in natural language.

A Lime-Flavored REST API for Alignment Services

no code implementations LREC 2020 Manuel Fiorelli, Arm Stellato, o

A practical alignment service should be flexible enough to handle the varied alignment scenarios that arise in the real world, while minimizing the need for manual configuration.

Data-Driven Sentence Simplification: Survey and Benchmark

no code implementations CL 2020 Fern Alva-Manchego, o, Carolina Scarton, Lucia Specia

Sentence Simplification (SS) aims to modify a sentence in order to make it easier to read and understand.

Team SVMrank: Leveraging Feature-rich Support Vector Machines for Ranking Explanations to Elementary Science Questions

no code implementations WS 2019 Jennifer D{'}Souza, Isaiah On Mulang{'}, o, S{\"o}ren Auer

The TextGraphs 2019 Shared Task on Multi-Hop Inference for Explanation Regeneration (MIER-19) tackles explanation generation for answers to elementary science questions.

Learning-To-Rank Question Answering

Contractions: To Align or Not to Align, That Is the Question

no code implementations COLING 2018 Anabela Barreiro, Fern Batista, o

On the other hand, contractions tend to be maintained when they occur in the beginning or in the middle of the multiword unit, i. e., in the frozen part of the multiword (e. g., [no que diz respeito a] | [with regard to] or [al{\'e}m disso] [in addition].

Machine Translation Translation

Attention-based Semantic Priming for Slot-filling

no code implementations WS 2018 Jiewen Wu, Rafael E. Banchs, Luis Fern D{'}Haro, o, Pavitra Krishnaswamy, Nancy Chen

The problem of sequence labelling in language understanding would benefit from approaches inspired by semantic priming phenomena.

Language understanding Slot Filling +2

Learning How to Simplify From Explicit Labeling of Complex-Simplified Text Pairs

1 code implementation IJCNLP 2017 Fern Alva-Manchego, o, Joachim Bingel, Gustavo Paetzold, Carolina Scarton, Lucia Specia

Current research in text simplification has been hampered by two central problems: (i) the small amount of high-quality parallel simplification data available, and (ii) the lack of explicit annotations of simplification operations, such as deletions or substitutions, on existing data.

Machine Translation Sentence Compression +1

MASSAlign: Alignment and Annotation of Comparable Documents

no code implementations IJCNLP 2017 Gustavo Paetzold, Fern Alva-Manchego, o, Lucia Specia

We introduce MASSAlign: a Python library for the alignment and annotation of monolingual comparable documents.

Opinion Mining in Social Networks versus Electoral Polls

no code implementations RANLP 2017 Javi Fern{\'a}ndez, Fern Llopis, o, Yoan Guti{\'e}rrez, Patricio Mart{\'\i}nez-Barco, {\'A}lvaro D{\'\i}ez

The recent failures of traditional poll models, like the predictions in United Kingdom with the Brexit, or in United States presidential elections with the victory of Donald Trump, have been noteworthy.

Opinion Mining

ADoCS: Automatic Designer of Conference Schedules

no code implementations EACL 2017 Diego Fern Vallejo Huanga, o, Paulina Adriana Morillo Alc{\'\i}var, C{\`e}sar Ferri Ram{\'\i}rez

Distributing papers into sessions in scientific conferences is a task consisting in grouping papers with common topics and considering the size restrictions imposed by the conference schedule.

Information Retrieval Tokenization

The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation

no code implementations LREC 2016 Jorge Proen{\c{c}}a, Dirce Celorico, C, Sara eias, Carla Lopes, Fern Perdig{\~a}o, o

The motivation for the creation of this corpus stems from the inexistence of databases with recordings of reading tasks of Portuguese children with different performance levels and including all the common reading aloud disfluencies.

Plato: A Selective Context Model for Entity Resolution

no code implementations TACL 2015 Nevena Lazic, Amarnag Subramanya, Michael Ringgaard, Fern Pereira, o

We present Plato, a probabilistic model for entity resolution that includes a novel approach for handling noisy or uninformative features, and supplements labeled training data derived from Wikipedia with a very large unlabeled text corpus.

Entity Linking Entity Resolution

HESITA(te) in Portuguese

no code implementations LREC 2014 C, Sara eias, Dirce Celorico, Jorge Proen{\c{c}}a, Arlindo Veiga, Carla Lopes, Fern Perdig{\~a}o, o

Hesitations, so-called disfluencies, are a characteristic of spontaneous speech, playing a primary role in its structure, reflecting aspects of the language production and the management of inter-communication.

Acoustic Modelling automatic-speech-recognition +2

The Gulf of Guinea Creole Corpora

no code implementations LREC 2014 Tjerk Hagemeijer, Michel G{\'e}n{\'e}reux, Iris Hendrickx, Am{\'a}lia Mendes, Abigail Tiny, Arm Zamora, o

We present the process of building linguistic corpora of the Portuguese-related Gulf of Guinea creoles, a cluster of four historically related languages: Santome, Angolar, Principense and Fa dÂ’Amb{\^o}.

Revising the annotation of a Broadcast News corpus: a linguistic approach

no code implementations LREC 2014 Vera Cabarr{\~a}o, Helena Moniz, Fern Batista, o, Ricardo Ribeiro, Nuno Mamede, Hugo Meinedo, Isabel Trancoso, Ana Isabel Mata, David Martins de Matos

This paper presents a linguistic revision process of a speech corpus of Portuguese broadcast news focusing on metadata annotation for rich transcription, and reports on the impact of the new data on the performance for several modules.

Speech Recognition

Prosodic, syntactic, semantic guidelines for topic structures across domains and corpora

no code implementations LREC 2014 Ana Isabel Mata, Helena Moniz, Telmo M{\'o}ia, Anabela Gon{\c{c}}alves, F{\'a}tima Silva, Fern Batista, o, In{\^e}s Duarte, F{\'a}tima Oliveira, Isabel Fal{\'e}

This paper presents the annotation guidelines applied to naturally occurring speech, aiming at an integrated account of contrast and parallel structures in European Portuguese.

Part-Of-Speech Tagging

A Meta-data Driven Platform for Semi-automatic Configuration of Ontology Mediators

no code implementations LREC 2014 Manuel Fiorelli, Maria Teresa Pazienza, Arm Stellato, o

Ontology mediators often demand extensive configuration, or even the adaptation of the input ontologies for remedying unsupported modeling patterns.

OpenLogos Semantico-Syntactic Knowledge-Rich Bilingual Dictionaries

no code implementations LREC 2014 Anabela Barreiro, Fern Batista, o, Ricardo Ribeiro, Helena Moniz, Isabel Trancoso

This paper presents 3 sets of OpenLogos resources, namely the English-German, the English-French, and the English-Italian bilingual dictionaries.

Machine Translation Translation

Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate

no code implementations LREC 2014 Anabela Barreiro, Johanna Monti, Brigitte Orliac, Susanne Preu{\ss}, Kutz Arrieta, Wang Ling, Fern Batista, o, Isabel Trancoso

This paper presents a systematic human evaluation of translations of English support verb constructions produced by a rule-based machine translation (RBMT) system (OpenLogos) and a statistical machine translation (SMT) system (Google Translate) for five languages: French, German, Italian, Portuguese and Spanish.

Machine Translation Translation

Teenage and adult speech in school context: building and processing a corpus of European Portuguese

no code implementations LREC 2014 Ana Isabel Mata, Helena Moniz, Fern Batista, o, Julia Hirschberg

We present a corpus of European Portuguese spoken by teenagers and adults in school context, CPE-FACES, with an overview of the differential characteristics of high school oral presentations and the challenges this data poses to automatic speech processing.

The acquisition and dialog act labeling of the EDECAN-SPORTS corpus

no code implementations LREC 2012 Llu{\'\i}s-F. Hurtado, Fern Garc{\'\i}a, o, Emilio Sanchis, Encarna Segarra

In this paper, we present the acquisition and labeling processes of the EDECAN-SPORTS corpus, which is a corpus that is oriented to the development of multimodal dialog systems acquired in Spanish and Catalan.

Speech Recognition

PEARL: ProjEction of Annotations Rule Language, a Language for Projecting (UIMA) Annotations over RDF Knowledge Bases

no code implementations LREC 2012 Maria Teresa Pazienza, Arm Stellato, o, Andrea Turbati

In this paper we present a language, PEARL, for projecting annotations based on the Unstructured Information Management Architecture (UIMA) over RDF triples.

Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation

no code implementations LREC 2012 Maria Teresa Pazienza, Noemi Scarpato, Arm Stellato, o

Submission of this graph is however not trivial as while a list of keywords of interest can be provided by any user, the formulation of semantic queries is not easy as well.

Information Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.