no code implementations • EAMT 2022 • José G.C. de Souza, Ricardo Rei, Ana C. Farinha, Helena Moniz, André F. T. Martins
This paper presents QUARTZ, QUality-AwaRe machine Translation, a project led by Unbabel which aims at developing machine translation systems that are more robust and produce fewer critical errors.
no code implementations • EAMT 2022 • Anabela Barreiro, José GC de Souza, Albert Gatt, Mehul Bhatt, Elena Lloret, Aykut Erdem, Dimitra Gkatzia, Helena Moniz, Irene Russo, Fabio Kepler, Iacer Calixto, Marcin Paprzycki, François Portet, Isabelle Augenstein, Mirela Alhasani
This paper presents the Multitask, Multilingual, Multimodal Language Generation COST Action – Multi3Generation (CA18231), an interdisciplinary network of research groups working on different aspects of language generation.
no code implementations • EAMT 2020 • André F. T. Martins, Joao Graca, Paulo Dimas, Helena Moniz, Graham Neubig
This paper presents the Multilingual Artificial Intelligence Agent Assistant (MAIA), a project led by Unbabel with the collaboration of CMU, INESC-ID and IT Lisbon.
no code implementations • EAMT 2022 • Madalena Gonçalves, Marianna Buchicchio, Craig Stewart, Helena Moniz, Alon Lavie
This paper illustrates a new evaluation framework developed at Unbabel for measuring the quality of source language text and its effect on both Machine Translation (MT) and Human Post-Edition (PE) performed by non-professional post-editors.
1 code implementation • 23 Nov 2023 • John Mendonça, Patrícia Pereira, Miguel Menezes, Vera Cabarrão, Ana C. Farinha, Helena Moniz, João Paulo Carvalho, Alon Lavie, Isabel Trancoso
Task-oriented conversational datasets often lack topic variability and linguistic diversity.
no code implementations • 8 Sep 2023 • Patrícia Pereira, Rui Ribeiro, Helena Moniz, Luisa Coheur, Joao Paulo Carvalho
Fuzzy Fingerprints have been successfully used as an interpretable text classification technique, but, like most other techniques, have been largely surpassed in performance by Large Pre-trained Language Models, such as BERT or RoBERTa.
1 code implementation • 31 Aug 2023 • John Mendonça, Patrícia Pereira, Helena Moniz, João Paulo Carvalho, Alon Lavie, Isabel Trancoso
Despite significant research effort in the development of automatic dialogue evaluation metrics, little thought is given to evaluating dialogues other than in English.
1 code implementation • 17 Apr 2023 • Patrícia Pereira, Helena Moniz, Isabel Dias, Joao Paulo Carvalho
The usual approach to model the conversational context has been to produce context-independent representations of each utterance and subsequently perform contextual modeling of these.
Ranked #1 on Emotion Recognition in Conversation on EmoWoz (Macro F1 metric)
no code implementations • 20 Dec 2022 • Mariana Julião, Alberto Abad, Helena Moniz
Of all components of Prosody, Rhythm has been regarded as the hardest to address, as it is utterly linked to Pitch and Intensity.
no code implementations • 16 Nov 2022 • Patrícia Pereira, Helena Moniz, Joao Paulo Carvalho
This is followed by descriptions of the most prominent works in ERC with explanations of the Deep Learning architectures employed.
1 code implementation • 25 Feb 2021 • Rita Parada Ramos, Patrícia Pereira, Helena Moniz, Joao Paulo Carvalho, Bruno Martins
Despite the use of large training datasets, most models are trained by iterating over single input-output pairs, discarding the remaining examples for the current prediction.
no code implementations • LREC 2016 • Fern Batista, o, Pedro Curto, Isabel Trancoso, Alberto Abad, Jaime Ferreira, Eug{\'e}nio Ribeiro, Helena Moniz, David Martins de Matos, Ricardo Ribeiro
This paper presents SPA, a web-based Speech Analytics platform that integrates several speech processing modules and that makes it possible to use them through the web.
no code implementations • LREC 2016 • Jos{\'e} Lopes, Arodami Chorianopoulou, Elisavet Palogiannidi, Helena Moniz, Alberto Abad, Katerina Louka, Elias Iosif, Alex Potamianos, ros
The SpeDial consortium is sharing two datasets that were used during the SpeDial project.
no code implementations • 31 Mar 2015 • António Lopes, David Martins de Matos, Vera Cabarrão, Ricardo Ribeiro, Helena Moniz, Isabel Trancoso, Ana Isabel Mata
Discourse markers are universal linguistic events subject to language variation.
no code implementations • LREC 2014 • Ana Isabel Mata, Helena Moniz, Telmo M{\'o}ia, Anabela Gon{\c{c}}alves, F{\'a}tima Silva, Fern Batista, o, In{\^e}s Duarte, F{\'a}tima Oliveira, Isabel Fal{\'e}
This paper presents the annotation guidelines applied to naturally occurring speech, aiming at an integrated account of contrast and parallel structures in European Portuguese.
no code implementations • LREC 2014 • Ana Isabel Mata, Helena Moniz, Fern Batista, o, Julia Hirschberg
We present a corpus of European Portuguese spoken by teenagers and adults in school context, CPE-FACES, with an overview of the differential characteristics of high school oral presentations and the challenges this data poses to automatic speech processing.
no code implementations • LREC 2014 • Anabela Barreiro, Fern Batista, o, Ricardo Ribeiro, Helena Moniz, Isabel Trancoso
This paper presents 3 sets of OpenLogos resources, namely the English-German, the English-French, and the English-Italian bilingual dictionaries.
no code implementations • LREC 2014 • Vera Cabarr{\~a}o, Helena Moniz, Fern Batista, o, Ricardo Ribeiro, Nuno Mamede, Hugo Meinedo, Isabel Trancoso, Ana Isabel Mata, David Martins de Matos
This paper presents a linguistic revision process of a speech corpus of Portuguese broadcast news focusing on metadata annotation for rich transcription, and reports on the impact of the new data on the performance for several modules.