no code implementations • LREC 2022 • John Mendonca, Rui Correia, Mariana Lourenço, João Freitas, Isabel Trancoso
Crowdsourcing the collection of speech provides a scalable setting to access a customisable demographic according to each dataset’s needs.
1 code implementation • SIGDIAL (ACL) 2022 • John Mendonca, Alon Lavie, Isabel Trancoso
Despite considerable advances in open-domain neural dialogue systems, their evaluation remains a bottleneck.
no code implementations • 2 Mar 2025 • Ajinkya Kulkarni, Atharva Kulkarni, Miguel Couceiro, Isabel Trancoso
In this paper, we present a bias and sustainability focused investigation of Automatic Speech Recognition (ASR) systems, namely Whisper and Massively Multilingual Speech (MMS), which have achieved state-of-the-art (SOTA) performances.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
no code implementations • 30 Dec 2024 • Catarina Botelho, David Gimeno-Gómez, Francisco Teixeira, John Mendonça, Patrícia Pereira, Diogo A. P. Nunes, Thomas Rolland, Anna Pompili, Rubén Solera-Ureña, Maria Ponte, David Martins de Matos, Carlos-D. Martínez-Hinarejos, Isabel Trancoso, Alberto Abad
This work describes our group's submission to the PROCESS Challenge 2024, with the goal of assessing cognitive decline through spontaneous speech, using three guided clinical tasks.
no code implementations • 16 Sep 2024 • Catarina Botelho, Alberto Abad, Tanja Schultz, Isabel Trancoso
The proposed framework for reference speech characterization and disease detection is designed to support the medical community by providing clinically meaningful explanations that can serve as a valuable second opinion.
1 code implementation • 20 Aug 2024 • John Mendonça, Isabel Trancoso, Alon Lavie
Although human evaluation remains the gold standard for open-domain dialogue evaluation, the growing popularity of automated evaluation using Large Language Models (LLMs) has also extended to dialogue.
1 code implementation • 16 Jul 2024 • John Mendonça, Isabel Trancoso, Alon Lavie
Motivated by the need for lightweight, open source, and multilingual dialogue evaluators, this paper introduces GenResCoh (Generated Responses targeting Coherence).
no code implementations • 4 Jul 2024 • John Mendonça, Alon Lavie, Isabel Trancoso
Large Language Models (LLMs) have showcased remarkable capabilities in various Natural Language Processing tasks.
no code implementations • 2 May 2024 • Francisco Teixeira, Karla Pizzi, Raphael Olivier, Alberto Abad, Bhiksha Raj, Isabel Trancoso
Membership Inference (MI) poses a substantial privacy threat to the training data of Automatic Speech Recognition (ASR) systems, while also offering an opportunity to audit these models with regard to user data.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
1 code implementation • 23 Nov 2023 • John Mendonça, Patrícia Pereira, Miguel Menezes, Vera Cabarrão, Ana C. Farinha, Helena Moniz, João Paulo Carvalho, Alon Lavie, Isabel Trancoso
Task-oriented conversational datasets often lack topic variability and linguistic diversity.
no code implementations • 10 Oct 2023 • Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso
Speaker embeddings are ubiquitous, with applications ranging from speaker recognition and diarization to speech synthesis and voice anonymisation.
1 code implementation • 31 Aug 2023 • John Mendonça, Patrícia Pereira, Helena Moniz, João Paulo Carvalho, Alon Lavie, Isabel Trancoso
Despite significant research effort in the development of automatic dialogue evaluation metrics, little thought is given to evaluating dialogues other than in English.
1 code implementation • 31 Aug 2023 • John Mendonça, Alon Lavie, Isabel Trancoso
The main limiting factor in the development of robust multilingual dialogue evaluation metrics is the lack of multilingual data and the limited availability of open sourced multilingual dialogue systems.
no code implementations • 26 Oct 2022 • Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso
Automatic Speaker Diarization (ASD) is an enabling technology with numerous applications, which deals with recordings of multiple speakers, raising special concerns in terms of privacy.
no code implementations • 23 Jun 2022 • Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso
This poses two important issues: first, knowledge of the speaker embedding extraction model may create security and robustness liabilities for the authentication system, as this knowledge might help attackers in crafting adversarial examples able to mislead the system; second, from the point of view of a service provider the speaker embedding extraction model is arguably one of the most valuable components in the system and, as such, disclosing it would be highly undesirable.
no code implementations • 30 Jun 2021 • John Mendonça, Rubén Solera-Ureña, Alberto Abad, Isabel Trancoso
Experimental results demonstrate that models trained on features extracted from self-supervised models perform similarly or outperform fully-supervised models and models based on handcrafted features.
2 code implementations • 17 Nov 2020 • Ali Shahin Shamsabadi, Francisco Sepúlveda Teixeira, Alberto Abad, Bhiksha Raj, Andrea Cavallaro, Isabel Trancoso
Speaker identification models are vulnerable to carefully designed adversarial perturbations of their input signals that induce misclassification.
no code implementations • LREC 2020 • Joana Correia, Isabel Trancoso, Bhiksha Raj
The automation of the diagnosis and monitoring of speech affecting diseases in real life situations, such as Depression or Parkinson{'}s disease, depends on the existence of rich and large datasets that resemble real life conditions, such as those collected from in-the-wild multimedia repositories like YouTube.
no code implementations • 2 Mar 2020 • Catarina Botelho, Francisco Teixeira, Thomas Rolland, Alberto Abad, Isabel Trancoso
We test our approach against knowledge-based features and i-vectors, and report results for two European Portuguese corpora, for OSA and PD, as well as for an additional Spanish corpus for PD.
no code implementations • 18 Jan 2017 • Eugénio Ribeiro, Fernando Batista, Isabel Trancoso, José Lopes, Ricardo Ribeiro, David Martins de Matos
Identifying the level of expertise of its users is important for a system since it can lead to a better interaction through adaptation techniques.
no code implementations • LREC 2016 • Fern Batista, o, Pedro Curto, Isabel Trancoso, Alberto Abad, Jaime Ferreira, Eug{\'e}nio Ribeiro, Helena Moniz, David Martins de Matos, Ricardo Ribeiro
This paper presents SPA, a web-based Speech Analytics platform that integrates several speech processing modules and that makes it possible to use them through the web.
no code implementations • 14 Nov 2015 • Wang Ling, Isabel Trancoso, Chris Dyer, Alan W. black
We introduce a neural machine translation model that views the input and output sentences as sequences of characters rather than words.
1 code implementation • EMNLP 2015 • Wang Ling, Tiago Luís, Luís Marujo, Ramón Fernandez Astudillo, Silvio Amir, Chris Dyer, Alan W. black, Isabel Trancoso
We introduce a model for constructing vector representations of words by composing characters using bidirectional LSTMs.
Ranked #4 on
Part-Of-Speech Tagging
on Penn Treebank
no code implementations • 6 Aug 2015 • Luís Marujo, José Portêlo, Wang Ling, David Martins de Matos, João P. Neto, Anatole Gershman, Jaime Carbonell, Isabel Trancoso, Bhiksha Raj
State-of-the-art extractive multi-document summarization systems are usually designed without any concern about privacy issues, meaning that all documents are open to third parties.
no code implementations • 31 Mar 2015 • António Lopes, David Martins de Matos, Vera Cabarrão, Ricardo Ribeiro, Helena Moniz, Isabel Trancoso, Ana Isabel Mata
Discourse markers are universal linguistic events subject to language variation.
no code implementations • 17 Jun 2014 • Pedro Girão Antunes, David Martins de Matos, Ricardo Ribeiro, Isabel Trancoso
In late 2011, Fado was elevated to the oral and intangible heritage of humanity by UNESCO.
no code implementations • LREC 2014 • Anabela Barreiro, Fern Batista, o, Ricardo Ribeiro, Helena Moniz, Isabel Trancoso
This paper presents 3 sets of OpenLogos resources, namely the English-German, the English-French, and the English-Italian bilingual dictionaries.
no code implementations • LREC 2014 • Anabela Barreiro, Johanna Monti, Brigitte Orliac, Susanne Preu{\ss}, Kutz Arrieta, Wang Ling, Fern Batista, o, Isabel Trancoso
This paper presents a systematic human evaluation of translations of English support verb constructions produced by a rule-based machine translation (RBMT) system (OpenLogos) and a statistical machine translation (SMT) system (Google Translate) for five languages: French, German, Italian, Portuguese and Spanish.
no code implementations • LREC 2014 • Vera Cabarr{\~a}o, Helena Moniz, Fern Batista, o, Ricardo Ribeiro, Nuno Mamede, Hugo Meinedo, Isabel Trancoso, Ana Isabel Mata, David Martins de Matos
This paper presents a linguistic revision process of a speech corpus of Portuguese broadcast news focusing on metadata annotation for rich transcription, and reports on the impact of the new data on the performance for several modules.
no code implementations • LREC 2012 • Jo{\~a}o Silva, Lu{\'\i}sa Coheur, {\^A}ngela Costa, Isabel Trancoso
In Statistical Machine Translation, words that were not seen during training are unknown words, that is, words that the system will not know how to translate.