no code implementations • NAACL (GeBNLP) 2022 • Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi
In this work, we contribute to such a line of inquiry by exploring the emergence of gender bias in Speech Translation (ST).
no code implementations • EMNLP 2021 • Marco Gaido, Susana Rodríguez, Matteo Negri, Luisa Bentivogli, Marco Turchi
Automatic translation systems are known to struggle with rare words.
no code implementations • EAMT 2022 • Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Matteo Negri, Marco Turchi
This project aimed at extending the test sets of the MuST-C speech translation (ST) corpus with new reference translations.
no code implementations • 4 Jan 2025 • Tsz Kin Lam, Marco Gaido, Sara Papi, Luisa Bentivogli, Barry Haddow
Following the remarkable success of Large Language Models (LLMs) in NLP tasks, there is increasing interest in extending their capabilities to speech -- the most common form in communication.
no code implementations • 16 Dec 2024 • Beomseok Lee, Marco Gaido, Ioan Calapodescu, Laurent Besacier, Matteo Negri
While crowdsourcing is an established solution for facilitating and scaling the collection of speech data, the involvement of non-experts necessitates protocols to ensure final data quality.
no code implementations • 3 Nov 2024 • Dennis Fucci, Marco Gaido, Beatrice Savoldi, Matteo Negri, Mauro Cettolo, Luisa Bentivogli
Spurred by the demand for interpretable models, research on eXplainable AI for language technologies has experienced significant growth, with feature attribution methods emerging as a cornerstone of this progress.
1 code implementation • 1 Oct 2024 • Marco Gaido, Sara Papi, Luisa Bentivogli, Alessio Brutti, Mauro Cettolo, Roberto Gretter, Marco Matassoni, Mohamed Nabih, Matteo Negri
The rise of foundation models (FMs), coupled with regulatory efforts addressing their risks and impacts, has sparked significant interest in open-source models.
no code implementations • 25 Sep 2024 • Francesco Verdini, Pierfrancesco Melucci, Stefano Perna, Francesco Cariaggi, Marco Gaido, Sara Papi, Szymon Mazurek, Marek Kasztelnik, Luisa Bentivogli, Sébastien Bratières, Paolo Merialdo, Simone Scardapane
The remarkable performance achieved by Large Language Models (LLM) has driven research efforts to leverage them for a wide range of tasks and input modalities.
1 code implementation • 7 Aug 2024 • Beomseok Lee, Ioan Calapodescu, Marco Gaido, Matteo Negri, Laurent Besacier
We present Speech-MASSIVE, a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSIVE textual corpus.
1 code implementation • 20 Jun 2024 • Sara Papi, Marco Gaido, Matteo Negri, Luisa Bentivogli
This paper describes the FBK's participation in the Simultaneous Translation Evaluation Campaign at IWSLT 2024.
1 code implementation • 10 Jun 2024 • Sara Papi, Marco Gaido, Matteo Negri, Luisa Bentivogli
To fill this gap, we introduce StreamAtt, the first StreamST policy, and propose StreamLAAL, the first StreamST latency metric designed to be comparable with existing metrics for SimulST.
2 code implementations • 17 May 2024 • Marco Gaido, Sara Papi, Matteo Negri, Mauro Cettolo, Luisa Bentivogli
Subtitling plays a crucial role in enhancing the accessibility of audiovisual content and encompasses three primary subtasks: translating spoken dialogue, segmenting translations into concise textual units, and estimating timestamps that govern their on-screen duration.
1 code implementation • 20 Feb 2024 • Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli
The attention mechanism, a cornerstone of state-of-the-art neural models, faces computational hurdles in processing long sequences due to its quadratic complexity.
no code implementations • 19 Feb 2024 • Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli
The field of natural language processing (NLP) has recently witnessed a transformative shift with the emergence of foundation models, particularly Large Language Models (LLMs) that have revolutionized text-based NLP.
1 code implementation • 30 Oct 2023 • Beatrice Savoldi, Marco Gaido, Matteo Negri, Luisa Bentivogli
As part of the WMT-2023 "Test suites" shared task, in this paper we summarize the results of two test suites evaluations: MuST-SHE-WMT23 and INES.
1 code implementation • 24 Oct 2023 • Dennis Fucci, Marco Gaido, Sara Papi, Mauro Cettolo, Matteo Negri, Luisa Bentivogli
When translating words referring to the speaker, speech translation (ST) systems should not resort to default masculine generics nor rely on potentially misleading vocal traits.
1 code implementation • 23 Oct 2023 • Marco Gaido, Dennis Fucci, Matteo Negri, Luisa Bentivogli
When translating from notional gender languages (e. g., English) into grammatical gender languages (e. g., Italian), the generated translation requires explicit gender assignments for various words, including those referring to the speaker.
1 code implementation • 10 Oct 2023 • Dennis Fucci, Marco Gaido, Matteo Negri, Mauro Cettolo, Luisa Bentivogli
Automatic speech recognition (ASR) systems are known to be sensitive to the sociolinguistic variability of speech data, in which gender plays a crucial role.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
1 code implementation • 27 Sep 2023 • Sara Papi, Marco Gaido, Matteo Negri
This paper describes the FBK's participation in the Simultaneous Translation and Automatic Subtitling tracks of the IWSLT 2023 Evaluation Campaign.
2 code implementations • 28 Mar 2023 • Sara Papi, Marco Gaido, Andrea Pilzer, Matteo Negri
Despite its crucial role in research experiments, code correctness is often presumed only on the basis of the perceived quality of results.
no code implementations • 24 Mar 2023 • Marco Gaido
To fill this gap, in this paper we introduce the first algorithm that computes the Silhouette metric with linear complexity and can easily execute in parallel in a distributed environment.
1 code implementation • 21 Oct 2022 • Marco Gaido, Sara Papi, Matteo Negri, Marco Turchi
Modern automatic translation systems aim at place the human at the center by providing contextual support and knowledge.
no code implementations • 21 Oct 2022 • Marco Gaido, Yun Tang, Ilia Kulikov, Rongqing Huang, Hongyu Gong, Hirofumi Inaguma
In a sentence, certain words are critical for its semantic.
1 code implementation • 27 Sep 2022 • Sara Papi, Marco Gaido, Alina Karakanta, Mauro Cettolo, Matteo Negri, Marco Turchi
Automatic subtitling is the task of automatically translating the speech of audiovisual content into short pieces of timed text, i. e. subtitles and their corresponding timestamps.
1 code implementation • NAACL (AutoSimTrans) 2022 • Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi
Simultaneous speech translation (SimulST) systems aim at generating their output with the lowest possible latency, which is normally computed in terms of Average Lagging (AL).
1 code implementation • IWSLT (ACL) 2022 • Marco Gaido, Matteo Negri, Marco Turchi
Recent work has shown that systems for speech translation (ST) -- similarly to automatic speech recognition (ASR) -- poorly handle person names.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
1 code implementation • IWSLT (ACL) 2022 • Marco Gaido, Sara Papi, Dennis Fucci, Giuseppe Fiameni, Matteo Negri, Marco Turchi
The primary goal of this FBK's systems submission to the IWSLT 2022 offline and simultaneous speech translation tasks is to reduce model training costs without sacrificing translation quality.
1 code implementation • 8 Apr 2022 • Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi
In simultaneous speech translation (SimulST), finding the best trade-off between high translation quality and low latency is a challenging task.
1 code implementation • ACL 2022 • Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi
Gender bias is largely recognized as a problematic phenomenon affecting language technologies, with recent studies underscoring that it might surface differently across languages.
1 code implementation • 15 Sep 2021 • Marco Gaido, Susana Rodríguez, Matteo Negri, Luisa Bentivogli, Marco Turchi
Automatic translation systems are known to struggle with rare words.
1 code implementation • EMNLP 2021 • Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi
Transformer-based models have gained increasing popularity achieving state-of-the-art performance in many research fields including speech translation.
Ranked #1 on Speech-to-Text Translation on MuST-C EN->NL
1 code implementation • ACL (IWSLT) 2021 • Alina Karakanta, Marco Gaido, Matteo Negri, Marco Turchi
Speech translation (ST) has lately received growing interest for the generation of subtitles without the need for an intermediate source language transcription and timing (i. e. captions).
no code implementations • ACL (IWSLT) 2021 • Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi
Both knowledge distillation and the first fine-tuning step are carried out on manually segmented real and synthetic data, the latter being generated with an MT system trained on the available corpora.
no code implementations • ACL 2021 • Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Alberto Martinelli, Matteo Negri, Marco Turchi
Five years after the first published proofs of concept, direct approaches to speech translation (ST) are now competing with traditional cascade solutions.
1 code implementation • Findings (ACL) 2021 • Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi
In light of this finding, we propose a combined approach that preserves BPE overall translation quality, while leveraging the higher ability of character-based segmentation to properly translate gender.
no code implementations • ICNLSP 2021 • Marco Gaido, Matteo Negri, Mauro Cettolo, Marco Turchi
The audio segmentation mismatch between training data and those seen at run-time is a major problem in direct speech translation.
1 code implementation • 13 Apr 2021 • Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi
Machine translation (MT) technology has facilitated our daily tasks by providing accessible shortcuts for gathering, elaborating and communicating information.
1 code implementation • EACL 2021 • Marco Gaido, Mauro Cettolo, Matteo Negri, Marco Turchi
Previous studies demonstrated that a dynamic phone-informed compression of the input audio is beneficial for speech translation (ST).
no code implementations • COLING 2020 • Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi
In particular, by translating speech audio data without intermediate transcription, direct ST models are able to leverage and preserve essential information present in the input (e. g. speaker's vocal characteristics) that is otherwise lost in the cascade framework.
1 code implementation • 9 Dec 2020 • Marco Gaido, Mattia A. Di Gangi, Matteo Negri, Marco Turchi
Direct speech translation (ST) has shown to be a complex task requiring knowledge transfer from its sub-tasks: automatic speech recognition (ASR) and machine translation (MT).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
no code implementations • AMTA 2020 • Mattia Antonino Di Gangi, Marco Gaido, Matteo Negri, Marco Turchi
Then, subword-level segmentation became the state of the art in neural machine translation as it produces shorter sequences that reduce the training time, while being superior to word-level models.
1 code implementation • 5 Aug 2020 • Marco Gaido, Mattia Antonino Di Gangi, Matteo Negri, Mauro Cettolo, Marco Turchi
We show that our context-aware solution is more robust to VAD-segmented input, outperforming a strong base model and the fine-tuning on different VAD segmentations of an English-German test set by up to 4. 25 BLEU points.
no code implementations • WS 2020 • Marco Gaido, Mattia Antonino Di Gangi, Matteo Negri, Marco Turchi
The test talks are provided in two versions: one contains the data already segmented with automatic tools and the other is the raw data without any segmentation.