Search Results for author: Marco Gaido

Found 34 papers, 21 papers with code

Does Simultaneous Speech Translation need Simultaneous Models?

1 code implementation • 8 Apr 2022 • Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi

In simultaneous speech translation (SimulST), finding the best trade-off between high translation quality and low latency is a challenging task.

Translation

Paper
Code

Efficient yet Competitive Speech Translation: FBK@IWSLT2022

1 code implementation • IWSLT (ACL) 2022 • Marco Gaido, Sara Papi, Dennis Fucci, Giuseppe Fiameni, Matteo Negri, Marco Turchi

The primary goal of this FBK's systems submission to the IWSLT 2022 offline and simultaneous speech translation tasks is to reduce model training costs without sacrificing translation quality.

Sentence Translation

Paper
Code

Who Are We Talking About? Handling Person Names in Speech Translation

1 code implementation • IWSLT (ACL) 2022 • Marco Gaido, Matteo Negri, Marco Turchi

Recent work has shown that systems for speech translation (ST) -- similarly to automatic speech recognition (ASR) -- poorly handle person names.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Code

Over-Generation Cannot Be Rewarded: Length-Adaptive Average Lagging for Simultaneous Speech Translation

1 code implementation • NAACL (AutoSimTrans) 2022 • Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi

Simultaneous speech translation (SimulST) systems aim at generating their output with the lowest possible latency, which is normally computed in terms of Average Lagging (AL).

Translation

Paper
Code

Direct Speech Translation for Automatic Subtitling

1 code implementation • 27 Sep 2022 • Sara Papi, Marco Gaido, Alina Karakanta, Mauro Cettolo, Matteo Negri, Marco Turchi

Automatic subtitling is the task of automatically translating the speech of audiovisual content into short pieces of timed text, i. e. subtitles and their corresponding timestamps.

Translation

Paper
Code

Joint Speech Translation and Named Entity Recognition

1 code implementation • 21 Oct 2022 • Marco Gaido, Sara Papi, Matteo Negri, Marco Turchi

Modern automatic translation systems aim at place the human at the center by providing contextual support and knowledge.

Computational Efficiency Entity Linking +4

Paper
Code

Direct Models for Simultaneous Translation and Automatic Subtitling: FBK@IWSLT2023

1 code implementation • 27 Sep 2023 • Sara Papi, Marco Gaido, Matteo Negri

This paper describes the FBK's participation in the Simultaneous Translation and Automatic Subtitling tracks of the IWSLT 2023 Evaluation Campaign.

Translation

Paper
Code

No Pitch Left Behind: Addressing Gender Unbalance in Automatic Speech Recognition through Pitch Manipulation

1 code implementation • 10 Oct 2023 • Dennis Fucci, Marco Gaido, Matteo Negri, Mauro Cettolo, Luisa Bentivogli

Automatic speech recognition (ASR) systems are known to be sensitive to the sociolinguistic variability of speech data, in which gender plays a crucial role.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Code

How To Build Competitive Multi-gender Speech Translation Models For Controlling Speaker Gender Translation

1 code implementation • 23 Oct 2023 • Marco Gaido, Dennis Fucci, Matteo Negri, Luisa Bentivogli

When translating from notional gender languages (e. g., English) into grammatical gender languages (e. g., Italian), the generated translation requires explicit gender assignments for various words, including those referring to the speaker.

Sentence Translation

Paper
Code

Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection

1 code implementation • 24 Oct 2023 • Dennis Fucci, Marco Gaido, Sara Papi, Mauro Cettolo, Matteo Negri, Luisa Bentivogli

When translating words referring to the speaker, speech translation (ST) systems should not resort to default masculine generics nor rely on potentially misleading vocal traits.

Language Modelling

Paper
Code

Test Suites Task: Evaluation of Gender Fairness in MT with MuST-SHE and INES

1 code implementation • 30 Oct 2023 • Beatrice Savoldi, Marco Gaido, Matteo Negri, Luisa Bentivogli

As part of the WMT-2023 "Test suites" shared task, in this paper we summarize the results of two test suites evaluations: MuST-SHE-WMT23 and INES.

Fairness

Paper
Code

How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena

1 code implementation • 20 Feb 2024 • Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli

The attention mechanism, a cornerstone of state-of-the-art neural models, faces computational hurdles in processing long sequences due to its quadratic complexity.

Automatic Speech Recognition Image Classification +3

Paper
Code

Contextualized Translation of Automatically Segmented Speech

1 code implementation • 5 Aug 2020 • Marco Gaido, Mattia Antonino Di Gangi, Matteo Negri, Mauro Cettolo, Marco Turchi

We show that our context-aware solution is more robust to VAD-segmented input, outperforming a strong base model and the fine-tuning on different VAD segmentations of an English-German test set by up to 4. 25 BLEU points.

Segmentation Sentence +2

Paper
Code

On Knowledge Distillation for Direct Speech Translation

1 code implementation • 9 Dec 2020 • Marco Gaido, Mattia A. Di Gangi, Matteo Negri, Marco Turchi

Direct speech translation (ST) has shown to be a complex task requiring knowledge transfer from its sub-tasks: automatic speech recognition (ASR) and machine translation (MT).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Code

CTC-based Compression for Direct Speech Translation

1 code implementation • EACL 2021 • Marco Gaido, Mauro Cettolo, Matteo Negri, Marco Turchi

Previous studies demonstrated that a dynamic phone-informed compression of the input audio is beneficial for speech translation (ST).

Translation

Paper
Code

How to Split: the Effect of Word Segmentation on Gender Bias in Speech Translation

1 code implementation • Findings (ACL) 2021 • Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi

In light of this finding, we propose a combined approach that preserves BPE overall translation quality, while leveraging the higher ability of character-based segmentation to properly translate gender.

Segmentation Translation

Paper
Code

Between Flexibility and Consistency: Joint Generation of Captions and Subtitles

1 code implementation • ACL (IWSLT) 2021 • Alina Karakanta, Marco Gaido, Matteo Negri, Marco Turchi

Speech translation (ST) has lately received growing interest for the generation of subtitles without the need for an intermediate source language transcription and timing (i. e. captions).

Translation

Paper
Code

Is "moby dick" a Whale or a Bird? Named Entities and Terminology in Speech Translation

1 code implementation • 15 Sep 2021 • Marco Gaido, Susana Rodríguez, Matteo Negri, Luisa Bentivogli, Marco Turchi

Automatic translation systems are known to struggle with rare words.

Translation

Paper
Code

Under the Morphosyntactic Lens: A Multifaceted Evaluation of Gender Bias in Speech Translation

1 code implementation • ACL 2022 • Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi

Gender bias is largely recognized as a problematic phenomenon affecting language technologies, with recent studies underscoring that it might surface differently across languages.

POS Translation

Paper
Code

Speechformer: Reducing Information Loss in Direct Speech Translation

1 code implementation • EMNLP 2021 • Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi

Transformer-based models have gained increasing popularity achieving state-of-the-art performance in many research fields including speech translation.

Ranked #1 on Speech-to-Text Translation on MuST-C EN->NL

Speech-to-Text Translation Translation

Paper
Code

End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT2020

no code implementations • WS 2020 • Marco Gaido, Mattia Antonino Di Gangi, Matteo Negri, Marco Turchi

The test talks are provided in two versions: one contains the data already segmented with automatic tools and the other is the raw data without any segmentation.

Data Augmentation Knowledge Distillation +3

Paper
Add Code

On Target Segmentation for Direct Speech Translation

no code implementations • AMTA 2020 • Mattia Antonino Di Gangi, Marco Gaido, Matteo Negri, Marco Turchi

Then, subword-level segmentation became the state of the art in neural machine translation as it produces shorter sequences that reduce the training time, while being superior to word-level models.

Data Augmentation Machine Translation +2

Paper
Add Code

Breeding Gender-aware Direct Speech Translation Systems

no code implementations • COLING 2020 • Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi

In particular, by translating speech audio data without intermediate transcription, direct ST models are able to leverage and preserve essential information present in the input (e. g. speaker's vocal characteristics) that is otherwise lost in the cascade framework.

Machine Translation Translation

Paper
Add Code

Gender Bias in Machine Translation

1 code implementation • 13 Apr 2021 • Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi

Machine translation (MT) technology has facilitated our daily tasks by providing accessible shortcuts for gathering, elaborating and communicating information.

Machine Translation Translation

Paper
Code

Beyond Voice Activity Detection: Hybrid Audio Segmentation for Direct Speech Translation

no code implementations • ICNLSP 2021 • Marco Gaido, Matteo Negri, Mauro Cettolo, Marco Turchi

The audio segmentation mismatch between training data and those seen at run-time is a major problem in direct speech translation.

Action Detection Activity Detection +2

Paper
Add Code

Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference?

no code implementations • ACL 2021 • Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Alberto Martinelli, Matteo Negri, Marco Turchi

Five years after the first published proofs of concept, direct approaches to speech translation (ST) are now competing with traditional cascade solutions.

Translation

Paper
Add Code

Dealing with training and test segmentation mismatch: FBK@IWSLT2021

no code implementations • ACL (IWSLT) 2021 • Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi

Both knowledge distillation and the first fine-tuning step are carried out on manually segmented real and synthetic data, the latter being generated with an MT system trained on the available corpora.

Action Detection Activity Detection +4

Paper
Add Code

Is “moby dick” a Whale or a Bird? Named Entities and Terminology in Speech Translation

no code implementations • EMNLP 2021 • Marco Gaido, Susana Rodríguez, Matteo Negri, Luisa Bentivogli, Marco Turchi

Automatic translation systems are known to struggle with rare words.

Translation

Paper
Add Code

Extending the MuST-C Corpus for a Comparative Evaluation of Speech Translation Technology

no code implementations • EAMT 2022 • Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Matteo Negri, Marco Turchi

This project aimed at extending the test sets of the MuST-C speech translation (ST) corpus with new reference translations.

Machine Translation Translation

Paper
Add Code

On the Dynamics of Gender Learning in Speech Translation

no code implementations • NAACL (GeBNLP) 2022 • Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi

In this work, we contribute to such a line of inquiry by exploring the emergence of gender bias in Speech Translation (ST).

Translation

Paper
Add Code

Named Entity Detection and Injection for Direct Speech Translation

no code implementations • 21 Oct 2022 • Marco Gaido, Yun Tang, Ilia Kulikov, Rongqing Huang, Hongyu Gong, Hirofumi Inaguma

In a sentence, certain words are critical for its semantic.

Sentence Translation

Paper
Add Code

Distributed Silhouette Algorithm: Evaluating Clustering on Big Data

no code implementations • 24 Mar 2023 • Marco Gaido

To fill this gap, in this paper we introduce the first algorithm that computes the Silhouette metric with linear complexity and can easily execute in parallel in a distributed environment.

Clustering

Paper
Add Code

When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP

no code implementations • 28 Mar 2023 • Sara Papi, Marco Gaido, Andrea Pilzer, Matteo Negri

Despite its crucial role in research experiments, code correctness is often presumed only on the basis of the perceived quality of results.

Automatic Speech Recognition speech-recognition +1

Paper
Add Code

Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?

no code implementations • 19 Feb 2024 • Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli

The field of natural language processing (NLP) has recently witnessed a transformative shift with the emergence of foundation models, particularly Large Language Models (LLMs) that have revolutionized text-based NLP.

Speech-to-Text Translation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.