no code implementations • 1 Oct 2024 • Beatrice Savoldi, Sara Papi, Matteo Negri, Ana Guerberof, Luisa Bentivogli
Gender bias in machine translation (MT) is recognized as an issue that can harm people and society.
1 code implementation • 1 Oct 2024 • Marco Gaido, Sara Papi, Luisa Bentivogli, Alessio Brutti, Mauro Cettolo, Roberto Gretter, Marco Matassoni, Mohamed Nabih, Matteo Negri
The rise of foundation models (FMs), coupled with regulatory efforts addressing their risks and impacts, has sparked significant interest in open-source models.
no code implementations • 25 Sep 2024 • Francesco Verdini, Pierfrancesco Melucci, Stefano Perna, Francesco Cariaggi, Marco Gaido, Sara Papi, Szymon Mazurek, Marek Kasztelnik, Luisa Bentivogli, Sébastien Bratières, Paolo Merialdo, Simone Scardapane
The remarkable performance achieved by Large Language Models (LLM) has driven research efforts to leverage them for a wide range of tasks and input modalities.
1 code implementation • 20 Jun 2024 • Sara Papi, Marco Gaido, Matteo Negri, Luisa Bentivogli
This paper describes the FBK's participation in the Simultaneous Translation Evaluation Campaign at IWSLT 2024.
1 code implementation • 10 Jun 2024 • Sara Papi, Marco Gaido, Matteo Negri, Luisa Bentivogli
To fill this gap, we introduce StreamAtt, the first StreamST policy, and propose StreamLAAL, the first StreamST latency metric designed to be comparable with existing metrics for SimulST.
2 code implementations • 17 May 2024 • Marco Gaido, Sara Papi, Matteo Negri, Mauro Cettolo, Luisa Bentivogli
Subtitling plays a crucial role in enhancing the accessibility of audiovisual content and encompasses three primary subtasks: translating spoken dialogue, segmenting translations into concise textual units, and estimating timestamps that govern their on-screen duration.
1 code implementation • 20 Feb 2024 • Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli
The attention mechanism, a cornerstone of state-of-the-art neural models, faces computational hurdles in processing long sequences due to its quadratic complexity.
no code implementations • 19 Feb 2024 • Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli
The field of natural language processing (NLP) has recently witnessed a transformative shift with the emergence of foundation models, particularly Large Language Models (LLMs) that have revolutionized text-based NLP.
1 code implementation • 24 Oct 2023 • Dennis Fucci, Marco Gaido, Sara Papi, Mauro Cettolo, Matteo Negri, Luisa Bentivogli
When translating words referring to the speaker, speech translation (ST) systems should not resort to default masculine generics nor rely on potentially misleading vocal traits.
no code implementations • 23 Oct 2023 • Sara Papi, Peidong Wang, Junkun Chen, Jian Xue, Naoyuki Kanda, Jinyu Li, Yashesh Gaur
The growing need for instant spoken language transcription and translation is driven by increased global communication and cross-lingual interactions.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
1 code implementation • 27 Sep 2023 • Sara Papi, Marco Gaido, Matteo Negri
This paper describes the FBK's participation in the Simultaneous Translation and Automatic Subtitling tracks of the IWSLT 2023 Evaluation Campaign.
no code implementations • 7 Jul 2023 • Sara Papi, Peidong Wang, Junkun Chen, Jian Xue, Jinyu Li, Yashesh Gaur
In real-world applications, users often require both translations and transcriptions of speech to enhance their comprehension, particularly in streaming scenarios where incremental generation is necessary.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
2 code implementations • 19 May 2023 • Sara Papi, Marco Turchi, Matteo Negri
Attention is the core mechanism of today's most used architectures for natural language processing and has been analyzed from many perspectives, including its effectiveness for machine translation-related tasks.
2 code implementations • 28 Mar 2023 • Sara Papi, Marco Gaido, Andrea Pilzer, Matteo Negri
Despite its crucial role in research experiments, code correctness is often presumed only on the basis of the perceived quality of results.
2 code implementations • 15 Dec 2022 • Sara Papi, Matteo Negri, Marco Turchi
The study of the attention mechanism has sparked interest in many fields, such as language modeling and machine translation.
1 code implementation • 21 Oct 2022 • Marco Gaido, Sara Papi, Matteo Negri, Marco Turchi
Modern automatic translation systems aim at place the human at the center by providing contextual support and knowledge.
1 code implementation • 27 Sep 2022 • Sara Papi, Marco Gaido, Alina Karakanta, Mauro Cettolo, Matteo Negri, Marco Turchi
Automatic subtitling is the task of automatically translating the speech of audiovisual content into short pieces of timed text, i. e. subtitles and their corresponding timestamps.
1 code implementation • 21 Sep 2022 • Sara Papi, Alina Karakanta, Matteo Negri, Marco Turchi
Speech translation for subtitling (SubST) is the task of automatically translating speech data into well-formed subtitles by inserting subtitle breaks compliant to specific displaying guidelines.
1 code implementation • NAACL (AutoSimTrans) 2022 • Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi
Simultaneous speech translation (SimulST) systems aim at generating their output with the lowest possible latency, which is normally computed in terms of Average Lagging (AL).
1 code implementation • IWSLT (ACL) 2022 • Marco Gaido, Sara Papi, Dennis Fucci, Giuseppe Fiameni, Matteo Negri, Marco Turchi
The primary goal of this FBK's systems submission to the IWSLT 2022 offline and simultaneous speech translation tasks is to reduce model training costs without sacrificing translation quality.
1 code implementation • 8 Apr 2022 • Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi
In simultaneous speech translation (SimulST), finding the best trade-off between high translation quality and low latency is a challenging task.
no code implementations • 31 Oct 2021 • Sara Papi, Matteo Negri, Marco Turchi
Simultaneous speech translation (SimulST) is the task in which output generation has to be performed on partial, incremental speech input.
1 code implementation • EMNLP 2021 • Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi
Transformer-based models have gained increasing popularity achieving state-of-the-art performance in many research fields including speech translation.
Ranked #1 on Speech-to-Text Translation on MuST-C EN->NL
1 code implementation • MTSummit 2021 • Alina Karakanta, Sara Papi, Matteo Negri, Marco Turchi
Experiments on three language pairs (en$\rightarrow$it, de, fr) show that scrolling lines is the only mode achieving an acceptable reading speed while keeping delay close to a 4-second threshold.
no code implementations • 23 Jun 2021 • Sara Papi, Edmondo Trentin, Roberto Gretter, Marco Matassoni, Daniele Falavigna
The paper copes with the task of automatic assessment of second language proficiency from the language learners' spoken responses to test prompts.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • ACL (IWSLT) 2021 • Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi
Both knowledge distillation and the first fine-tuning step are carried out on manually segmented real and synthetic data, the latter being generated with an MT system trained on the available corpora.