Search Results for author: Luisa Bentivogli

Found 44 papers, 12 papers with code

Extending the MuST-C Corpus for a Comparative Evaluation of Speech Translation Technology

no code implementations • EAMT 2022 • Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Matteo Negri, Marco Turchi

This project aimed at extending the test sets of the MuST-C speech translation (ST) corpus with new reference translations.

Machine Translation Translation

Paper
Add Code

Machine Translation Human Evaluation: an investigation of evaluation based on Post-Editing and its relation with Direct Assessment

no code implementations • IWSLT (EMNLP) 2018 • Luisa Bentivogli, Mauro Cettolo, Marcello Federico, Christian Federmann

In this paper we present an analysis of the two most prominent methodologies used for the human evaluation of MT quality, namely evaluation based on Post-Editing (PE) and evaluation based on Direct Assessment (DA).

Machine Translation

Paper
Add Code

On the Dynamics of Gender Learning in Speech Translation

no code implementations • NAACL (GeBNLP) 2022 • Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi

In this work, we contribute to such a line of inquiry by exploring the emergence of gender bias in Speech Translation (ST).

Translation

Paper
Add Code

CEF Data Marketplace: Powering a Long-term Supply of Language Data

no code implementations • EAMT 2020 • Amir Kamran, Dace Dzeguze, Jaap van der Meer, Milica Panic, Alessandro Cattelan, Daniele Patrioli, Luisa Bentivogli, Marco Turchi

We describe the CEF Data Marketplace project, which focuses on the development of a trading platform of translation data for language professionals: translators, machine translation (MT) developers, language service providers (LSPs), translation buyers and government bodies.

Machine Translation Translation

Paper
Add Code

Towards a methodology for evaluating automatic subtitling

no code implementations • EAMT 2022 • Alina Karakanta, Luisa Bentivogli, Mauro Cettolo, Matteo Negri, Marco Turchi

In response to the increasing interest towards automatic subtitling, this EAMT-funded project aimed at collecting subtitle post-editing data in a real use case scenario where professional subtitlers edit automatically generated subtitles.

Segmentation

Paper
Add Code

Post-editing in Automatic Subtitling: A Subtitlers’ perspective

1 code implementation • EAMT 2022 • Alina Karakanta, Luisa Bentivogli, Mauro Cettolo, Matteo Negri, Marco Turchi

Subtitling tools are recently being adapted for post-editing by providing automatically generated subtitles, and featuring not only machine translation, but also automatic segmentation and synchronisation.

Machine Translation Translation

Paper
Code

Is “moby dick” a Whale or a Bird? Named Entities and Terminology in Speech Translation

no code implementations • EMNLP 2021 • Marco Gaido, Susana Rodríguez, Matteo Negri, Luisa Bentivogli, Marco Turchi

Automatic translation systems are known to struggle with rare words.

Translation

Paper
Add Code

Findings of the IWSLT 2022 Evaluation Campaign

no code implementations • IWSLT (ACL) 2022 • Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondřej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, Dávid Javorský, Vĕra Kloudová, Surafel Lakew, Xutai Ma, Prashant Mathur, Paul McNamee, Kenton Murray, Maria Nǎdejde, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, John Ortega, Juan Pino, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Yogesh Virkar, Alexander Waibel, Changhan Wang, Shinji Watanabe

The evaluation campaign of the 19th International Conference on Spoken Language Translation featured eight shared tasks: (i) Simultaneous speech translation, (ii) Offline speech translation, (iii) Speech to speech translation, (iv) Low-resource speech translation, (v) Multilingual speech translation, (vi) Dialect speech translation, (vii) Formality control for speech translation, (viii) Isometric speech translation.

Speech-to-Speech Translation Translation

Paper
Add Code

The IWSLT 2016 Evaluation Campaign

no code implementations • IWSLT 2016 • Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, Rolando Cattoni, Marcello Federico

The IWSLT 2016 Evaluation Campaign featured two tasks: the translation of talks and the translation of video conference conversations.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Overview of the IWSLT 2017 Evaluation Campaign

no code implementations • IWSLT 2017 • Mauro Cettolo, Marcello Federico, Luisa Bentivogli, Jan Niehues, Sebastian Stüker, Katsuhito Sudoh, Koichiro Yoshino, Christian Federmann

The IWSLT 2017 evaluation campaign has organised three tasks.

Machine Translation Translation

Paper
Add Code

SBAAM! Eliminating Transcript Dependency in Automatic Subtitling

1 code implementation • 17 May 2024 • Marco Gaido, Sara Papi, Matteo Negri, Mauro Cettolo, Luisa Bentivogli

Subtitling plays a crucial role in enhancing the accessibility of audiovisual content and encompasses three primary subtasks: translating spoken dialogue, segmenting translations into concise textual units, and estimating timestamps that govern their on-screen duration.

Paper
Code

Enhancing Gender-Inclusive Machine Translation with Neomorphemes and Large Language Models

no code implementations • 14 May 2024 • Andrea Piergentili, Beatrice Savoldi, Matteo Negri, Luisa Bentivogli

In this direction, we explore prompting techniques with large language models (LLMs) to translate from English into Italian using neomorphemes.

Machine Translation Translation

Paper
Add Code

How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena

1 code implementation • 20 Feb 2024 • Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli

The attention mechanism, a cornerstone of state-of-the-art neural models, faces computational hurdles in processing long sequences due to its quadratic complexity.

Automatic Speech Recognition Image Classification +3

Paper
Code

Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?

no code implementations • 19 Feb 2024 • Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli

The field of natural language processing (NLP) has recently witnessed a transformative shift with the emergence of foundation models, particularly Large Language Models (LLMs) that have revolutionized text-based NLP.

Speech-to-Text Translation

Paper
Add Code

A Prompt Response to the Demand for Automatic Gender-Neutral Translation

no code implementations • 8 Feb 2024 • Beatrice Savoldi, Andrea Piergentili, Dennis Fucci, Matteo Negri, Luisa Bentivogli

Gender-neutral translation (GNT) that avoids biased and undue binary assumptions is a pivotal challenge for the creation of more inclusive translation technologies.

Machine Translation Translation

Paper
Add Code

Test Suites Task: Evaluation of Gender Fairness in MT with MuST-SHE and INES

1 code implementation • 30 Oct 2023 • Beatrice Savoldi, Marco Gaido, Matteo Negri, Luisa Bentivogli

As part of the WMT-2023 "Test suites" shared task, in this paper we summarize the results of two test suites evaluations: MuST-SHE-WMT23 and INES.

Fairness

Paper
Code

Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection

1 code implementation • 24 Oct 2023 • Dennis Fucci, Marco Gaido, Sara Papi, Mauro Cettolo, Matteo Negri, Luisa Bentivogli

When translating words referring to the speaker, speech translation (ST) systems should not resort to default masculine generics nor rely on potentially misleading vocal traits.

Decoder Language Modelling

Paper
Code

How To Build Competitive Multi-gender Speech Translation Models For Controlling Speaker Gender Translation

1 code implementation • 23 Oct 2023 • Marco Gaido, Dennis Fucci, Matteo Negri, Luisa Bentivogli

When translating from notional gender languages (e. g., English) into grammatical gender languages (e. g., Italian), the generated translation requires explicit gender assignments for various words, including those referring to the speaker.

Sentence Translation

Paper
Code

No Pitch Left Behind: Addressing Gender Unbalance in Automatic Speech Recognition through Pitch Manipulation

1 code implementation • 10 Oct 2023 • Dennis Fucci, Marco Gaido, Matteo Negri, Mauro Cettolo, Luisa Bentivogli

Automatic speech recognition (ASR) systems are known to be sensitive to the sociolinguistic variability of speech data, in which gender plays a crucial role.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Code

Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation with the GeNTE Corpus

1 code implementation • 8 Oct 2023 • Andrea Piergentili, Beatrice Savoldi, Dennis Fucci, Matteo Negri, Luisa Bentivogli

Gender inequality is embedded in our communication practices and perpetuated in translation technologies.

Benchmarking Machine Translation +1

Paper
Code

Good, but not always Fair: An Evaluation of Gender Bias for three commercial Machine Translation Systems

no code implementations • 9 Jun 2023 • Silvia Alma Piazzolla, Beatrice Savoldi, Luisa Bentivogli

Machine Translation (MT) continues to make significant strides in quality and is increasingly adopted on a larger scale.

Machine Translation Translation

Paper
Add Code

Gender Neutralization for an Inclusive Machine Translation: from Theoretical Foundations to Open Challenges

no code implementations • 24 Jan 2023 • Andrea Piergentili, Dennis Fucci, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri

Gender inclusivity in language technologies has become a prominent research topic.

Machine Translation Translation

Paper
Add Code

Under the Morphosyntactic Lens: A Multifaceted Evaluation of Gender Bias in Speech Translation

1 code implementation • ACL 2022 • Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi

Gender bias is largely recognized as a problematic phenomenon affecting language technologies, with recent studies underscoring that it might surface differently across languages.

POS Translation

Paper
Code

Is "moby dick" a Whale or a Bird? Named Entities and Terminology in Speech Translation

1 code implementation • 15 Sep 2021 • Marco Gaido, Susana Rodríguez, Matteo Negri, Luisa Bentivogli, Marco Turchi

Automatic translation systems are known to struggle with rare words.

Translation

Paper
Code

Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference?

no code implementations • ACL 2021 • Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Alberto Martinelli, Matteo Negri, Marco Turchi

Five years after the first published proofs of concept, direct approaches to speech translation (ST) are now competing with traditional cascade solutions.

Translation

Paper
Add Code

How to Split: the Effect of Word Segmentation on Gender Bias in Speech Translation

1 code implementation • Findings (ACL) 2021 • Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi

In light of this finding, we propose a combined approach that preserves BPE overall translation quality, while leveraging the higher ability of character-based segmentation to properly translate gender.

Segmentation Translation

Paper
Code

Gender Bias in Machine Translation

1 code implementation • 13 Apr 2021 • Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi

Machine translation (MT) technology has facilitated our daily tasks by providing accessible shortcuts for gathering, elaborating and communicating information.

Machine Translation Translation

Paper
Code

Breeding Gender-aware Direct Speech Translation Systems

no code implementations • COLING 2020 • Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi

In particular, by translating speech audio data without intermediate transcription, direct ST models are able to leverage and preserve essential information present in the input (e. g. speaker's vocal characteristics) that is otherwise lost in the cascade framework.

Machine Translation Translation

Paper
Add Code

Gender in Danger? Evaluating Speech Translation Technology on the MuST-SHE Corpus

no code implementations • ACL 2020 • Luisa Bentivogli, Beatrice Savoldi, Matteo Negri, Mattia Antonino Di Gangi, Roldano Cattoni, Marco Turchi

Translating from languages without productive grammatical gender like English into gender-marked languages is a well-known difficulty for machines.

Machine Translation Sentence +1

Paper
Add Code

Machine Translation for Machines: the Sentiment Classification Use Case

no code implementations • IJCNLP 2019 • Amirhossein Tebbifakhr, Luisa Bentivogli, Matteo Negri, Marco Turchi

Towards this objective, we present a reinforcement learning technique based on a new candidate sampling strategy, which exploits the results obtained on the downstream task as weak feedback.

Classification General Classification +7

Paper
Add Code

MAGMATic: A Multi-domain Academic Gold Standard with Manual Annotation of Terminology for Machine Translation Evaluation

no code implementations • WS 2019 • R Scansani, y, Luisa Bentivogli, Silvia Bernardini, Adriano Ferraresi

Machine Translation Translation

Paper
Add Code

Do translator trainees trust machine translation? An experiment on post-editing and revision

no code implementations • WS 2019 • R Scansani, y, Silvia Bernardini, Adriano Ferraresi, Luisa Bentivogli

Machine Translation Translation

Paper
Add Code

MuST-C: a Multilingual Speech Translation Corpus

no code implementations • NAACL 2019 • Mattia A. Di Gangi, Roldano Cattoni, Luisa Bentivogli, Matteo Negri, Marco Turchi

Current research on spoken language translation (SLT) has to confront with the scarcity of sizeable and publicly available training corpora.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Neural versus Phrase-Based Machine Translation Quality: a Case Study

no code implementations • EMNLP 2016 • Luisa Bentivogli, Arianna Bisazza, Mauro Cettolo, Marcello Federico

Within the field of Statistical Machine Translation (SMT), the neural approach (NMT) has recently emerged as the first technology able to challenge the long-standing dominance of phrase-based approaches (PBMT).

Machine Translation NMT +1

Paper
Add Code

WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words

no code implementations • LREC 2016 • Luisa Bentivogli, Mauro Cettolo, M. Amin Farajian, Marcello Federico

This paper presents WAGS (Word Alignment Gold Standard), a novel benchmark which allows extensive evaluation of WA tools on out-of-vocabulary (OOV) and rare words.

Sentence Word Alignment