Search Results for author: Luisa Bentivogli

Found 46 papers, 16 papers with code

Towards a methodology for evaluating automatic subtitling

no code implementations EAMT 2022 Alina Karakanta, Luisa Bentivogli, Mauro Cettolo, Matteo Negri, Marco Turchi

In response to the increasing interest towards automatic subtitling, this EAMT-funded project aimed at collecting subtitle post-editing data in a real use case scenario where professional subtitlers edit automatically generated subtitles.

Segmentation

Machine Translation Human Evaluation: an investigation of evaluation based on Post-Editing and its relation with Direct Assessment

no code implementations IWSLT (EMNLP) 2018 Luisa Bentivogli, Mauro Cettolo, Marcello Federico, Christian Federmann

In this paper we present an analysis of the two most prominent methodologies used for the human evaluation of MT quality, namely evaluation based on Post-Editing (PE) and evaluation based on Direct Assessment (DA).

Machine Translation

CEF Data Marketplace: Powering a Long-term Supply of Language Data

no code implementations EAMT 2020 Amir Kamran, Dace Dzeguze, Jaap van der Meer, Milica Panic, Alessandro Cattelan, Daniele Patrioli, Luisa Bentivogli, Marco Turchi

We describe the CEF Data Marketplace project, which focuses on the development of a trading platform of translation data for language professionals: translators, machine translation (MT) developers, language service providers (LSPs), translation buyers and government bodies.

Machine Translation Translation

Post-editing in Automatic Subtitling: A Subtitlers’ perspective

1 code implementation EAMT 2022 Alina Karakanta, Luisa Bentivogli, Mauro Cettolo, Matteo Negri, Marco Turchi

Subtitling tools are recently being adapted for post-editing by providing automatically generated subtitles, and featuring not only machine translation, but also automatic segmentation and synchronisation.

Machine Translation Translation

Findings of the IWSLT 2022 Evaluation Campaign

no code implementations IWSLT (ACL) 2022 Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondřej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, Dávid Javorský, Vĕra Kloudová, Surafel Lakew, Xutai Ma, Prashant Mathur, Paul McNamee, Kenton Murray, Maria Nǎdejde, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, John Ortega, Juan Pino, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Yogesh Virkar, Alexander Waibel, Changhan Wang, Shinji Watanabe

The evaluation campaign of the 19th International Conference on Spoken Language Translation featured eight shared tasks: (i) Simultaneous speech translation, (ii) Offline speech translation, (iii) Speech to speech translation, (iv) Low-resource speech translation, (v) Multilingual speech translation, (vi) Dialect speech translation, (vii) Formality control for speech translation, (viii) Isometric speech translation.

Speech-to-Speech Translation Translation

On the Dynamics of Gender Learning in Speech Translation

no code implementations NAACL (GeBNLP) 2022 Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi

In this work, we contribute to such a line of inquiry by exploring the emergence of gender bias in Speech Translation (ST).

Translation

SimulSeamless: FBK at IWSLT 2024 Simultaneous Speech Translation

1 code implementation20 Jun 2024 Sara Papi, Marco Gaido, Matteo Negri, Luisa Bentivogli

This paper describes the FBK's participation in the Simultaneous Translation Evaluation Campaign at IWSLT 2024.

Speech-to-Text Translation Translation

StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection

1 code implementation10 Jun 2024 Sara Papi, Marco Gaido, Matteo Negri, Luisa Bentivogli

To fill this gap, we introduce StreamAtt, the first StreamST policy, and propose StreamLAAL, the first StreamST latency metric designed to be comparable with existing metrics for SimulST.

Speech-to-Text Translation Translation

SBAAM! Eliminating Transcript Dependency in Automatic Subtitling

2 code implementations17 May 2024 Marco Gaido, Sara Papi, Matteo Negri, Mauro Cettolo, Luisa Bentivogli

Subtitling plays a crucial role in enhancing the accessibility of audiovisual content and encompasses three primary subtasks: translating spoken dialogue, segmenting translations into concise textual units, and estimating timestamps that govern their on-screen duration.

Enhancing Gender-Inclusive Machine Translation with Neomorphemes and Large Language Models

no code implementations14 May 2024 Andrea Piergentili, Beatrice Savoldi, Matteo Negri, Luisa Bentivogli

In this direction, we explore prompting techniques with large language models (LLMs) to translate from English into Italian using neomorphemes.

Machine Translation Translation

How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena

1 code implementation20 Feb 2024 Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli

The attention mechanism, a cornerstone of state-of-the-art neural models, faces computational hurdles in processing long sequences due to its quadratic complexity.

Automatic Speech Recognition Image Classification +3

Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?

no code implementations19 Feb 2024 Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli

The field of natural language processing (NLP) has recently witnessed a transformative shift with the emergence of foundation models, particularly Large Language Models (LLMs) that have revolutionized text-based NLP.

Speech-to-Text Translation

A Prompt Response to the Demand for Automatic Gender-Neutral Translation

1 code implementation8 Feb 2024 Beatrice Savoldi, Andrea Piergentili, Dennis Fucci, Matteo Negri, Luisa Bentivogli

Gender-neutral translation (GNT) that avoids biased and undue binary assumptions is a pivotal challenge for the creation of more inclusive translation technologies.

Machine Translation Translation

Test Suites Task: Evaluation of Gender Fairness in MT with MuST-SHE and INES

1 code implementation30 Oct 2023 Beatrice Savoldi, Marco Gaido, Matteo Negri, Luisa Bentivogli

As part of the WMT-2023 "Test suites" shared task, in this paper we summarize the results of two test suites evaluations: MuST-SHE-WMT23 and INES.

de-en Fairness

Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection

1 code implementation24 Oct 2023 Dennis Fucci, Marco Gaido, Sara Papi, Mauro Cettolo, Matteo Negri, Luisa Bentivogli

When translating words referring to the speaker, speech translation (ST) systems should not resort to default masculine generics nor rely on potentially misleading vocal traits.

Decoder Language Modelling

How To Build Competitive Multi-gender Speech Translation Models For Controlling Speaker Gender Translation

1 code implementation23 Oct 2023 Marco Gaido, Dennis Fucci, Matteo Negri, Luisa Bentivogli

When translating from notional gender languages (e. g., English) into grammatical gender languages (e. g., Italian), the generated translation requires explicit gender assignments for various words, including those referring to the speaker.

Sentence Translation

No Pitch Left Behind: Addressing Gender Unbalance in Automatic Speech Recognition through Pitch Manipulation

1 code implementation10 Oct 2023 Dennis Fucci, Marco Gaido, Matteo Negri, Mauro Cettolo, Luisa Bentivogli

Automatic speech recognition (ASR) systems are known to be sensitive to the sociolinguistic variability of speech data, in which gender plays a crucial role.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Under the Morphosyntactic Lens: A Multifaceted Evaluation of Gender Bias in Speech Translation

1 code implementation ACL 2022 Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi

Gender bias is largely recognized as a problematic phenomenon affecting language technologies, with recent studies underscoring that it might surface differently across languages.

POS Translation

Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference?

no code implementations ACL 2021 Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Alberto Martinelli, Matteo Negri, Marco Turchi

Five years after the first published proofs of concept, direct approaches to speech translation (ST) are now competing with traditional cascade solutions.

Translation

How to Split: the Effect of Word Segmentation on Gender Bias in Speech Translation

1 code implementation Findings (ACL) 2021 Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi

In light of this finding, we propose a combined approach that preserves BPE overall translation quality, while leveraging the higher ability of character-based segmentation to properly translate gender.

Segmentation Translation

Gender Bias in Machine Translation

1 code implementation13 Apr 2021 Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi

Machine translation (MT) technology has facilitated our daily tasks by providing accessible shortcuts for gathering, elaborating and communicating information.

Machine Translation Translation

Breeding Gender-aware Direct Speech Translation Systems

no code implementations COLING 2020 Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi

In particular, by translating speech audio data without intermediate transcription, direct ST models are able to leverage and preserve essential information present in the input (e. g. speaker's vocal characteristics) that is otherwise lost in the cascade framework.

Machine Translation Translation

Machine Translation for Machines: the Sentiment Classification Use Case

no code implementations IJCNLP 2019 Amirhossein Tebbifakhr, Luisa Bentivogli, Matteo Negri, Marco Turchi

Towards this objective, we present a reinforcement learning technique based on a new candidate sampling strategy, which exploits the results obtained on the downstream task as weak feedback.

Classification General Classification +7

Neural versus Phrase-Based Machine Translation Quality: a Case Study

no code implementations EMNLP 2016 Luisa Bentivogli, Arianna Bisazza, Mauro Cettolo, Marcello Federico

Within the field of Statistical Machine Translation (SMT), the neural approach (NMT) has recently emerged as the first technology able to challenge the long-standing dominance of phrase-based approaches (PBMT).

Machine Translation NMT +1

WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words

no code implementations LREC 2016 Luisa Bentivogli, Mauro Cettolo, M. Amin Farajian, Marcello Federico

This paper presents WAGS (Word Alignment Gold Standard), a novel benchmark which allows extensive evaluation of WA tools on out-of-vocabulary (OOV) and rare words.

Sentence Word Alignment

Cannot find the paper you are looking for? You can Submit a new open access paper.