1 code implementation • 19 Oct 2023 • Belen Alastruey, Matthias Sperber, Christian Gollan, Dominic Telaar, Tim Ng, Aashish Agarwal
Code-switching (CS), i.e. mixing different languages in a single sentence, is a common phenomenon in communication and can be challenging in many Natural Language Processing (NLP) settings.
no code implementations • 20 Sep 2023 • Belen Alastruey, Aleix Sant, Gerard I. Gállego, David Dale, Marta R. Costa-jussà
To contribute to these fields, we present SpeechAlign, a framework to evaluate the underexplored field of source-target alignment in speech models.
1 code implementation • 31 Aug 2023 • Benjamin Muller, Belen Alastruey, Prangthip Hansanti, Elahe Kalbassi, Christophe Ropers, Eric Michael Smith, Adina Williams, Luke Zettlemoyer, Pierre Andrews, Marta R. Costa-jussà
We showcase it to report gender representation in WMT training data and development data for the News task, confirming that current data is skewed towards masculine representation.
no code implementations • 12 Jun 2023 • Belen Alastruey, Lukas Drude, Jahn Heymann, Simon Wiesler
Convolutional frontends are a typical choice for Transformer-based automatic speech recognition: they preprocess the spectrogram, reduce its sequence length, and combine local information in time and frequency.
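As a rough illustration of the length reduction such a frontend performs (a hedged sketch, not the paper's exact architecture — the kernel/stride/padding values below are common defaults, assumed here):

```python
# Sketch: a typical Transformer-ASR convolutional frontend stacks two
# stride-2 convolutions, shrinking the spectrogram's time axis ~4x
# before the encoder sees it. Hyperparameters are assumptions.

def conv_out_len(n_frames: int, kernel: int = 3, stride: int = 2, pad: int = 1) -> int:
    """Output length along the time axis after one convolution."""
    return (n_frames + 2 * pad - kernel) // stride + 1

def frontend_out_len(n_frames: int, n_layers: int = 2) -> int:
    """Sequence length after stacking n_layers stride-2 convolutions."""
    for _ in range(n_layers):
        n_frames = conv_out_len(n_frames)
    return n_frames

# A 10 s utterance at a 10 ms frame shift yields 1000 spectrogram frames;
# two stride-2 layers cut that to 250 encoder positions.
print(frontend_out_len(1000))  # -> 250
```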
1 code implementation • 23 May 2022 • Javier Ferrando, Gerard I. Gállego, Belen Alastruey, Carlos Escolano, Marta R. Costa-jussà
In Neural Machine Translation (NMT), each token prediction is conditioned on the source sentence and the target prefix (the tokens already translated at a given decoding step).
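This conditioning can be sketched with a toy greedy decoding loop (a hypothetical stand-in model, not the paper's — the point is only that each step sees both the source and the prefix):

```python
# Sketch of p(y_t | source, y_<t): every prediction is a function of the
# full source sentence and the target prefix generated so far.

def toy_next_token(source, prefix):
    """Hypothetical stand-in for the NMT model: a copy model that
    emits the source token at the current prefix position."""
    return source[len(prefix)] if len(prefix) < len(source) else "<eos>"

def greedy_decode(source, max_len=10):
    prefix = []
    for _ in range(max_len):
        token = toy_next_token(source, prefix)  # conditioned on source + prefix
        if token == "<eos>":
            break
        prefix.append(token)
    return prefix

print(greedy_decode(["the", "blue", "house"]))  # -> ['the', 'blue', 'house']
```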
no code implementations • NAACL (ACL) 2022 • Gerard Sant, Gerard I. Gállego, Belen Alastruey, Marta R. Costa-jussà
Different approaches have been proposed to overcome these problems, such as the use of efficient attention mechanisms.
no code implementations • ACL 2022 • Belen Alastruey, Javier Ferrando, Gerard I. Gállego, Marta R. Costa-jussà
Transformers have achieved state-of-the-art results across multiple NLP tasks.
no code implementations • 7 Jul 2021 • Belen Alastruey, Gerard I. Gállego, Marta R. Costa-jussà
When working with speech, we face a problem: the sequence length of an audio input is far longer than that of text, making it ill-suited to the Transformer's quadratic self-attention.
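A back-of-the-envelope sketch of the mismatch (the frame and token counts below are illustrative assumptions, not figures from the paper):

```python
# Self-attention builds an n x n score matrix per head and layer, so its
# cost grows quadratically with sequence length.

def attention_entries(seq_len: int) -> int:
    """Number of pairwise attention scores for a sequence of length seq_len."""
    return seq_len * seq_len

text_tokens = 25      # a typical sentence (assumed)
audio_frames = 1000   # ~10 s of speech at a 10 ms frame shift (assumed)

print(attention_entries(text_tokens))   # -> 625
print(attention_entries(audio_frames))  # -> 1000000
```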