no code implementations • 12 Oct 2023 • Ju-chieh Chou, Chung-Ming Chien, Wei-Ning Hsu, Karen Livescu, Arun Babu, Alexis Conneau, Alexei Baevski, Michael Auli
However, in the field of language modeling, very little effort has been made to model speech and text jointly.
3 code implementations • arXiv 2023 • Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli
Expanding the language coverage of speech technology has the potential to improve access to information for many more people.
1 code implementation • NeurIPS 2023 • Michael Hassid, Tal Remez, Tu Anh Nguyen, Itai Gat, Alexis Conneau, Felix Kreuk, Jade Copet, Alexandre Defossez, Gabriel Synnaeve, Emmanuel Dupoux, Roy Schwartz, Yossi Adi
In this work, we propose TWIST, a method for training SpeechLMs using a warm start from a pretrained textual language model.
Ranked #3 on Language Modelling on SALMon (using extra training data)
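A minimal sketch of the warm-start idea, assuming Hugging Face transformers; the OPT checkpoint and the number of speech units (500) are illustrative, not the paper's exact setup:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Start from a pretrained text LM and grow its vocabulary with discrete
# speech-unit tokens; checkpoint name and unit count are illustrative.
tok = AutoTokenizer.from_pretrained("facebook/opt-125m")
model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
tok.add_tokens([f"<unit_{i}>" for i in range(500)])
model.resize_token_embeddings(len(tok))
# Training then continues as ordinary next-token prediction over unit sequences.
```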
no code implementations • 10 Jan 2023 • Armen Aghajanyan, Lili Yu, Alexis Conneau, Wei-Ning Hsu, Karen Hambardzumyan, Susan Zhang, Stephen Roller, Naman Goyal, Omer Levy, Luke Zettlemoyer
To better understand the scaling properties of such mixed-modal models, we conducted over 250 experiments using seven different modalities and model sizes ranging from 8 million to 30 billion parameters, trained on 5-100 billion tokens.
1 code implementation • 25 May 2022 • Alexis Conneau, Min Ma, Simran Khanuja, Yu Zhang, Vera Axelrod, Siddharth Dalmia, Jason Riesa, Clara Rivera, Ankur Bapna
We introduce FLEURS, the Few-shot Learning Evaluation of Universal Representations of Speech benchmark.
Automatic Speech Recognition (ASR) +7
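FLEURS is available on the Hugging Face hub; a quick way to inspect one language split (the "fr_fr" config name follows the dataset card, and "all" loads every language):

```python
from datasets import load_dataset

fleurs = load_dataset("google/fleurs", "fr_fr", split="train")
sample = fleurs[0]
print(sample["transcription"], sample["audio"]["sampling_rate"])
```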
no code implementations • 24 Mar 2022 • Ye Jia, Yifan Ding, Ankur Bapna, Colin Cherry, Yu Zhang, Alexis Conneau, Nobuyuki Morioka
End-to-end speech-to-speech translation (S2ST) without relying on intermediate text representations is a rapidly emerging frontier of research.
no code implementations • 21 Mar 2022 • Alexis Conneau, Ankur Bapna, Yu Zhang, Min Ma, Patrick von Platen, Anton Lozhkov, Colin Cherry, Ye Jia, Clara Rivera, Mihir Kale, Daan van Esch, Vera Axelrod, Simran Khanuja, Jonathan H. Clark, Orhan Firat, Michael Auli, Sebastian Ruder, Jason Riesa, Melvin Johnson
Covering 102 languages from 10+ language families, 3 different domains and 4 task families, XTREME-S aims to simplify multilingual speech representation evaluation, as well as catalyze research in "universal" speech representation learning.
no code implementations • 3 Feb 2022 • Ankur Bapna, Colin Cherry, Yu Zhang, Ye Jia, Melvin Johnson, Yong Cheng, Simran Khanuja, Jason Riesa, Alexis Conneau
We present mSLAM, a multilingual Speech and LAnguage Model that learns cross-lingual cross-modal representations of speech and text by pre-training jointly on large amounts of unlabeled speech and text in multiple languages.
2 code implementations • 17 Nov 2021 • Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli
On the CoVoST-2 speech translation benchmark, we improve the previous state of the art by an average of 7.4 BLEU over 21 translation directions into English.
Ranked #1 on Language Identification on VOXLINGUA107
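The pretrained XLS-R checkpoints are public; a short example of extracting frame-level representations with transformers, using dummy audio in place of a real recording:

```python
import torch
from transformers import AutoFeatureExtractor, Wav2Vec2Model

name = "facebook/wav2vec2-xls-r-300m"   # 1B and 2B variants are also released
extractor = AutoFeatureExtractor.from_pretrained(name)
model = Wav2Vec2Model.from_pretrained(name)

waveform = torch.randn(16000)  # one second of dummy 16 kHz audio
inputs = extractor(waveform.numpy(), sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    hidden = model(**inputs).last_hidden_state  # (1, frames, 1024)
print(hidden.shape)
```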
no code implementations • 20 Oct 2021 • Ankur Bapna, Yu-An Chung, Nan Wu, Anmol Gulati, Ye Jia, Jonathan H. Clark, Melvin Johnson, Jason Riesa, Alexis Conneau, Yu Zhang
We build a single encoder with the BERT objective on unlabeled text together with the w2v-BERT objective on unlabeled speech.
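The models themselves are not public, but the training scheme reduces to summing two self-supervised objectives on one shared encoder; a heavily simplified sketch with illustrative names:

```python
def slam_step(encoder, mlm_head, w2v_bert_head, text_batch, speech_batch):
    # One joint pre-training step: the same encoder consumes both modalities,
    # and the two losses are simply added (loss weighting and the alignment
    # losses described in the paper are omitted here).
    loss_text = mlm_head(encoder(text_batch))           # BERT-style masked LM loss
    loss_speech = w2v_bert_head(encoder(speech_batch))  # w2v-BERT-style loss
    return loss_text + loss_speech
```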
no code implementations • ACL 2021 • Xian Li, Changhan Wang, Yun Tang, Chau Tran, Yuqing Tang, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli
We present a simple yet effective approach to build multilingual speech-to-text (ST) translation through efficient transfer learning from a pretrained speech encoder and text decoder.
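transformers exposes this general recipe through SpeechEncoderDecoderModel; the checkpoints below are illustrative stand-ins, not the ones used in the paper:

```python
from transformers import SpeechEncoderDecoderModel

# Graft a pretrained multilingual speech encoder onto a pretrained
# multilingual text decoder; cross-attention weights are newly initialized,
# and the composite model is then fine-tuned on speech translation data.
model = SpeechEncoderDecoderModel.from_encoder_decoder_pretrained(
    "facebook/wav2vec2-large-xlsr-53", "facebook/mbart-large-cc25"
)
```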
no code implementations • 8 Jul 2021 • Andros Tjandra, Diptanu Gon Choudhury, Frank Zhang, Kritika Singh, Alexis Conneau, Alexei Baevski, Assaf Sela, Yatharth Saraf, Michael Auli
Language identification greatly impacts the success of downstream tasks such as automatic speech recognition.
Automatic Speech Recognition (ASR) +3
4 code implementations • NeurIPS 2021 • Alexei Baevski, Wei-Ning Hsu, Alexis Conneau, Michael Auli
Despite rapid progress in the recent past, current speech recognition systems still require labeled training data which limits this technology to a small fraction of the languages spoken around the globe.
no code implementations • ACL (RepL4NLP) 2021 • Naman Goyal, Jingfei Du, Myle Ott, Giri Anantharaman, Alexis Conneau
Our model also outperforms the RoBERTa-Large model on several English tasks of the GLUE benchmark by 0.3% on average while handling 99 more languages.
no code implementations • 14 Apr 2021 • Changhan Wang, Anne Wu, Juan Pino, Alexei Baevski, Michael Auli, Alexis Conneau
In this paper, we improve speech translation (ST) through effectively leveraging large quantities of unlabeled speech and text data in different and complementary ways.
1 code implementation • ICLR 2021 • Beliz Gunel, Jingfei Du, Alexis Conneau, Ves Stoyanov
Our proposed fine-tuning objective leads to models that are more robust to different levels of noise in the fine-tuning training data, and can generalize better to related tasks with limited labeled data.
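A sketch of the supervised contrastive term (the temperature value is illustrative); the paper combines it with cross-entropy as (1 - lambda) * CE + lambda * SCL:

```python
import torch
import torch.nn.functional as F

def supervised_contrastive_loss(features, labels, temperature=0.3):
    # features: (batch, dim) encoder outputs; labels: (batch,) class ids.
    # Same-class pairs are pulled together relative to all other pairs.
    z = F.normalize(features, dim=1)
    sim = z @ z.T / temperature
    self_mask = torch.eye(len(labels), dtype=torch.bool)
    pos_mask = (labels[:, None] == labels[None, :]) & ~self_mask
    sim = sim.masked_fill(self_mask, float("-inf"))  # drop self-similarity
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    n_pos = pos_mask.sum(1).clamp(min=1)
    return -(log_prob.masked_fill(~pos_mask, 0.0).sum(1) / n_pos).mean()
```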
no code implementations • 24 Oct 2020 • Xian Li, Changhan Wang, Yun Tang, Chau Tran, Yuqing Tang, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli
We present a simple yet effective approach to build multilingual speech-to-text (ST) translation by efficient transfer learning from a pretrained speech encoder and text decoder.
3 code implementations • 22 Oct 2020 • Qiantong Xu, Alexei Baevski, Tatiana Likhomanenko, Paden Tomasello, Alexis Conneau, Ronan Collobert, Gabriel Synnaeve, Michael Auli
Self-training and unsupervised pre-training have emerged as effective approaches to improve speech recognition systems using unlabeled data.
Ranked #1 on Speech Recognition on LibriSpeech train-clean-100 test-other (using extra training data)
1 code implementation • NAACL 2021 • Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Ves Stoyanov, Alexis Conneau
Unsupervised pre-training has led to much recent progress in natural language understanding.
6 code implementations • 24 Jun 2020 • Alexis Conneau, Alexei Baevski, Ronan Collobert, Abdel-rahman Mohamed, Michael Auli
This paper presents XLSR, which learns cross-lingual speech representations by pretraining a single model from the raw waveform of speech in multiple languages.
28 code implementations • ACL 2020 • Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov
We also present a detailed empirical analysis of the key factors that are required to achieve these gains, including the trade-offs between (1) positive transfer and capacity dilution and (2) the performance of high and low resource languages at scale.
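The XLM-R checkpoints are public; loading the base model for masked-LM inference takes a few lines:

```python
from transformers import AutoTokenizer, AutoModelForMaskedLM

tok = AutoTokenizer.from_pretrained("xlm-roberta-base")
model = AutoModelForMaskedLM.from_pretrained("xlm-roberta-base")

# One shared SentencePiece vocabulary covers all 100 training languages.
batch = tok(["Hello <mask>!", "Bonjour le <mask> !"], return_tensors="pt", padding=True)
print(model(**batch).logits.shape)  # (2, seq_len, vocab_size)
```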
no code implementations • ACL 2020 • Shijie Wu, Alexis Conneau, Haoran Li, Luke Zettlemoyer, Veselin Stoyanov
We study the problem of multilingual masked language modeling, i.e. the training of a single model on concatenated text from multiple languages, and present a detailed study of several factors that influence why these models are so effective for cross-lingual transfer.
2 code implementations • LREC 2020 • Guillaume Wenzek, Marie-Anne Lachaux, Alexis Conneau, Vishrav Chaudhary, Francisco Guzmán, Armand Joulin, Edouard Grave
Pre-training text representations have led to significant improvements in many areas of natural language processing.
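One core ingredient of the pipeline is fastText language identification (the released pipeline additionally filters by language-model perplexity); a minimal filter, assuming fastText's public lid.176.bin model is available locally:

```python
import fasttext

lid = fasttext.load_model("lid.176.bin")  # fastText's public 176-language LID model

def keep(line, lang="fr", threshold=0.5):
    # Keep a Common Crawl line only if the LID model assigns it to the
    # target language with enough confidence; the threshold is illustrative.
    labels, probs = lid.predict(line.replace("\n", " "))
    return labels[0] == f"__label__{lang}" and probs[0] >= threshold
```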
17 code implementations • NeurIPS 2019 • Guillaume Lample, Alexis Conneau
On unsupervised machine translation, we obtain 34.3 BLEU on WMT'16 German-English, improving the previous state of the art by more than 9 BLEU.
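The paper's translation language modeling (TLM) objective masks tokens in a concatenated parallel pair, so the model can rely on the other language to fill them in; a simplified sketch (BERT's 80/10/10 replacement scheme is omitted):

```python
import torch

def tlm_batch(src_ids, tgt_ids, mask_id, mask_prob=0.15):
    # Concatenate a parallel sentence pair and mask positions in both halves.
    ids = torch.cat([src_ids, tgt_ids])
    labels = ids.clone()
    mask = torch.rand(ids.shape) < mask_prob
    labels[~mask] = -100      # only masked positions contribute to the loss
    ids[mask] = mask_id
    return ids, labels
```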
no code implementations • EMNLP 2018 • Guillaume Lample, Myle Ott, Alexis Conneau, Ludovic Denoyer, Marc'Aurelio Ranzato
Machine translation systems achieve near human-level performance on some languages, yet their effectiveness strongly relies on the availability of large amounts of parallel sentences, which hinders their applicability to the majority of language pairs.
9 code implementations • EMNLP 2018 • Alexis Conneau, Guillaume Lample, Ruty Rinott, Adina Williams, Samuel R. Bowman, Holger Schwenk, Veselin Stoyanov
State-of-the-art natural language processing systems rely on supervision in the form of annotated data to learn competent models.
Ranked #5 on Natural Language Inference on XNLI French
Cross-Lingual Natural Language Inference Machine Translation +2
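The corpus is on the Hugging Face hub; loading one language portion (dataset id per the hub, where it may also appear as facebook/xnli):

```python
from datasets import load_dataset

xnli = load_dataset("xnli", "fr", split="test")
ex = xnli[0]
print(ex["premise"], "=>", ex["hypothesis"], ex["label"])  # 0/1/2 = entail/neutral/contradict
```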
no code implementations • ACL 2018 • Alexis Conneau, German Kruszewski, Guillaume Lample, Loïc Barrault, Marco Baroni
Although much effort has recently been devoted to training high-quality sentence embeddings, we still have a poor understanding of what they are capturing.
6 code implementations • 3 May 2018 • Alexis Conneau, German Kruszewski, Guillaume Lample, Loïc Barrault, Marco Baroni
Although much effort has recently been devoted to training high-quality sentence embeddings, we still have a poor understanding of what they are capturing.
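The probing methodology itself is simple: freeze the sentence embeddings and fit a small classifier for each linguistic property. A toy version with random stand-ins for the embeddings and the probed property:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

emb = np.random.randn(2000, 512)      # stand-in for frozen sentence embeddings
prop = np.random.randint(0, 2, 2000)  # stand-in for a property such as sentence length
Xtr, Xte, ytr, yte = train_test_split(emb, prop, test_size=0.2, random_state=0)
probe = LogisticRegression(max_iter=1000).fit(Xtr, ytr)
print("probing accuracy:", probe.score(Xte, yte))  # ~0.5 here, since labels are random
```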
14 code implementations • EMNLP 2018 • Guillaume Lample, Myle Ott, Alexis Conneau, Ludovic Denoyer, Marc'Aurelio Ranzato
Machine translation systems achieve near human-level performance on some languages, yet their effectiveness strongly relies on the availability of large amounts of parallel sentences, which hinders their applicability to the majority of language pairs.
Ranked #2 on Machine Translation on WMT2016 English-Russian
11 code implementations • LREC 2018 • Alexis Conneau, Douwe Kiela
We introduce SentEval, a toolkit for evaluating the quality of universal sentence representations.
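Usage follows a callback pattern: you supply a `batcher` that embeds sentences, and SentEval runs the downstream tasks. A skeleton based on the repository's README (the paths, params, and random encoder are placeholders):

```python
import numpy as np
import senteval

def prepare(params, samples):
    pass  # e.g., build a vocabulary from the task's sentences

def batcher(params, batch):
    return np.random.rand(len(batch), 300)  # replace with a real sentence encoder

params = {"task_path": "path/to/SentEval/data", "usepytorch": False, "kfold": 5}
se = senteval.engine.SE(params, batcher, prepare)
print(se.eval(["STS12", "SST2"]))
```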
14 code implementations • ICLR 2018 • Guillaume Lample, Alexis Conneau, Ludovic Denoyer, Marc'Aurelio Ranzato
By learning to reconstruct in both languages from this shared feature space, the model effectively learns to translate without using any labeled data.
Ranked #7 on Machine Translation on WMT2016 German-English
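A compressed sketch of one training step, combining the paper's two ingredients, denoising auto-encoding and on-the-fly back-translation (the `model` and `noise` interfaces here are assumptions, not the paper's code):

```python
import torch

def unsup_mt_step(model, noise, batch_src, batch_tgt):
    # Denoising auto-encoding: reconstruct each sentence from a corrupted copy.
    loss_ae = model(noise(batch_src), labels=batch_src).loss \
            + model(noise(batch_tgt), labels=batch_tgt).loss
    # Back-translation: translate with the current model, then learn to map
    # the synthetic translation back to the original sentence.
    with torch.no_grad():
        synth_tgt = model.generate(batch_src)
        synth_src = model.generate(batch_tgt)
    loss_bt = model(synth_tgt, labels=batch_src).loss \
            + model(synth_src, labels=batch_tgt).loss
    return loss_ae + loss_bt
```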
19 code implementations • ICLR 2018 • Alexis Conneau, Guillaume Lample, Marc'Aurelio Ranzato, Ludovic Denoyer, Hervé Jégou
We finally describe experiments on the English-Esperanto low-resource language pair, for which only a limited amount of parallel data exists, to show the potential impact of our method in fully unsupervised machine translation.
Ranked #2 on Word Alignment on en-es
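After adversarial initialization, the method refines the mapping with a closed-form Procrustes step over high-confidence translation pairs; the solution is standard and easy to reproduce:

```python
import numpy as np

def procrustes(X, Y):
    # X: (n, d) source word vectors, Y: (n, d) target vectors for matched pairs.
    # The orthogonal W minimizing ||X W^T - Y||_F is U V^T, where
    # U S V^T = SVD(Y^T X)  (Schönemann, 1966).
    U, _, Vt = np.linalg.svd(Y.T @ X)
    return U @ Vt

# Mapping: y_hat = procrustes(X, Y) @ x. Retrieval then uses CSLS rather
# than plain nearest neighbors to reduce hubness.
```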
no code implementations • NAACL 2018 • Douwe Kiela, Alexis Conneau, Allan Jabri, Maximilian Nickel
We introduce a variety of models, trained on a supervised image captioning corpus to predict the image features for a given caption, to perform sentence representation grounding.
23 code implementations • EMNLP 2017 • Alexis Conneau, Douwe Kiela, Holger Schwenk, Loic Barrault, Antoine Bordes
Many modern NLP systems rely on word embeddings, previously trained in an unsupervised manner on large corpora, as base features.
Ranked #1 on Semantic Textual Similarity on SentEval
Cross-Lingual Natural Language Inference Semantic Textual Similarity +3
2 code implementations • 25 Jul 2016 • Flavian Vasile, Elena Smirnova, Alexis Conneau
We propose Meta-Prod2vec, a novel method to compute item similarities for recommendation that leverages existing item metadata.
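A rough approximation with gensim: interleaving each product with its metadata token puts metadata in the skip-gram context window. The paper weights the side-information terms separately, which gensim cannot express directly, so this is only an approximation:

```python
from gensim.models import Word2Vec

# Toy browsing sessions: product ids interleaved with category metadata.
sessions = [
    ["p1", "cat:shoes", "p7", "cat:shoes", "p3", "cat:bags"],
    ["p7", "cat:shoes", "p1", "cat:shoes"],
]
model = Word2Vec(sessions, vector_size=64, window=3, min_count=1, sg=1)
print(model.wv.most_similar("p1", topn=3))
```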
23 code implementations • EACL 2017 • Alexis Conneau, Holger Schwenk, Loïc Barrault, Yann Lecun
The dominant approaches for many NLP tasks are recurrent neural networks, in particular LSTMs, and convolutional neural networks.
Ranked #17 on Text Classification on AG News
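The architecture stacks small-kernel temporal convolutions over character embeddings; one block, with illustrative sizes:

```python
import torch.nn as nn

class ConvBlock(nn.Module):
    # One VDCNN-style block: two 3-wide temporal convolutions, each followed
    # by batch norm and ReLU; the full model stacks these with pooling,
    # doubling the channel count as the sequence is downsampled.
    def __init__(self, channels):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv1d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm1d(channels), nn.ReLU(),
            nn.Conv1d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm1d(channels), nn.ReLU(),
        )

    def forward(self, x):  # x: (batch, channels, seq_len)
        return self.net(x)
```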