Search Results for author: Onur Çelebi

Found 4 papers, 3 papers with code

xSIM++: An Improved Proxy to Bitext Mining Performance for Low-Resource Languages

1 code implementation • 22 Jun 2023 • Mingda Chen, Kevin Heffernan, Onur Çelebi, Alex Mourachko, Holger Schwenk

In comparison to xSIM, we show that xSIM++ is better correlated with the downstream BLEU scores of translation systems trained on mined bitexts, providing a reliable proxy of bitext mining performance without needing to run expensive bitext mining pipelines.

NMT

3,520

Paper
Code

Listen to what they say: Better understand and detect online misinformation with user feedback

no code implementations • 31 Oct 2022 • Hubert Etienne, Onur Çelebi

Social media users who report content are key allies in the management of online misinformation, however, no research has been conducted yet to understand their role and the different trends underlying their reporting activity.

Management Misinformation

Paper
Add Code

No Language Left Behind: Scaling Human-Centered Machine Translation

8 code implementations • Meta AI 2022 • NLLB team, Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, Kevin Heffernan, Elahe Kalbassi, Janice Lam, Daniel Licht, Jean Maillard, Anna Sun, Skyler Wang, Guillaume Wenzek, Al Youngblood, Bapi Akula, Loic Barrault, Gabriel Mejia Gonzalez, Prangthip Hansanti, John Hoffman, Semarley Jarrett, Kaushik Ram Sadagopan, Dirk Rowe, Shannon Spruit, Chau Tran, Pierre Andrews, Necip Fazil Ayan, Shruti Bhosale, Sergey Edunov, Angela Fan, Cynthia Gao, Vedanuj Goswami, Francisco Guzmán, Philipp Koehn, Alexandre Mourachko, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Jeff Wang

Driven by the goal of eradicating language barriers on a global scale, machine translation has solidified itself as a key focus of artificial intelligence research today.

Ranked #1 on Machine Translation on IWSLT2017 French-English (SacreBLEU metric)

Machine Translation Translation

29,251

Paper
Code

Bitext Mining Using Distilled Sentence Representations for Low-Resource Languages

1 code implementation • 25 May 2022 • Kevin Heffernan, Onur Çelebi, Holger Schwenk

To achieve this, we focus on teacher-student training, allowing all encoders to be mutually compatible for bitext mining, and enabling fast learning of new languages.

Cross-Lingual Transfer NMT +2

3,520

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.