no code implementations • 22 Feb 2024 • Arash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara Hooker
AI alignment in the form of Reinforcement Learning from Human Feedback (RLHF) is increasingly treated as a crucial ingredient for high-performance large language models.
no code implementations • 12 Feb 2024 • Ahmet Üstün, Viraat Aryabumi, Zheng-Xin Yong, Wei-Yin Ko, Daniel D'souza, Gbemileke Onilude, Neel Bhandari, Shivalika Singh, Hui-Lee Ooi, Amr Kayid, Freddie Vargus, Phil Blunsom, Shayne Longpre, Niklas Muennighoff, Marzieh Fadaee, Julia Kreutzer, Sara Hooker
Recent breakthroughs in large language models (LLMs) have centered around a handful of data-rich languages.
no code implementations • 9 Feb 2024 • Shivalika Singh, Freddie Vargus, Daniel Dsouza, Börje F. Karlsson, Abinaya Mahendiran, Wei-Yin Ko, Herumb Shandilya, Jay Patel, Deividas Mataciunas, Laura OMahony, Mike Zhang, Ramith Hettiarachchi, Joseph Wilson, Marina Machado, Luisa Souza Moura, Dominik Krzemiński, Hakimeh Fadaei, Irem Ergün, Ifeoma Okoh, Aisha Alaagib, Oshan Mudannayake, Zaid Alyafeai, Vu Minh Chien, Sebastian Ruder, Surya Guthikonda, Emad A. Alghamdi, Sebastian Gehrmann, Niklas Muennighoff, Max Bartolo, Julia Kreutzer, Ahmet Üstün, Marzieh Fadaee, Sara Hooker
The Aya initiative also serves as a valuable case study in participatory research, involving collaborators from 119 countries.
no code implementations • 29 Nov 2023 • Meriem Boubdir, Edward Kim, Beyza Ermis, Sara Hooker, Marzieh Fadaee
In Natural Language Processing (NLP), the Elo rating system, originally designed for ranking players in dynamic games such as chess, is increasingly being used to evaluate Large Language Models (LLMs) through "A vs B" paired comparisons.
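The Elo mechanism referenced here is standard: after each pairwise "A vs B" outcome, both ratings move toward the observed result, weighted by how surprising it was. A minimal sketch of one update step (the K-factor of 32 is a common default, not a value taken from the paper):

```python
def elo_update(rating_a, rating_b, score_a, k=32):
    """One Elo update after an "A vs B" comparison.

    score_a is 1.0 if A wins, 0.0 if B wins, 0.5 for a tie.
    The expected score comes from the logistic curve with a
    400-point scale, as in chess rating systems.
    """
    expected_a = 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))
    new_a = rating_a + k * (score_a - expected_a)
    new_b = rating_b + k * ((1.0 - score_a) - (1.0 - expected_a))
    return new_a, new_b
```

Because the update depends only on the most recent comparison, the final ratings can be sensitive to match ordering and sparsity, which is exactly the kind of reliability question this line of work examines.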
no code implementations • 22 Oct 2023 • Meriem Boubdir, Edward Kim, Beyza Ermis, Marzieh Fadaee, Sara Hooker
Human evaluation is increasingly critical for assessing large language models, capturing linguistic nuances, and reflecting user preferences more accurately than traditional automated metrics.
no code implementations • 8 Sep 2023 • Max Marion, Ahmet Üstün, Luiza Pozzobon, Alex Wang, Marzieh Fadaee, Sara Hooker
In this work, we take a wider view and explore scalable estimates of data quality that can be used to systematically measure the quality of pretraining data.
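One common scalable proxy for pretraining-data quality is perplexity under a reference language model: documents that a model trained on trusted text finds unsurprising are kept, and high-perplexity outliers are dropped. The toy sketch below uses a Laplace-smoothed unigram model as the scorer purely for illustration; the actual work uses far stronger estimators, and the function names here are hypothetical:

```python
import math
from collections import Counter

def unigram_perplexity(text, counts, total, vocab_size):
    """Perplexity of `text` under a Laplace-smoothed unigram model."""
    tokens = text.split()
    log_prob = sum(
        math.log((counts[t] + 1) / (total + vocab_size)) for t in tokens
    )
    return math.exp(-log_prob / max(len(tokens), 1))

def filter_by_perplexity(docs, reference_docs, keep_fraction=0.5):
    """Keep the `keep_fraction` of docs with the lowest perplexity
    under a unigram model fit on `reference_docs`."""
    counts = Counter(t for d in reference_docs for t in d.split())
    total, vocab = sum(counts.values()), len(counts)
    scored = sorted(
        docs, key=lambda d: unigram_perplexity(d, counts, total, vocab)
    )
    return scored[: max(1, int(len(docs) * keep_fraction))]
```

The design point is that the scorer runs once per document, so the filter scales linearly with corpus size, which is what makes perplexity-style estimates practical at pretraining scale.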
1 code implementation • 4 Jan 2023 • Vitor Jeronymo, Luiz Bonifacio, Hugo Abonizio, Marzieh Fadaee, Roberto Lotufo, Jakub Zavrel, Rodrigo Nogueira
Recently, InPars introduced a method to efficiently use large language models (LLMs) in information retrieval tasks: via few-shot examples, an LLM is induced to generate relevant queries for documents.
1 code implementation • 12 Dec 2022 • Guilherme Rosa, Luiz Bonifacio, Vitor Jeronymo, Hugo Abonizio, Marzieh Fadaee, Roberto Lotufo, Rodrigo Nogueira
We find that the number of parameters and early query-document interactions of cross-encoders play a significant role in the generalization ability of retrieval models.
1 code implementation • 6 Jun 2022 • Guilherme Moraes Rosa, Luiz Bonifacio, Vitor Jeronymo, Hugo Abonizio, Marzieh Fadaee, Roberto Lotufo, Rodrigo Nogueira
Due to latency constraints, distilled and dense models have become the go-to choice for deployment in real-world retrieval applications.
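The InPars-style recipe mentioned in the adjacent entries works by prompting an LLM with a handful of (document, query) pairs and asking it to produce a relevant query for a new document; the generated pairs then serve as synthetic training data for retrievers. A minimal prompt builder, with a format that is illustrative rather than the exact template from the paper, might look like:

```python
def build_fewshot_query_prompt(examples, document):
    """Assemble a few-shot prompt (hypothetical format) asking an LLM
    to generate a relevant query for `document`.

    `examples` is a list of (document, query) pairs used as
    in-context demonstrations.
    """
    parts = []
    for doc, query in examples:
        parts.append(f"Document: {doc}\nRelevant query: {query}\n")
    # The final document is left with an open "Relevant query:" slot
    # for the LLM to complete.
    parts.append(f"Document: {document}\nRelevant query:")
    return "\n".join(parts)
```

Generated queries are typically filtered afterwards (e.g. keeping only the highest-scoring generations) before being used to train a ranker.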
Ranked #1 on Citation Prediction on SciDocs (BEIR)
1 code implementation • 10 Feb 2022 • Luiz Bonifacio, Hugo Abonizio, Marzieh Fadaee, Rodrigo Nogueira
In this work, we harness the few-shot capabilities of large pretrained language models as synthetic data generators for IR tasks.
1 code implementation • 31 Aug 2021 • Luiz Bonifacio, Vitor Jeronymo, Hugo Queiroz Abonizio, Israel Campiotti, Marzieh Fadaee, Roberto Lotufo, Rodrigo Nogueira
In this work, we present mMARCO, a multilingual version of the MS MARCO passage ranking dataset created using machine translation and comprising 13 languages.
no code implementations • 20 Feb 2021 • Marzieh Fadaee
To understand and infer meaning in language, neural models have to learn complicated nuances.
no code implementations • EMNLP (sdp) 2020 • Marzieh Fadaee, Olga Gureenkova, Fernando Rejon Barrera, Carsten Schnober, Wouter Weerkamp, Jakub Zavrel
We give an overview of the overall architecture of the system and of the components for document analysis, question answering, search, analytics, expert search, and recommendations.
1 code implementation • WS 2020 • Marzieh Fadaee, Christof Monz
Recent works have shown that Neural Machine Translation (NMT) models achieve impressive performance; however, questions about the behavior of these models remain unanswered.
no code implementations • EMNLP 2018 • Marzieh Fadaee, Christof Monz
In this work, we explore different aspects of back-translation, and show that words with high prediction loss during training benefit most from the addition of synthetic data.
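The finding that high-loss words benefit most from synthetic data suggests a simple targeting step: rank vocabulary items by their mean prediction loss during training and direct back-translated data toward the hardest ones. A sketch of that selection step, with hypothetical names and a made-up top fraction:

```python
def select_difficult_words(word_losses, top_fraction=0.1):
    """Return the words with the highest mean prediction loss.

    `word_losses` maps each word to its mean training loss; the
    returned words are candidates around which to sample sentences
    for targeted back-translation.
    """
    ranked = sorted(word_losses.items(), key=lambda kv: kv[1], reverse=True)
    k = max(1, int(len(ranked) * top_fraction))
    return [word for word, _ in ranked[:k]]
```

In a full pipeline, monolingual sentences containing the selected words would then be back-translated to create the synthetic parallel data.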
1 code implementation • LREC 2018 • Marzieh Fadaee, Arianna Bisazza, Christof Monz
Neural Machine Translation (NMT) has been widely used in recent years with significant improvements for many language pairs.
1 code implementation • ACL 2017 • Marzieh Fadaee, Arianna Bisazza, Christof Monz
Distributed word representations are widely used for modeling words in NLP tasks.
1 code implementation • ACL 2017 • Marzieh Fadaee, Arianna Bisazza, Christof Monz
The quality of a Neural Machine Translation system depends substantially on the availability of sizable parallel corpora.
Data Augmentation Low-Resource Neural Machine Translation +2