Search Results for author: Alexandre Ramé

Found 6 papers, 3 papers with code

Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization

1 code implementation • 20 Dec 2022 • Alexandre Ramé, Kartik Ahuja, Jianyu Zhang, Matthieu Cord, Léon Bottou, David Lopez-Paz

In this paper, we propose model ratatouille, a new strategy to recycle the multiple fine-tunings of the same foundation model on diverse auxiliary tasks.

Domain Generalization, Out-of-Distribution Generalization

Towards efficient feature sharing in MIMO architectures

no code implementations • 20 May 2022 • Rémy Sun, Alexandre Ramé, Clément Masson, Nicolas Thome, Matthieu Cord

To solve this issue, we propose a novel unmixing step in MIMO architectures that allows subnetworks to properly share features.

WARM: On the Benefits of Weight Averaged Reward Models

no code implementations • 22 Jan 2024 • Alexandre Ramé, Nino Vieillard, Léonard Hussenot, Robert Dadashi, Geoffrey Cideron, Olivier Bachem, Johan Ferret

We identify two primary challenges when designing RMs to mitigate reward hacking: distribution shifts during the RL process and inconsistencies in human preferences.
