Search Results for author: Matthias Gerstgrasser

Found 8 papers, 1 papers with code

Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

no code implementations1 Apr 2024 Matthias Gerstgrasser, Rylan Schaeffer, Apratim Dey, Rafael Rafailov, Henry Sleight, John Hughes, Tomasz Korbak, Rajashree Agrawal, Dhruv Pai, Andrey Gromov, Daniel A. Roberts, Diyi Yang, David L. Donoho, Sanmi Koyejo

The proliferation of generative models, combined with pretraining on web-scale data, raises a timely question: what happens when these models are trained on their own generated outputs?

Image Generation

Grounding Gaps in Language Model Generations

no code implementations15 Nov 2023 Omar Shaikh, Kristina Gligorić, Ashna Khetan, Matthias Gerstgrasser, Diyi Yang, Dan Jurafsky

To understand the roots of the identified grounding gap, we examine the role of instruction tuning and preference optimization, finding that training on contemporary preference data leads to a reduction in generated grounding acts.

Language Modelling

Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning

1 code implementation NeurIPS 2023 Matthias Gerstgrasser, Tom Danino, Sarah Keren

We present a novel multi-agent RL approach, Selective Multi-Agent Prioritized Experience Relay, in which agents share with other agents a limited number of transitions they observe during training.

Multi-agent Reinforcement Learning reinforcement-learning

Oracles & Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning

no code implementations19 Oct 2022 Matthias Gerstgrasser, David C. Parkes

Stackelberg equilibria arise naturally in a range of popular learning problems, such as in security games or indirect mechanism design, and have received increasing attention in the reinforcement learning literature.

Multi-agent Reinforcement Learning reinforcement-learning +1

Collaboration Promotes Group Resilience in Multi-Agent AI

no code implementations12 Nov 2021 Sarah Keren, Matthias Gerstgrasser, Ofir Abu, Jeffrey Rosenschein

AI agents need to be robust to unexpected changes in their environment in order to safely operate in real-world scenarios.

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games

no code implementations ICLR 2022 Matthias Gerstgrasser, Rakshit Trivedi, David C. Parkes

Human demonstrations of video game play can serve as vital surrogate representations of real-world behaviors, access to which would facilitate rapid progress in several complex learning settings (e. g. behavior classification, imitation learning, offline RL etc.).

Atari Games Imitation Learning +2

Reinforcement Learning of Sequential Price Mechanisms

no code implementations2 Oct 2020 Gianluca Brero, Alon Eden, Matthias Gerstgrasser, David C. Parkes, Duncan Rheingans-Yoo

We introduce the use of reinforcement learning for indirect mechanisms, working with the existing class of sequential price mechanisms, which generalizes both serial dictatorship and posted price mechanisms and essentially characterizes all strongly obviously strategyproof mechanisms.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.