Search Results for author: Harri Edwards

Found 6 papers, 5 papers with code

Prover-Verifier Games improve legibility of LLM outputs

1 code implementation18 Jul 2024 Jan Hendrik Kirchner, Yining Chen, Harri Edwards, Jan Leike, Nat McAleese, Yuri Burda

One way to increase confidence in the outputs of Large Language Models (LLMs) is to support them with reasoning that is clear and easy to check -- a property we call legibility.

Math

Let's Verify Step by Step

3 code implementations Preprint 2023 Hunter Lightman, Vineet Kosaraju, Yura Burda, Harri Edwards, Bowen Baker, Teddy Lee, Jan Leike, John Schulman, Ilya Sutskever, Karl Cobbe

We conduct our own investigation, finding that process supervision significantly outperforms outcome supervision for training models to solve problems from the challenging MATH dataset.

 Ranked #1 on Math Word Problem Solving on MATH minival (using extra training data)

Active Learning Math +2

AutoDIME: Automatic Design of Interesting Multi-Agent Environments

no code implementations4 Mar 2022 Ingmar Kanitscheider, Harri Edwards

One approach is to train a second RL agent, called a teacher, who samples environments that are conducive for the learning of student agents.

Value prediction

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

6 code implementations6 Jan 2022 Alethea Power, Yuri Burda, Harri Edwards, Igor Babuschkin, Vedant Misra

In this paper we propose to study generalization of neural networks on small algorithmically generated datasets.

Memorization

Large-Scale Study of Curiosity-Driven Learning

5 code implementations ICLR 2019 Yuri Burda, Harri Edwards, Deepak Pathak, Amos Storkey, Trevor Darrell, Alexei A. Efros

However, annotating each environment with hand-designed, dense rewards is not scalable, motivating the need for developing reward functions that are intrinsic to the agent.

Atari Games Reinforcement Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.