Search Results for author: Carroll L. Wainwright

Found 3 papers, 3 papers with code

Training language models to follow instructions with human feedback

7 code implementations • 4 Mar 2022 • Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe

In this paper, we show an avenue for aligning language models with user intent on a wide range of tasks by fine-tuning with human feedback.

SafeLife 1.0: Exploring Side Effects in Complex Environments

1 code implementation • 3 Dec 2019 • Carroll L. Wainwright, Peter Eckersley

We present SafeLife, a publicly available reinforcement learning environment that tests the safety of reinforcement learning agents.

Reinforcement Learning (RL)

CosmoTransitions: Computing Cosmological Phase Transition Temperatures and Bubble Profiles with Multiple Fields

2 code implementations • 19 Sep 2011 • Carroll L. Wainwright

I present a numerical package (CosmoTransitions) for analyzing finite-temperature cosmological phase transitions driven by single or multiple scalar fields.

High Energy Physics - Phenomenology • Cosmology and Nongalactic Astrophysics
