7 code implementations • 4 Mar 2022 • Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe
In this paper, we show an avenue for aligning language models with user intent on a wide range of tasks by fine-tuning with human feedback.
1 code implementation • 3 Dec 2019 • Carroll L. Wainwright, Peter Eckersley
We present SafeLife, a publicly available reinforcement learning environment that tests the safety of reinforcement learning agents.
2 code implementations • 19 Sep 2011 • Carroll L. Wainwright
I present a numerical package (CosmoTransitions) for analyzing finite-temperature cosmological phase transitions driven by single or multiple scalar fields.
High Energy Physics - Phenomenology Cosmology and Nongalactic Astrophysics