1 code implementation • 12 Apr 2022 • Yuntao Bai, Andy Jones, Kamal Ndousse, Amanda Askell, Anna Chen, Nova DasSarma, Dawn Drain, Stanislav Fort, Deep Ganguli, Tom Henighan, Nicholas Joseph, Saurav Kadavath, Jackson Kernion, Tom Conerly, Sheer El-Showk, Nelson Elhage, Zac Hatfield-Dodds, Danny Hernandez, Tristan Hume, Scott Johnston, Shauna Kravec, Liane Lovitt, Neel Nanda, Catherine Olsson, Dario Amodei, Tom Brown, Jack Clark, Sam McCandlish, Chris Olah, Ben Mann, Jared Kaplan
We apply preference modeling and reinforcement learning from human feedback (RLHF) to finetune language models to act as helpful and harmless assistants.
no code implementations • 13 Dec 2021 • Amin Shokri Gazafroudi, Elisabeth Zeyen, Martha Frysztacki, Fabian Neumann, Tom Brown
Using corrective actions to overcome network loading when single lines fail has the potential to free up network capacity that is otherwise underused in preventive N-1 security strategies.
no code implementations • 1 Dec 2021 • Amanda Askell, Yuntao Bai, Anna Chen, Dawn Drain, Deep Ganguli, Tom Henighan, Andy Jones, Nicholas Joseph, Ben Mann, Nova DasSarma, Nelson Elhage, Zac Hatfield-Dodds, Danny Hernandez, Jackson Kernion, Kamal Ndousse, Catherine Olsson, Dario Amodei, Tom Brown, Jack Clark, Sam McCandlish, Chris Olah, Jared Kaplan
We find that the benefits from modest interventions increase with model size, generalize to a variety of alignment evaluations, and do not compromise the performance of large models.
no code implementations • 22 Jan 2021 • Martha Maria Frysztacki, Jonas Hörsch, Veit Hagenmeyer, Tom Brown
If we focus on the effect of renewable resource resolution and ignore network restrictions, we find that a higher resolution allows the optimal solution to concentrate wind and solar capacity at sites with better capacity factors and thus reduces system costs by up to 10% compared to a low resolution model.
Physics and Society Computation
1 code implementation • 14 Dec 2020 • Nicholas Carlini, Florian Tramer, Eric Wallace, Matthew Jagielski, Ariel Herbert-Voss, Katherine Lee, Adam Roberts, Tom Brown, Dawn Song, Ulfar Erlingsson, Alina Oprea, Colin Raffel
We demonstrate our attack on GPT-2, a language model trained on scrapes of the public Internet, and are able to extract hundreds of verbatim text sequences from the model's training data.
no code implementations • 3 Dec 2020 • Elisabeth Zeyen, Veit Hagenmeyer, Tom Brown
Space and water heating accounts for about 40% of final energy consumption in the European Union and thus plays a key role in reducing overall costs and greenhouse gas emissions.
Physics and Society
1 code implementation • 4 Oct 2019 • Fabian Neumann, Tom Brown
Models for long-term investment planning of the power system typically return a single optimal solution per set of cost assumptions.
Physics and Society Systems and Control Systems and Control
2 code implementations • 21 Aug 2019 • Daniel Kang, Yi Sun, Dan Hendrycks, Tom Brown, Jacob Steinhardt
Adversaries adapt and evolve their attacks; hence adversarial defenses must be robust to a broad range of unforeseen attacks.
no code implementations • 3 May 2019 • Daniel Kang, Yi Sun, Tom Brown, Dan Hendrycks, Jacob Steinhardt
We study the transfer of adversarial robustness of deep neural networks between different perturbation types.
no code implementations • 14 Aug 2018 • Catherine Olsson, Surya Bhupatiraju, Tom Brown, Augustus Odena, Ian Goodfellow
We explore a new way to evaluate generative models using insights from evaluation of competitive games between human players.
4 code implementations • 5 Jun 2018 • Jonas Hörsch, Fabian Hofmann, David Schlachtberger, Tom Brown
PyPSA-Eur, the first open model dataset of the European power system at the transmission network level to cover the full ENTSO-E area, is presented.
Physics and Society
2 code implementations • 31 Jul 2017 • Tom Brown, Jonas Hörsch, David Schlachtberger
In this paper the basic functionality of PyPSA is described, including the formulation of the full power flow equations and the multi-period optimisation of operation and investment with linear power flow equations.
Physics and Society
2 code implementations • 22 May 2017 • Jonas Hörsch, Tom Brown
The effects of the spatial scale on the results of the optimisation of transmission and generation capacity in Europe are quantified under a 95% CO2 reduction compared to 1990 levels, interpolating between one-node-per-country solutions and many-nodes-per-country.
Physics and Society Applied Physics
14 code implementations • 3 Oct 2016 • Nicolas Papernot, Fartash Faghri, Nicholas Carlini, Ian Goodfellow, Reuben Feinman, Alexey Kurakin, Cihang Xie, Yash Sharma, Tom Brown, Aurko Roy, Alexander Matyasko, Vahid Behzadan, Karen Hambardzumyan, Zhishuai Zhang, Yi-Lin Juang, Zhi Li, Ryan Sheatsley, Abhibhav Garg, Jonathan Uesato, Willi Gierke, Yinpeng Dong, David Berthelot, Paul Hendricks, Jonas Rauber, Rujun Long, Patrick McDaniel
An adversarial example library for constructing attacks, building defenses, and benchmarking both