Search Results for author: Markus Anderljung

Found 16 papers, 1 papers with code

IDs for AI Systems

no code implementations17 Jun 2024 Alan Chan, Noam Kolt, Peter Wills, Usman Anwar, Christian Schroeder de Witt, Nitarshan Rajkumar, Lewis Hammond, David Krueger, Lennart Heim, Markus Anderljung

AI systems are increasingly pervasive, yet information needed to decide whether and how to engage with them may not exist or be accessible.

Beyond static AI evaluations: advancing human interaction evaluations for LLM harms and risks

no code implementations17 May 2024 Lujain Ibrahim, Saffron Huang, Lama Ahmad, Markus Anderljung

In this paper, we discuss and operationalize a definition of an emerging category of evaluations -- "human interaction evaluations" (HIEs) -- which focus on the assessment of human-model interactions or the process and the outcomes of humans using models.

Societal Adaptation to Advanced AI

no code implementations16 May 2024 Jamie Bernardi, Gabriel Mukobi, Hilary Greaves, Lennart Heim, Markus Anderljung

Existing strategies for managing risks from advanced AI systems often focus on affecting what AI systems are developed and how they diffuse.

Visibility into AI Agents

no code implementations23 Jan 2024 Alan Chan, Carson Ezell, Max Kaufmann, Kevin Wei, Lewis Hammond, Herbie Bradley, Emma Bluemke, Nitarshan Rajkumar, David Krueger, Noam Kolt, Lennart Heim, Markus Anderljung

Increased delegation of commercial, scientific, governmental, and personal activities to AI agents -- systems capable of pursuing complex goals with limited supervision -- may exacerbate existing societal risks and introduce new risks.

Informativeness

Towards Publicly Accountable Frontier LLMs: Building an External Scrutiny Ecosystem under the ASPIRE Framework

no code implementations15 Nov 2023 Markus Anderljung, Everett Thornton Smith, Joe O'Brien, Lisa Soder, Benjamin Bucknall, Emma Bluemke, Jonas Schuett, Robert Trager, Lacey Strahm, Rumman Chowdhury

With the increasing integration of frontier large language models (LLMs) into society and the economy, decisions related to their training, deployment, and use have far-reaching implications.

Frontier AI Regulation: Managing Emerging Risks to Public Safety

no code implementations6 Jul 2023 Markus Anderljung, Joslyn Barnhart, Anton Korinek, Jade Leung, Cullen O'Keefe, Jess Whittlestone, Shahar Avin, Miles Brundage, Justin Bullock, Duncan Cass-Beggs, Ben Chang, Tantum Collins, Tim Fist, Gillian Hadfield, Alan Hayes, Lewis Ho, Sara Hooker, Eric Horvitz, Noam Kolt, Jonas Schuett, Yonadav Shavit, Divya Siddarth, Robert Trager, Kevin Wolf

To address these challenges, at least three building blocks for the regulation of frontier models are needed: (1) standard-setting processes to identify appropriate requirements for frontier AI developers, (2) registration and reporting requirements to provide regulators with visibility into frontier AI development processes, and (3) mechanisms to ensure compliance with safety standards for the development and deployment of frontier AI models.

Protecting Society from AI Misuse: When are Restrictions on Capabilities Warranted?

no code implementations16 Mar 2023 Markus Anderljung, Julian Hazell

We also contend that some restrictions on non-AI capabilities needed to cause harm will be required.

The Brussels Effect and Artificial Intelligence: How EU regulation will impact the global AI market

no code implementations23 Aug 2022 Charlotte Siegmann, Markus Anderljung

We consider both the possibility that the EU's AI regulation will incentivise changes in products offered in non-EU countries (a de facto Brussels Effect) and the possibility it will influence regulation adopted by other jurisdictions (a de jure Brussels Effect).

Institutionalising Ethics in AI through Broader Impact Requirements

no code implementations30 May 2021 Carina Prunkl, Carolyn Ashurst, Markus Anderljung, Helena Webb, Jan Leike, Allan Dafoe

In 2020, the Conference on Neural Information Processing Systems (NeurIPS) introduced a requirement for submitting authors to include a statement on the broader societal impacts of their research.

Ethics

Social and Governance Implications of Improved Data Efficiency

no code implementations14 Jan 2020 Aaron D. Tucker, Markus Anderljung, Allan Dafoe

Many researchers work on improving the data efficiency of machine learning.

Cannot find the paper you are looking for? You can Submit a new open access paper.