Search Results for author: Anka Reuel

Found 3 papers, 1 papers with code

Escalation Risks from Language Models in Military and Diplomatic Decision-Making

1 code implementation7 Jan 2024 Juan-Pablo Rivera, Gabriel Mukobi, Anka Reuel, Max Lamparth, Chandler Smith, Jacquelyn Schneider

Governments are increasingly considering integrating autonomous AI agents in high-stakes military and foreign-policy decision-making, especially with the emergence of advanced generative AI models like GPT-4.

Decision Making Language Modelling

International Governance of Civilian AI: A Jurisdictional Certification Approach

no code implementations29 Aug 2023 Robert Trager, Ben Harack, Anka Reuel, Allison Carnegie, Lennart Heim, Lewis Ho, Sarah Kreps, Ranjit Lall, Owen Larter, Seán Ó hÉigeartaigh, Simon Staffell, José Jaime Villalobos

As international actors reach consensus on risks of and minimum standards for advanced AI, a jurisdictional certification regime could mitigate a broad range of potential harms, including threats to public safety.

Analyzing And Editing Inner Mechanisms Of Backdoored Language Models

no code implementations24 Feb 2023 Max Lamparth, Anka Reuel

Poisoning of data sets is a potential security threat to large language models that can lead to backdoored models.

Cannot find the paper you are looking for? You can Submit a new open access paper.