Search Results for author: Charlotte Stix

Found 4 papers, 0 papers with code

Pre-Deployment Information Sharing: A Zoning Taxonomy for Precursory Capabilities

no code implementations18 Nov 2024 Matteo Pistillo, Charlotte Stix

(3) AISIs should establish adequate information protection infrastructure and guarantee increased information security as precursory capabilities move through the zones and towards red lines, including, if necessary, by classifying the information on precursory capabilities or marking it as controlled.

Towards evaluations-based safety cases for AI scheming

no code implementations29 Oct 2024 Mikita Balesni, Marius Hobbhahn, David Lindner, Alexander Meinke, Tomek Korbak, Joshua Clymer, Buck Shlegeris, Jérémy Scheurer, Charlotte Stix, Rusheb Shah, Nicholas Goldowsky-Dill, Dan Braun, Bilal Chughtai, Owain Evans, Daniel Kokotajlo, Lucius Bushnaq

We sketch how developers of frontier AI systems could construct a structured rationale -- a 'safety case' -- that an AI system is unlikely to cause catastrophic outcomes through scheming.

Actionable Principles for Artificial Intelligence Policy: Three Pathways

no code implementations24 Feb 2021 Charlotte Stix

Subsequently, these elements are expanded on and evaluated in light of their ability to contribute to a prototype framework for the development of Actionable Principles for AI.

Computers and Society

Cannot find the paper you are looking for? You can Submit a new open access paper.