Search Results for author: Adam Bales

Found 2 papers, 0 papers with code

Artificial Intelligence: Arguments for Catastrophic Risk

no code implementations • 27 Jan 2024 • Adam Bales, William D'Alessandro, Cameron Domenico Kirk-Giannini

The first argument -- the Problem of Power-Seeking -- claims that, under certain assumptions, advanced AI systems are likely to engage in dangerous power-seeking behavior in pursuit of their goals.

Paper
Add Code

Truthful AI: Developing and governing AI that does not lie

no code implementations • 13 Oct 2021 • Owain Evans, Owen Cotton-Barratt, Lukas Finnveden, Adam Bales, Avital Balwit, Peter Wills, Luca Righetti, William Saunders

Establishing norms or laws of AI truthfulness will require significant work to: (1) identify clear truthfulness standards; (2) create institutions that can judge adherence to those standards; and (3) develop AI systems that are robustly truthful.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.