no code implementations • 30 Aug 2024 • Tan Zhi-Xuan, Micah Carroll, Matija Franklin, Hal Ashton
We first survey the limits of rational choice theory as a descriptive model, explaining how preferences fail to capture the thick semantic content of human values, and how utility representations neglect the possible incommensurability of those values.
no code implementations • 19 Jun 2023 • Matija Franklin, Rebecca Gorman, Hal Ashton, Stuart Armstrong
This article is a primer on concept extrapolation: the ability to take a concept, a feature, or a goal that is defined in one context and extrapolate it safely to a more general context.
no code implementations • 14 Sep 2022 • Hal Ashton, Matija Franklin
Iterative machine learning algorithms used to power recommender systems often change people's preferences by trying to learn them.
no code implementations • 20 Mar 2022 • Matija Franklin, Hal Ashton, Rebecca Gorman, Stuart Armstrong
We operationalize preference to incorporate concepts from various disciplines, outlining the importance of meta-preferences and preference-change preferences, and proposing a preliminary framework for how preferences change.
no code implementations • 8 Jun 2021 • Hal Ashton
Intent modifies an actor's culpability for many types of wrongdoing.
no code implementations • 7 Jun 2021 • Hal Ashton
One approach to defining Intention is to use the counterfactual tools developed to define Causality.
1 code implementation • 2 Nov 2020 • Hal Ashton
Campbell-Goodhart's law relates to the causal inference error whereby decision-making agents aim to influence variables which are correlated with their goal objective but do not reliably cause it.