Search Results for author: Hal Ashton

Found 7 papers, 1 papers with code

Beyond Preferences in AI Alignment

no code implementations30 Aug 2024 Tan Zhi-Xuan, Micah Carroll, Matija Franklin, Hal Ashton

We first survey the limits of rational choice theory as a descriptive model, explaining how preferences fail to capture the thick semantic content of human values, and how utility representations neglect the possible incommensurability of those values.

Descriptive

Concept Extrapolation: A Conceptual Primer

no code implementations19 Jun 2023 Matija Franklin, Rebecca Gorman, Hal Ashton, Stuart Armstrong

This article is a primer on concept extrapolation - the ability to take a concept, a feature, or a goal that is defined in one context and extrapolate it safely to a more general context.

Solutions to preference manipulation in recommender systems require knowledge of meta-preferences

no code implementations14 Sep 2022 Hal Ashton, Matija Franklin

Iterative machine learning algorithms used to power recommender systems often change people's preferences by trying to learn them.

Recommendation Systems

Recognising the importance of preference change: A call for a coordinated multidisciplinary research effort in the age of AI

no code implementations20 Mar 2022 Matija Franklin, Hal Ashton, Rebecca Gorman, Stuart Armstrong

We operationalize preference to incorporate concepts from various disciplines, outlining the importance of meta-preferences and preference-change preferences, and proposing a preliminary framework for how preferences change.

Diversity Recommendation Systems

Definitions of intent suitable for algorithms

no code implementations8 Jun 2021 Hal Ashton

Intent modifies an actor's culpability of many types wrongdoing.

Extending counterfactual accounts of intent to include oblique intent

no code implementations7 Jun 2021 Hal Ashton

One approach to defining Intention is to use the counterfactual tools developed to define Causality.

counterfactual

Causal Campbell-Goodhart's law and Reinforcement Learning

1 code implementation2 Nov 2020 Hal Ashton

Campbell-Goodhart's law relates to the causal inference error whereby decision-making agents aim to influence variables which are correlated to their goal objective but do not reliably cause it.

Causal Inference Decision Making +3

Cannot find the paper you are looking for? You can Submit a new open access paper.