Search Results for author: Markus Kunesch

Found 8 papers, 0 papers with code

Doing the right thing for the right reason: Evaluating artificial moral cognition by probing cost insensitivity

no code implementations • 29 May 2023 • Yiran Mao, Madeline G. Reinecke, Markus Kunesch, Edgar A. Duéñez-Guzmán, Ramona Comanescu, Julia Haas, Joel Z. Leibo

Is it possible to evaluate the moral cognition of complex artificial agents?

Meta Reinforcement Learning reinforcement-learning

Beyond Bayes-optimality: meta-learning what you know you don't know

no code implementations • 30 Sep 2022 • Jordi Grau-Moya, Grégoire Delétang, Markus Kunesch, Tim Genewein, Elliot Catt, Kevin Li, Anian Ruoss, Chris Cundy, Joel Veness, Jane Wang, Marcus Hutter, Christopher Summerfield, Shane Legg, Pedro Ortega

This is in contrast to risk-sensitive agents, which additionally exploit the higher-order moments of the return, and ambiguity-sensitive agents, which act differently when recognizing situations in which they lack knowledge.

Decision Making Meta-Learning

Your Policy Regularizer is Secretly an Adversary

no code implementations • 23 Mar 2022 • Rob Brekelmans, Tim Genewein, Jordi Grau-Moya, Grégoire Delétang, Markus Kunesch, Shane Legg, Pedro Ortega

Policy regularization methods such as maximum entropy regularization are widely used in reinforcement learning to improve the robustness of a learned policy.

Model-Free Risk-Sensitive Reinforcement Learning

no code implementations • 4 Nov 2021 • Grégoire Delétang, Jordi Grau-Moya, Markus Kunesch, Tim Genewein, Rob Brekelmans, Shane Legg, Pedro A. Ortega

Since the Gaussian free energy is known to be a certainty-equivalent sensitive to the mean and the variance, the learning rule has applications in risk-sensitive decision-making.

Decision Making reinforcement-learning +1

Shaking the foundations: delusions in sequence models for interaction and control

no code implementations • 20 Oct 2021 • Pedro A. Ortega, Markus Kunesch, Grégoire Delétang, Tim Genewein, Jordi Grau-Moya, Joel Veness, Jonas Buchli, Jonas Degrave, Bilal Piot, Julien Perolat, Tom Everitt, Corentin Tallec, Emilio Parisotto, Tom Erez, Yutian Chen, Scott Reed, Marcus Hutter, Nando de Freitas, Shane Legg

The recent phenomenal success of language models has reinvigorated machine learning research, and large sequence models such as transformers are being applied to a variety of domains.

counterfactual

Stochastic Approximation of Gaussian Free Energy for Risk-Sensitive Reinforcement Learning

no code implementations • NeurIPS 2021 • Grégoire Delétang, Jordi Grau-Moya, Markus Kunesch, Tim Genewein, Rob Brekelmans, Shane Legg, Pedro A. Ortega

Since the Gaussian free energy is known to be a certainty-equivalent sensitive to the mean and the variance, the learning rule has applications in risk-sensitive decision-making.

Decision Making reinforcement-learning +1

Causal Analysis of Agent Behavior for AI Safety

no code implementations • 5 Mar 2021 • Grégoire Delétang, Jordi Grau-Moya, Miljan Martic, Tim Genewein, Tom McGrath, Vladimir Mikulik, Markus Kunesch, Shane Legg, Pedro A. Ortega

As machine learning systems become more powerful they also become increasingly unpredictable and opaque.

BIG-bench Machine Learning

Human-interpretable model explainability on high-dimensional data

no code implementations • 14 Oct 2020 • Damien de Mijolla, Christopher Frye, Markus Kunesch, John Mansir, Ilya Feige

The importance of explainability in machine learning continues to grow, as both neural-network architectures and the data they model become increasingly complex.

Image Classification Image-to-Image Translation +2

