Search Results for author: Christian Gumbsch

Found 7 papers, 2 papers with code

Intelligent problem-solving as integrated hierarchical reinforcement learning

no code implementations18 Aug 2022 Manfred Eppe, Christian Gumbsch, Matthias Kerzel, Phuong D. H. Nguyen, Martin V. Butz, Stefan Wermter

According to cognitive psychology and related disciplines, the development of complex problem-solving behaviour in biological agents depends on hierarchical cognitive mechanisms.

Hierarchical Reinforcement Learning reinforcement-learning +1

Inference of Affordances and Active Motor Control in Simulated Agents

no code implementations23 Feb 2022 Fedor Scholz, Christian Gumbsch, Sebastian Otte, Martin V. Butz

We show that our architecture, which is trained end-to-end to minimize an approximation of free energy, develops latent states that can be interpreted as affordance maps.

Zero-shot Generalization

Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains

1 code implementation NeurIPS 2021 Christian Gumbsch, Martin V. Butz, Georg Martius

A common approach to prediction and planning in partially observable domains is to use recurrent neural networks (RNNs), which ideally develop and maintain a latent memory about hidden, task-relevant factors.

Inductive Bias

Hierarchical principles of embodied reinforcement learning: A review

no code implementations18 Dec 2020 Manfred Eppe, Christian Gumbsch, Matthias Kerzel, Phuong D. H. Nguyen, Martin V. Butz, Stefan Wermter

We then relate these insights with contemporary hierarchical reinforcement learning methods, and identify the key machine intelligence approaches that realise these mechanisms.

Hierarchical Reinforcement Learning reinforcement-learning +1

Autonomous Identification and Goal-Directed Invocation of Event-Predictive Behavioral Primitives

no code implementations26 Feb 2019 Christian Gumbsch, Martin V. Butz, Georg Martius

Here, we introduce a computational learning architecture, termed surprise-based behavioral modularization into event-predictive structures (SUBMODES), that explores behavior and identifies the underlying behavioral units completely from scratch.

Cannot find the paper you are looking for? You can Submit a new open access paper.