Search Results for author: Mike Preuss

Found 29 papers, 3 papers with code

Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents

1 code implementation • 29 Sep 2023 • Marco Pleines, Matthias Pallasch, Frank Zimmer, Mike Preuss

Memory Gym presents a suite of 2D partially observable environments, namely Mortar Mayhem, Mystery Path, and Searing Spotlights, designed to benchmark memory capabilities in decision-making agents.

Decision Making

Paper
Code

Believable Minecraft Settlements by Means of Decentralised Iterative Planning

no code implementations • 19 Sep 2023 • Arthur van der Staaij, Jelmer Prins, Vincent L. Prins, Julian Poelsma, Thera Smit, Matthias Müller-Brockhausen, Mike Preuss

Procedural city generation that focuses on believability and adaptability to random terrain is a difficult challenge in the field of Procedural Content Generation (PCG).

Paper
Add Code

Models Matter: The Impact of Single-Step Retrosynthesis on Synthesis Planning

no code implementations • 10 Aug 2023 • Paula Torren-Peraire, Alan Kai Hassen, Samuel Genheden, Jonas Verhoeven, Djork-Arne Clevert, Mike Preuss, Igor Tetko

Furthermore, we show that the commonly used single-step retrosynthesis benchmark dataset USPTO-50k is insufficient as this evaluation task does not represent model performance and scalability on larger and more diverse datasets.

Retrosynthesis Single-step retrosynthesis

Paper
Add Code

Two-Memory Reinforcement Learning

no code implementations • 20 Apr 2023 • Zhao Yang, Thomas. M. Moerland, Mike Preuss, Aske Plaat

While deep reinforcement learning has shown important empirical success, it tends to learn relatively slow due to slow propagation of rewards information and slow update of parametric neural networks.

reinforcement-learning Representation Learning +1

Paper
Add Code

Mind the Retrosynthesis Gap: Bridging the divide between Single-step and Multi-step Retrosynthesis Prediction

no code implementations • 12 Dec 2022 • Alan Kai Hassen, Paula Torren-Peraire, Samuel Genheden, Jonas Verhoeven, Mike Preuss, Igor Tetko

Retrosynthesis is the task of breaking down a chemical compound recursively step-by-step into molecular precursors until a set of commercially available molecules is found.

Benchmarking Multi-step retrosynthesis +3

Paper
Add Code

First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation

no code implementations • 6 Dec 2022 • Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat

In this paper, we present a clear ablation study of post-exploration in a general intrinsically motivated goal exploration process (IMGEP) framework, that the Go-Explore paper did not show.

Continuous Control Reinforcement Learning (RL)

Paper
Add Code

Continuous Episodic Control

no code implementations • 28 Nov 2022 • Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat

Therefore, this paper introduces Continuous Episodic Control (CEC), a novel non-parametric episodic memory algorithm for sequential decision making in problems with a continuous action space.

Continuous Control Decision Making +2

Paper
Add Code

Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization

no code implementations • 23 May 2022 • Marco Pleines, Matthias Pallasch, Frank Zimmer, Mike Preuss

At first sight it may seem straightforward to use recurrent layers in Deep Reinforcement Learning algorithms to enable agents to make use of memory in the setting of partially observable environments.

Benchmarking

Paper
Add Code

On the Verge of Solving Rocket League using Deep Reinforcement Learning and Sim-to-sim Transfer

no code implementations • 10 May 2022 • Marco Pleines, Konstantin Ramthun, Yannik Wegener, Hendrik Meyer, Matthias Pallasch, Sebastian Prior, Jannik Drögemüller, Leon Büttinghaus, Thilo Röthemeyer, Alexander Kaschwig, Oliver Chmurzynski, Frederik Rohkrähmer, Roman Kalkreuth, Frank Zimmer, Mike Preuss

Autonomously trained agents that are supposed to play video games reasonably well rely either on fast simulation speeds or heavy parallelization across thousands of machines running concurrently.

Reinforcement Learning (RL)

Paper
Add Code

When to Go, and When to Explore: The Benefit of Post-Exploration in Intrinsic Motivation

no code implementations • 29 Mar 2022 • Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat

Go-Explore achieved breakthrough performance on challenging reinforcement learning (RL) tasks with sparse rewards.

Reinforcement Learning (RL)

Paper
Add Code

Reliable validation of Reinforcement Learning Benchmarks

no code implementations • 2 Mar 2022 • Matthias Müller-Brockhausen, Aske Plaat, Mike Preuss

Reinforcement Learning (RL) is one of the most dynamic research areas in Game AI and AI as a whole, and a wide variety of games are used as its prominent test problems.

Benchmarking Data Compression +3

Paper
Add Code

Potential-based Reward Shaping in Sokoban

no code implementations • 10 Sep 2021 • Zhao Yang, Mike Preuss, Aske Plaat

While previous work has investigated the use of expert knowledge to generate potential functions, in this work, we study whether we can use a search algorithm(A*) to automatically generate a potential function for reward shaping in Sokoban, a well-known planning task.

Paper
Add Code

High-Accuracy Model-Based Reinforcement Learning, a Survey

no code implementations • 17 Jul 2021 • Aske Plaat, Walter Kosters, Mike Preuss

Deep reinforcement learning has shown remarkable success in the past few years.

Decision Making Model-based Reinforcement Learning +4

Paper
Add Code

Procedural Content Generation: Better Benchmarks for Transfer Reinforcement Learning

no code implementations • 31 May 2021 • Matthias Müller-Brockhausen, Mike Preuss, Aske Plaat

We note a surprisingly late adoption of deep learning that starts in 2018.

Benchmarking reinforcement-learning +2

Paper
Add Code

Transfer Learning and Curriculum Learning in Sokoban

no code implementations • 25 May 2021 • Zhao Yang, Mike Preuss, Aske Plaat

In reinforcement learning, learning actions for a behavior policy that can be applied to new environments is still a challenge, especially for tasks that involve much planning.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Adaptive Warm-Start MCTS in AlphaZero-like Deep Reinforcement Learning

no code implementations • 13 May 2021 • Hui Wang, Mike Preuss, Aske Plaat

AlphaZero has achieved impressive performance in deep reinforcement learning by utilizing an architecture that combines search and training of a neural network in self-play.

Board Games reinforcement-learning +1

Paper
Add Code

An Analysis of Phenotypic Diversity in Multi-Solution Optimization

no code implementations • 10 May 2021 • Alexander Hagg, Mike Preuss, Alexander Asteroth, Thomas Bäck

More and more, optimization methods are used to find diverse solution sets.

Multiobjective Optimization

Paper
Add Code

Applications of Artificial Intelligence in Live Action Role-Playing Games (LARP)

no code implementations • 25 Aug 2020 • Christoph Salge, Emily Short, Mike Preuss, Spyridion Samothrakis, Pieter Spronck

Live Action Role-Playing (LARP) games and similar experiences are becoming a popular game genre.

Paper
Add Code

Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey

no code implementations • 11 Aug 2020 • Aske Plaat, Walter Kosters, Mike Preuss

In recent years, many model-based methods have been introduced to address this challenge.

Decision Making Model-based Reinforcement Learning +4

Paper
Add Code

Tackling Morpion Solitaire with AlphaZero-likeRanked Reward Reinforcement Learning

no code implementations • 14 Jun 2020 • Hui Wang, Mike Preuss, Michael Emmerich, Aske Plaat

A later algorithm, Nested Rollout Policy Adaptation, was able to find a new record of 82 steps, albeit with large computational resources.

Game of Go reinforcement-learning +3

Paper
Add Code

Versatile Black-Box Optimization

no code implementations • 29 Apr 2020 • Jialin Liu, Antoine Moreau, Mike Preuss, Baptiste Roziere, Jeremy Rapin, Fabien Teytaud, Olivier Teytaud

Choosing automatically the right algorithm using problem descriptors is a classical component of combinatorial optimization.

Combinatorial Optimization Evolutionary Algorithms

Paper
Add Code

Warm-Start AlphaZero Self-Play Search Enhancements

no code implementations • 26 Apr 2020 • Hui Wang, Mike Preuss, Aske Plaat

Recently, AlphaZero has achieved landmark results in deep reinforcement learning, by providing a single self-play architecture that learned three different games at super human level.

Board Games Evolutionary Algorithms

Paper
Add Code

A New Challenge: Approaching Tetris Link with AI

no code implementations • 1 Apr 2020 • Matthias Muller-Brockhausen, Mike Preuss, Aske Plaat

This paper focuses on a new game, Tetris Link, a board game that is still lacking any scientific analysis.

Paper
Add Code

Obstacle Tower Without Human Demonstrations: How Far a Deep Feed-Forward Network Goes with Reinforcement Learning

1 code implementation • 1 Apr 2020 • Marco Pleines, Jenia Jitsev, Mike Preuss, Frank Zimmer

The Obstacle Tower Challenge is the task to master a procedurally generated chain of levels that subsequently get harder to complete.

Paper
Code

Analysis of Hyper-Parameters for Small Games: Iterations or Epochs in Self-Play?

no code implementations • 12 Mar 2020 • Hui Wang, Michael Emmerich, Mike Preuss, Aske Plaat

A secondary result of our experiments concerns the choice of optimization goals, for which we also provide recommendations.

Paper
Add Code

From Chess and Atari to StarCraft and Beyond: How Game AI is Driving the World of AI

no code implementations • 24 Feb 2020 • Sebastian Risi, Mike Preuss

This paper reviews the field of Game AI, which not only deals with creating agents that can play a certain game, but also with areas as diverse as creating game content automatically, game analytics, or player modelling.

Starcraft

Paper
Add Code

Hyper-Parameter Sweep on AlphaZero General

1 code implementation • 19 Mar 2019 • Hui Wang, Michael Emmerich, Mike Preuss, Aske Plaat

Therefore, in this paper, we choose 12 parameters in AlphaZero and evaluate how these parameters contribute to training.

Game of Go

230

Paper
Code

Learning to Plan Chemical Syntheses

no code implementations • 14 Aug 2017 • Marwin H. S. Segler, Mike Preuss, Mark P. Waller

We anticipate that our method will accelerate drug and materials discovery by assisting chemists to plan better syntheses faster, and by enabling fully automated robot synthesis.

Retrosynthesis

Paper
Add Code

The True Destination of EGO is Multi-local Optimization

no code implementations • 19 Apr 2017 • Simon Wessing, Mike Preuss

Efficient global optimization is a popular algorithm for the optimization of expensive multimodal black-box functions.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.