Search Results for author: Safa Alver

Found 7 papers, 2 papers with code

Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning

1 code implementation30 Sep 2023 Mingde Zhao, Safa Alver, Harm van Seijen, Romain Laroche, Doina Precup, Yoshua Bengio

Inspired by human conscious planning, we propose Skipper, a model-based reinforcement learning framework utilizing spatio-temporal abstractions to generalize better in novel situations.

Decision Making Model-based Reinforcement Learning +2

Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning

no code implementations24 Jan 2023 Safa Alver, Doina Precup

Learning models of the environment from pure interaction is often considered an essential component of building lifelong reinforcement learning agents.

Model-based Reinforcement Learning reinforcement-learning +1

Understanding Decision-Time vs. Background Planning in Model-Based Reinforcement Learning

no code implementations16 Jun 2022 Safa Alver, Doina Precup

After viewing them through the lens of dynamic programming, we first consider the classical instantiations of these planning styles and provide theoretical results and hypotheses on which one will perform better in the pure planning, planning & learning, and transfer learning settings.

Model-based Reinforcement Learning reinforcement-learning +2

Constructing a Good Behavior Basis for Transfer using Generalized Policy Updates

no code implementations ICLR 2022 Safa Alver, Doina Precup

We study the problem of learning a good set of policies, so that when combined together, they can solve a wide variety of unseen reinforcement learning tasks with no or very little new data.

reinforcement-learning Reinforcement Learning (RL)

What is Going on Inside Recurrent Meta Reinforcement Learning Agents?

no code implementations29 Apr 2021 Safa Alver, Doina Precup

Recurrent meta reinforcement learning (meta-RL) agents are agents that employ a recurrent neural network (RNN) for the purpose of "learning a learning algorithm".

Meta Reinforcement Learning reinforcement-learning +1

A Brief Look at Generalization in Visual Meta-Reinforcement Learning

no code implementations ICML Workshop LifelongML 2020 Safa Alver, Doina Precup

Due to the realization that deep reinforcement learning algorithms trained on high-dimensional tasks can strongly overfit to their training environments, there have been several studies that investigated the generalization performance of these algorithms.

Meta Reinforcement Learning reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.