Search Results for author: Rinu Boney

Found 9 papers, 5 papers with code

Simplified Temporal Consistency Reinforcement Learning

1 code implementation15 Jun 2023 Yi Zhao, Wenshuai Zhao, Rinu Boney, Juho Kannala, Joni Pajarinen

This applies when using pure planning with a dynamics model conditioned on the representation, but, also when utilizing the representation as policy and value function features in model-free RL.

Decision Making reinforcement-learning +2

Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning

2 code implementations25 Oct 2022 Yi Zhao, Rinu Boney, Alexander Ilin, Juho Kannala, Joni Pajarinen

Offline reinforcement learning, by learning from a fixed dataset, makes it possible to learn agent behaviors without interacting with the environment.

D4RL Offline RL +2

Learning of feature points without additional supervision improves reinforcement learning from images

2 code implementations15 Jun 2021 Rinu Boney, Alexander Ilin, Juho Kannala

In many control problems that include vision, optimal controls can be inferred from the location of the objects in the scene.

Continuous Control reinforcement-learning +2

Learning to Play Imperfect-Information Games by Imitating an Oracle Planner

1 code implementation22 Dec 2020 Rinu Boney, Alexander Ilin, Juho Kannala, Jarno Seppänen

We experimentally show that planning with naive Monte Carlo tree search does not perform very well in large combinatorial action spaces.

Thompson Sampling

Regularizing Model-Based Planning with Energy-Based Models

no code implementations12 Oct 2019 Rinu Boney, Juho Kannala, Alexander Ilin

Model-based reinforcement learning could enable sample-efficient learning by quickly acquiring rich knowledge about the world and using it to improve behaviour without additional data.

Continuous Control Model-based Reinforcement Learning

Semi-Supervised and Active Few-Shot Learning with Prototypical Networks

no code implementations29 Nov 2017 Rinu Boney, Alexander Ilin

We consider the problem of semi-supervised few-shot classification where a classifier needs to adapt to new tasks using a few labeled examples and (potentially many) unlabeled examples.

Clustering Few-Shot Learning +1

Recurrent Ladder Networks

no code implementations NeurIPS 2017 Isabeau Prémont-Schwarz, Alexander Ilin, Tele Hotloo Hao, Antti Rasmus, Rinu Boney, Harri Valpola

We propose a recurrent extension of the Ladder networks whose structure is motivated by the inference required in hierarchical latent variable models.

Music Modeling

Cannot find the paper you are looking for? You can Submit a new open access paper.