Search Results for author: Joni Pajarinen

Found 39 papers, 15 papers with code

Function-space Parameterization of Neural Networks for Sequential Learning

1 code implementation16 Mar 2024 Aidan Scannell, Riccardo Mereu, Paul Chang, Ella Tamir, Joni Pajarinen, Arno Solin

Our parameterization offers: (i) a way to scale function-space methods to large data sets via sparsification, (ii) retention of prior knowledge when access to past data is limited, and (iii) a mechanism to incorporate new data without retraining.

Continual Learning Gaussian Processes +1

AgentMixer: Multi-Agent Correlated Policy Factorization

no code implementations16 Jan 2024 Zhiyuan Li, Wenshuai Zhao, Lijun Wu, Joni Pajarinen

Inspired by the concept of correlated equilibrium, we propose to introduce a \textit{strategy modification} to provide a mechanism for agents to correlate their policies.

Imitation Learning Multi-agent Reinforcement Learning

Optimistic Multi-Agent Policy Gradient for Cooperative Tasks

1 code implementation3 Nov 2023 Wenshuai Zhao, Yi Zhao, Zhiyuan Li, Juho Kannala, Joni Pajarinen

However, with function approximation optimism can amplify overestimation and thus fail on complex tasks.

Q-Learning

On the Benefit of Optimal Transport for Curriculum Reinforcement Learning

no code implementations25 Sep 2023 Pascal Klink, Carlo D'Eramo, Jan Peters, Joni Pajarinen

In this work, we focus on framing curricula as interpolations between task distributions, which has previously been shown to be a viable approach to CRL.

reinforcement-learning

Monte-Carlo tree search with uncertainty propagation via optimal transport

no code implementations19 Sep 2023 Tuan Dam, Pascal Stenger, Lukas Schneider, Joni Pajarinen, Carlo D'Eramo, Odalric-Ambrym Maillard

We introduce a novel backup operator that computes value nodes as the Wasserstein barycenter of their action-value children nodes; thus, propagating the uncertainty of the estimate across the tree to the root node.

Thompson Sampling

Sparse Function-space Representation of Neural Networks

2 code implementations5 Sep 2023 Aidan Scannell, Riccardo Mereu, Paul Chang, Ella Tamir, Joni Pajarinen, Arno Solin

Deep neural networks (NNs) are known to lack uncertainty estimates and struggle to incorporate new data.

Towards Energy Efficient Control for Commercial Heavy-Duty Mobile Cranes: Modeling Hydraulic Pressures using Machine Learning

no code implementations31 Jul 2023 Abdolreza Taheri, Robert Pettersson, Pelle Gustafsson, Joni Pajarinen, Reza Ghabcheloo

A sizable part of the fleet of heavy-duty machinery in the construction equipment industry uses the conventional valve-controlled load-sensing hydraulics.

Simplified Temporal Consistency Reinforcement Learning

1 code implementation15 Jun 2023 Yi Zhao, Wenshuai Zhao, Rinu Boney, Juho Kannala, Joni Pajarinen

This applies when using pure planning with a dynamics model conditioned on the representation, but, also when utilizing the representation as policy and value function features in model-free RL.

Decision Making reinforcement-learning +2

Swapped goal-conditioned offline reinforcement learning

1 code implementation17 Feb 2023 Wenyan Yang, Huiling Wang, Dingding Cai, Joni Pajarinen, Joni-Kristen Kämäräinen

Offline goal-conditioned reinforcement learning (GCRL) can be challenging due to overfitting to the given dataset.

Offline RL reinforcement-learning +1

Prioritized offline Goal-swapping Experience Replay

no code implementations15 Feb 2023 Wenyan Yang, Joni Pajarinen, Dinging Cai, Joni Kämäräinen

In goal-conditioned offline reinforcement learning, an agent learns from previously collected data to go to an arbitrary goal.

reinforcement-learning Reinforcement Learning (RL)

Hierarchical Imitation Learning with Vector Quantized Models

1 code implementation30 Jan 2023 Kalle Kujanpää, Joni Pajarinen, Alexander Ilin

The ability to plan actions on multiple levels of abstraction enables intelligent agents to solve complex tasks effectively.

Imitation Learning

Redeeming Intrinsic Rewards via Constrained Optimization

1 code implementation14 Nov 2022 Eric Chen, Zhang-Wei Hong, Joni Pajarinen, Pulkit Agrawal

However, on easy exploration tasks, the agent gets distracted by intrinsic rewards and performs unnecessary exploration even when sufficient task (also called extrinsic) reward is available.

Montezuma's Revenge Reinforcement Learning (RL)

Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning

2 code implementations25 Oct 2022 Yi Zhao, Rinu Boney, Alexander Ilin, Juho Kannala, Joni Pajarinen

Offline reinforcement learning, by learning from a fixed dataset, makes it possible to learn agent behaviors without interacting with the environment.

D4RL Offline RL +2

Continuous Monte Carlo Graph Search

1 code implementation4 Oct 2022 Kalle Kujanpää, Amin Babadi, Yi Zhao, Juho Kannala, Alexander Ilin, Joni Pajarinen

To address this problem, we propose Continuous Monte Carlo Graph Search (CMCGS), an extension of MCTS to online planning in environments with continuous state and action spaces.

Continuous Control Decision Making

Partially Observable Markov Decision Processes in Robotics: A Survey

no code implementations21 Sep 2022 Mikko Lauri, David Hsu, Joni Pajarinen

Noisy sensing, imperfect control, and environment changes are defining characteristics of many real-world robot tasks.

Autonomous Driving

Learning Progress Driven Multi-Agent Curriculum

no code implementations20 May 2022 Wenshuai Zhao, Zhiyuan Li, Joni Pajarinen

Inspired by the success of CRL in single-agent settings, a few works have attempted to apply CRL to multi-agent reinforcement learning (MARL) using the number of agents to control task difficulty.

Multi-agent Reinforcement Learning Open-Ended Question Answering +3

Topological Experience Replay

1 code implementation ICLR 2022 Zhang-Wei Hong, Tao Chen, Yen-Chen Lin, Joni Pajarinen, Pulkit Agrawal

State-of-the-art deep Q-learning methods update Q-values using state transition tuples sampled from the experience replay buffer.

Q-Learning

GPU-Accelerated Policy Optimization via Batch Automatic Differentiation of Gaussian Processes for Real-World Control

no code implementations28 Feb 2022 Abdolreza Taheri, Joni Pajarinen, Reza Ghabcheloo

The ability of Gaussian processes (GPs) to predict the behavior of dynamical systems as a more sample-efficient alternative to parametric models seems promising for real-world robotics research.

Gaussian Processes

A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search

no code implementations11 Feb 2022 Tuan Dam, Carlo D'Eramo, Jan Peters, Joni Pajarinen

In this work, we propose two methods for improving the convergence rate and exploration based on a newly introduced backup operator and entropy regularization.

Atari Games Decision Making +2

State-Conditioned Adversarial Subgoal Generation

no code implementations24 Jan 2022 Vivienne Huiling Wang, Joni Pajarinen, Tinghuai Wang, Joni-Kristian Kämäräinen

Hierarchical reinforcement learning (HRL) proposes to solve difficult tasks by performing decision-making and control at successively higher levels of temporal abstraction.

Continuous Control Decision Making +3

Metrics Matter: A Closer Look on Self-Paced Reinforcement Learning

no code implementations29 Sep 2021 Pascal Klink, Haoyi Yang, Jan Peters, Joni Pajarinen

Experiments demonstrate that the resulting introduction of metric structure into the curriculum allows for a well-behaving non-parametric version of SPRL that leads to stable learning performance across tasks.

reinforcement-learning Reinforcement Learning (RL)

Boosted Curriculum Reinforcement Learning

no code implementations ICLR 2022 Pascal Klink, Carlo D'Eramo, Jan Peters, Joni Pajarinen

This approach, which we refer to as boosted curriculum reinforcement learning (BCRL), has the benefit of naturally increasing the representativeness of the functional space by adding a new residual each time a new task is presented.

reinforcement-learning Reinforcement Learning (RL)

Reinforcement Learning using Guided Observability

no code implementations22 Apr 2021 Stephan Weigand, Pascal Klink, Jan Peters, Joni Pajarinen

Due to recent breakthroughs, reinforcement learning (RL) has demonstrated impressive performance in challenging sequential decision-making problems.

Decision Making OpenAI Gym +3

Neural Network Controller for Autonomous Pile Loading Revised

no code implementations23 Mar 2021 Wenyan Yang, Nataliya Strokina, Nikolay Serbenyuk, Joni Pajarinen, Reza Ghabcheloo, Juho Vihonen, Mohammad M. Aref, Joni-Kristian Kämäräinen

We have recently proposed two pile loading controllers that learn from human demonstrations: a neural network (NNet) [1] and a random forest (RF) controller [2].

A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning

1 code implementation25 Feb 2021 Pascal Klink, Hany Abdulsamad, Boris Belousov, Carlo D'Eramo, Jan Peters, Joni Pajarinen

Across machine learning, the use of curricula has shown strong empirical potential to improve learning from data by avoiding local optima of training objectives.

reinforcement-learning Reinforcement Learning (RL)

Technical Report: The Policy Graph Improvement Algorithm

no code implementations4 Sep 2020 Joni Pajarinen

The policy graph improvement (PGI) algorithm for POMDPs represents the policy as a fixed size policy graph and improves the policy monotonically.

Multi-Sensor Next-Best-View Planning as Matroid-Constrained Submodular Maximization

no code implementations4 Jul 2020 Mikko Lauri, Joni Pajarinen, Jan Peters, Simone Frintrop

We consider the problem of creating a 3D model using depth images captured by a team of multiple robots.

Convex Regularization in Monte-Carlo Tree Search

no code implementations1 Jul 2020 Tuan Dam, Carlo D'Eramo, Jan Peters, Joni Pajarinen

Monte-Carlo planning and Reinforcement Learning (RL) are essential to sequential decision making.

Atari Games Decision Making +1

Machine Learning Based Mobile Network Throughput Classification

no code implementations27 Apr 2020 Lauri Alho, Adrian Burian, Janne Helenius, Joni Pajarinen

Identifying mobile network problems in 4G cells is more challenging when the complexity of the network increases, and privacy concerns limit the information content of the data.

BIG-bench Machine Learning Classification +2

Self-Paced Deep Reinforcement Learning

1 code implementation NeurIPS 2020 Pascal Klink, Carlo D'Eramo, Jan Peters, Joni Pajarinen

Curriculum reinforcement learning (CRL) improves the learning speed and stability of an agent by exposing it to a tailored series of tasks throughout learning.

Open-Ended Question Answering reinforcement-learning +1

Deep Adversarial Reinforcement Learning for Object Disentangling

no code implementations8 Mar 2020 Melvin Laux, Oleg Arenz, Jan Peters, Joni Pajarinen

The ARL framework utilizes an adversary, which is trained to steer the original agent, the protagonist, to challenging states.

Object reinforcement-learning +1

Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning

1 code implementation1 Jan 2020 Simone Parisi, Davide Tateo, Maximilian Hensel, Carlo D'Eramo, Jan Peters, Joni Pajarinen

Empirical results on classic and novel benchmarks show that the proposed approach outperforms existing methods in environments with sparse rewards, especially in the presence of rewards that create suboptimal modes of the objective function.

Benchmarking reinforcement-learning +1

Generalized Mean Estimation in Monte-Carlo Tree Search

no code implementations1 Nov 2019 Tuan Dam, Pascal Klink, Carlo D'Eramo, Jan Peters, Joni Pajarinen

Finally, we empirically demonstrate the effectiveness of our method in well-known MDP and POMDP benchmarks, showing significant improvement in performance and convergence speed w. r. t.

Model-based Lookahead Reinforcement Learning

no code implementations15 Aug 2019 Zhang-Wei Hong, Joni Pajarinen, Jan Peters

Model-based Reinforcement Learning (MBRL) allows data-efficient learning which is required in real world applications such as robotics.

Continuous Control Model-based Reinforcement Learning +3

Information Gathering in Decentralized POMDPs by Policy Graph Improvement

1 code implementation26 Feb 2019 Mikko Lauri, Joni Pajarinen, Jan Peters

Decentralized policies for information gathering are required when multiple autonomous agents are deployed to collect data about a phenomenon of interest without the ability to communicate.

Decision Making

An Algorithmic Perspective on Imitation Learning

no code implementations16 Nov 2018 Takayuki Osa, Joni Pajarinen, Gerhard Neumann, J. Andrew Bagnell, Pieter Abbeel, Jan Peters

This process of learning from demonstrations, and the study of algorithms to do so, is called imitation learning.

Imitation Learning Learning Theory

Cannot find the paper you are looking for? You can Submit a new open access paper.