Search Results for author: Leo Ardon

Found 7 papers, 1 papers with code

On the Sample Efficiency of Abstractions and Potential-Based Reward Shaping in Reinforcement Learning

no code implementations11 Apr 2024 Giuseppe Canonaco, Leo Ardon, Alberto Pozanco, Daniel Borrajo

The use of Potential Based Reward Shaping (PBRS) has shown great promise in the ongoing research effort to tackle sample inefficiency in Reinforcement Learning (RL).

Reinforcement Learning (RL)

Learning Reward Machines in Cooperative Multi-Agent Tasks

no code implementations24 Mar 2023 Leo Ardon, Daniel Furelos-Blanco, Alessandra Russo

This paper presents a novel approach to Multi-Agent Reinforcement Learning (MARL) that combines cooperative task decomposition with the learning of reward machines (RMs) encoding the structure of the sub-tasks.

Multi-agent Reinforcement Learning

Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning

no code implementations28 Nov 2022 Leo Ardon, Alberto Pozanco, Daniel Borrajo, Sumitra Ganesh

Knowing this information can help reduce the sample complexity of RL algorithms by masking the inapplicable actions from the policy distribution to only explore actions relevant to finding an optimal policy.

reinforcement-learning Reinforcement Learning (RL) +1

Phantom -- A RL-driven multi-agent framework to model complex systems

1 code implementation12 Oct 2022 Leo Ardon, Jared Vann, Deepeka Garg, Tom Spooner, Sumitra Ganesh

Agent based modelling (ABM) is a computational approach to modelling complex systems by specifying the behaviour of autonomous decision-making components or agents in the system and allowing the system dynamics to emerge from their interactions.

Decision Making Multi-agent Reinforcement Learning

Reinforcement Learning to Solve NP-hard Problems: an Application to the CVRP

no code implementations14 Jan 2022 Leo Ardon

In this paper, we evaluate the use of Reinforcement Learning (RL) to solve a classic combinatorial optimization problem: the Capacitated Vehicle Routing Problem (CVRP).

Combinatorial Optimization reinforcement-learning +2

Towards a fully RL-based Market Simulator

no code implementations13 Oct 2021 Leo Ardon, Nelson Vadori, Thomas Spooner, Mengda Xu, Jared Vann, Sumitra Ganesh

We present a new financial framework where two families of RL-based agents representing the Liquidity Providers and Liquidity Takers learn simultaneously to satisfy their objective.

Cannot find the paper you are looking for? You can Submit a new open access paper.