no code implementations • 9 Feb 2024 • Jasmina Gajcin, Ivana Dusparic
In this work, we propose ACTER (Actionable Counterfactual Sequences for Explaining Reinforcement Learning Outcomes), an algorithm for generating counterfactual sequences that provides actionable advice on how failure can be avoided.
1 code implementation • 30 Aug 2023 • Jasmina Gajcin, James McCarthy, Rahul Nair, Radu Marinescu, Elizabeth Daly, Ivana Dusparic
Our approach allows the user to provide trajectory-level feedback on agent's behavior during training, which can be integrated as a reward shaping signal in the following training iteration.
no code implementations • 14 Jun 2023 • Babatunji Omoniwa, Boris Galkin, Ivana Dusparic
Unmanned aerial vehicles (UAVs) serving as aerial base stations can be deployed to provide wireless connectivity to mobile users, such as vehicles.
no code implementations • 15 Mar 2023 • Jean-Baptiste Monteil, George Iosifidis, Ivana Dusparic
The different service providers (SPs) have the opportunity to lease the network resources from the NO to constitute slices that address the demand of their specific network service.
1 code implementation • 8 Mar 2023 • Jasmina Gajcin, Ivana Dusparic
In this work, we propose RACCER, the first RL-specific approach to generating counterfactual explanations for the behavior of RL agents.
no code implementations • 2 Mar 2023 • Alberto Castagna, Ivana Dusparic
As an alternative, in this paper we propose Expert-Free Online Transfer Learning (EF-OnTL), an algorithm that enables expert-free real-time dynamic transfer learning in multi-agent system.
Multi-agent Reinforcement Learning reinforcement-learning +2
1 code implementation • 9 Nov 2022 • Jernej Hribar, Luke Hackett, Ivana Dusparic
In this paper, we build on advances introduced by the Deep Q-Networks (DQN) approach to extend the multi-objective tabular Reinforcement Learning (RL) algorithm W-learning to large state spaces.
2 code implementations • 2 Nov 2022 • Tom He, Jasmina Gajcin, Ivana Dusparic
We apply CausalCF to complex robotic tasks and show that it improves the RL agent's robustness using CausalWorld.
no code implementations • 21 Oct 2022 • Jasmina Gajcin, Ivana Dusparic
Additionally, we explore the differences between counterfactual explanations in supervised learning and RL and identify the main challenges that prevent the adoption of methods from supervised in reinforcement learning.
no code implementations • 18 Jul 2022 • James McCarthy, Rahul Nair, Elizabeth Daly, Radu Marinescu, Ivana Dusparic
Explainability of Reinforcement Learning (RL) policies remains a challenging research problem, particularly when considering RL in a safety context.
no code implementations • 23 May 2022 • Helio N. Cunha Neto, Ivana Dusparic, Diogo M. F. Mattos, Natalia C. Fernandes
This paper proposes the Federated Simulated Annealing (FedSA) metaheuristic to select the hyperparameters and a subset of participants for each aggregation round in federated learning.
no code implementations • 4 Apr 2022 • Babatunji Omoniwa, Boris Galkin, Ivana Dusparic
In this letter, we study the energy efficiency (EE) optimisation of unmanned aerial vehicles (UAVs) providing wireless coverage to static and mobile ground users.
Multi-agent Reinforcement Learning reinforcement-learning +1
1 code implementation • 21 Mar 2022 • Jasmina Gajcin, Ivana Dusparic
We propose ReCCoVER, an algorithm which detects causal confusion in agent's reasoning before deployment, by executing its policy in alternative environments where certain correlations between features do not hold.
no code implementations • 17 Dec 2021 • Jasmina Gajcin, Rahul Nair, Tejaswini Pedapati, Radu Marinescu, Elizabeth Daly, Ivana Dusparic
In complex tasks where the reward function is not straightforward and consists of a set of objectives, multiple reinforcement learning (RL) policies that perform task adequately, but employ different strategies can be trained by adjusting the impact of individual objectives on reward function.
no code implementations • 1 Dec 2021 • Alberto Castagna, Ivana Dusparic
Reinforcement learning (RL) has been used in a range of simulated real-world tasks, e. g., sensor coordination, traffic light control, and on-demand mobility services.
no code implementations • 3 Nov 2021 • Boris Galkin, Babatunji Omoniwa, Ivana Dusparic
In this paper, we propose a multi-agent deep reinforcement learning approach to optimise the energy efficiency of fixed-wing UAV cellular access points while still allowing them to deliver high-quality service to users on the ground.
no code implementations • 1 Jun 2021 • Babatunji Omoniwa, Maxime Gueriau, Ivana Dusparic
As such, power-control mechanisms and intelligent mobility of the relay devices are critical in improving communication performance and energy utilization.
no code implementations • 1 Jun 2021 • Babatunji Omoniwa, Boris Galkin, Ivana Dusparic
Unmanned aerial vehicles serving as aerial base stations (UAV-BSs) can be deployed to provide wireless connectivity to ground devices in events of increased network demand, points-of-failure in existing infrastructure, or disasters.
no code implementations • 11 Mar 2021 • Nicolás Cardozo, Ivana Dusparic
Options are explored in interaction with the environment, and the most suitable options for each context are used to generate adaptations exploiting COP.
no code implementations • 11 Mar 2021 • Ivana Dusparic, Nicolas Cardozo
Capturing all possible changes to define suitable behaviour beforehand is unfeasible, or even impossible in the case of unknown changes, hence human intervention may be required.
no code implementations • 25 Feb 2021 • Erika Fonseca, Boris Galkin, Marvin Kelly, Luiz A. DaSilva, Ivana Dusparic
Unmanned Aerial Vehicle (UAV) technology is becoming more prevalent and more diverse in its application.
Networking and Internet Architecture
no code implementations • 6 Nov 2020 • Boris Galkin, Erika Fonseca, Gavin Lee, Conor Duff, Marvin Kelly, Edward Emmanuel, Ivana Dusparic
Our results show that increasing the UAV height reduces the performance in both tiers, due to issues such as antenna misalignment.
Networking and Internet Architecture
no code implementations • 27 Jul 2020 • Erika Fonseca, Boris Galkin, Ramy Amer, Luiz A. DaSilva, Ivana Dusparic
On the other hand, BS density can negatively impact UAV QoS, with higher numbers of BSs generating more interference and deteriorating UAV performance.
no code implementations • 8 Oct 2018 • Amit Prasad, Ivana Dusparic
This paper addresses the problem of energy sharing in such a community.
no code implementations • 16 Sep 2014 • Andrei Marinescu, Ivana Dusparic, Adam Taylor, Vinny Cahill, Siobhán Clarke
In this paper we propose P-MARL, a decentralised MARL algorithm enhanced by a prediction mechanism that provides accurate information regarding up-coming changes in the environment.
Multiagent Systems