Search Results for author: Alessandro Abate

Found 58 papers, 21 papers with code

Data-driven Interval MDP for Robust Control Synthesis

no code implementations • 12 Apr 2024 • Rudi Coppola, Andrea Peruffo, Licio Romao, Alessandro Abate, Manuel Mazo Jr.

The abstraction of dynamical systems is a powerful tool that enables the design of feedback controllers using a correct-by-design framework.

A Stability-Based Abstraction Framework for Reach-Avoid Control of Stochastic Dynamical Systems with Unknown Noise Distributions

no code implementations • 2 Apr 2024 • Thom Badings, Licio Romao, Alessandro Abate, Nils Jansen

To address this issue, we propose a novel abstraction scheme for stochastic linear systems that exploits the system's stability to obtain significantly smaller abstract models.

Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification

no code implementations • 11 Mar 2024 • Joar Skalse, Alessandro Abate

In addition to this, we also characterise the conditions under which a behavioural model is robust to small perturbations of the observed policy, and we analyse how robust many behavioural models are to misspecification of their parameter values (such as the discount rate).

reinforcement-learning

Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

1 code implementation • 29 Jan 2024 • Alexandros E. Tzikas, Licio Romao, Mert Pilanci, Alessandro Abate, Mykel J. Kochenderfer

Many machine learning applications require operating on a spatially distributed dataset.

Bayesian Inference • Distributed Optimization • +1

On the Limitations of Markovian Rewards to Express Multi-Objective, Risk-Sensitive, and Modal Tasks

no code implementations • 26 Jan 2024 • Joar Skalse, Alessandro Abate

Moreover, we find that scalar, Markovian rewards are unable to express most of the instances in each of these three classes.

Reinforcement Learning (RL)

Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis

no code implementations • 18 Dec 2023 • Rohan Mitta, Hosein Hasanbeig, Jun Wang, Daniel Kroening, Yiannis Kantaros, Alessandro Abate

This paper addresses the problem of maintaining safety during training in Reinforcement Learning (RL), such that the safety constraint violations are bounded at any point during learning.

Bayesian Inference • Reinforcement Learning (RL)

Learning Robust Policies for Uncertain Parametric Markov Decision Processes

no code implementations • 11 Dec 2023 • Luke Rickard, Alessandro Abate, Kostas Margellos

Synthesising verifiably correct controllers for dynamical systems is crucial for safety-critical problems.

Fossil 2.0: Formal Certificate Synthesis for the Verification and Control of Dynamical Models

no code implementations • 16 Nov 2023 • Alec Edwards, Andrea Peruffo, Alessandro Abate

This paper presents Fossil 2.0, a new major release of a software tool for the synthesis of certificates (e.g., Lyapunov and barrier functions) for dynamical systems modelled as ordinary differential and difference equations.

Correct-by-Construction Control for Stochastic and Uncertain Dynamical Models via Formal Abstractions

no code implementations • 16 Nov 2023 • Thom Badings, Nils Jansen, Licio Romao, Alessandro Abate

Such autonomous systems are naturally modeled as stochastic dynamical models.

Probabilistic Reach-Avoid for Bayesian Neural Networks

1 code implementation • 3 Oct 2023 • Matthew Wicker, Luca Laurenti, Andrea Patane, Nicola Paoletti, Alessandro Abate, Marta Kwiatkowska

Such computed lower bounds provide safety certification for the given policy and BNN model.

Model-based Reinforcement Learning

STARC: A General Framework For Quantifying Differences Between Reward Functions

no code implementations • 26 Sep 2023 • Joar Skalse, Lucy Farnik, Sumeet Ramesh Motwani, Erik Jenner, Adam Gleave, Alessandro Abate

This means that reward learning algorithms generally must be evaluated empirically, which is expensive, and that their failure modes are difficult to anticipate in advance.

A General Verification Framework for Dynamical and Control Models via Certificate Synthesis

no code implementations • 12 Sep 2023 • Alec Edwards, Andrea Peruffo, Alessandro Abate

An emerging branch of control theory specialises in certificate learning, concerning the specification of a desired (possibly complex) system behaviour for an autonomous or control model, which is then analytically verified by means of a function-based proof.

On the Trade-off Between Efficiency and Precision of Neural Abstraction

no code implementations • 28 Jul 2023 • Alec Edwards, Mirco Giacobbe, Alessandro Abate

Neural abstractions have been recently introduced as formal approximations of complex, nonlinear dynamical models.

On Imperfect Recall in Multi-Agent Influence Diagrams

no code implementations • 11 Jul 2023 • James Fox, Matt MacDermott, Lewis Hammond, Paul Harrenstein, Alessandro Abate, Michael Wooldridge

Multi-agent influence diagrams (MAIDs) are a popular game-theoretic model based on Bayesian networks.

An Exact Characterisation of Flexibility in Populations of Electric Vehicles

no code implementations • 29 Jun 2023 • Karan Mukhi, Alessandro Abate

The flexibility of an individual EV can be quantified as a convex polytope and the flexibility of a population of EVs is the Minkowski sum of these polytopes.

Networked Communication for Decentralised Agents in Mean-Field Games

no code implementations • 5 Jun 2023 • Patrick Benjamin, Alessandro Abate

We introduce networked communication to the mean-field game framework, in particular to oracle-free settings where $N$ decentralised agents learn along a single, non-episodic evolution path of the empirical system.

Abstracting Linear Stochastic Systems via Knowledge Filtering

no code implementations • 12 Apr 2023 • Maico Hendrikus Wilhelmus Engelaar, Licio Romao, Yulong Gao, Mircea Lazar, Alessandro Abate, Sofie Haesaert

In this paper, we propose a new model reduction technique for linear stochastic systems that builds upon knowledge filtering and utilizes optimal Kalman filtering techniques.

Inner approximations of stochastic programs for data-driven stochastic barrier function design

no code implementations • 10 Apr 2023 • Frederik Baymler Mathiesen, Licio Romao, Simeon C. Calvert, Alessandro Abate, Luca Laurenti

In particular, we show that the stochastic program to synthesize a SBF can be relaxed into a chance-constrained optimisation problem on which scenario approach theory applies.

Distributionally Robust Optimal and Safe Control of Stochastic Systems via Kernel Conditional Mean Embedding

no code implementations • 2 Apr 2023 • Licio Romao, Ashish R. Hota, Alessandro Abate

We present a novel distributionally robust framework for dynamic programming that uses kernel methods to design feedback control policies.

Data-driven abstractions via adaptive refinements and a Kantorovich metric [extended version]

no code implementations • 30 Mar 2023 • Adrien Banse, Licio Romao, Alessandro Abate, Raphaël M. Jungers

In order to learn the optimal structure, we define a Kantorovich-inspired metric between Markov chains, and we use it as a loss function.

Policy Evaluation in Distributional LQR

no code implementations • 23 Mar 2023 • Zifan Wang, Yulong Gao, Siyi Wang, Michael M. Zavlanos, Alessandro Abate, Karl H. Johansson

Distributional reinforcement learning (DRL) enhances the understanding of the effects of the randomness in the environment by letting agents learn the distribution of a random return, rather than its expected value as in standard RL.

Distributional Reinforcement Learning

Neural Abstractions

1 code implementation • 27 Jan 2023 • Alessandro Abate, Alec Edwards, Mirco Giacobbe

We present a novel method for the safety verification of nonlinear dynamical models that uses neural networks to represent abstractions of their dynamics.

Reasoning about Causality in Games

no code implementations • 5 Jan 2023 • Lewis Hammond, James Fox, Tom Everitt, Ryan Carey, Alessandro Abate, Michael Wooldridge

Regarding question iii), we describe correspondences between causal games and other formalisms, and explain how causal games can be used to answer queries that other causal or game-theoretic models do not support.

Robust Control for Dynamical Systems With Non-Gaussian Noise via Formal Abstractions

1 code implementation • 4 Jan 2023 • Thom Badings, Licio Romao, Alessandro Abate, David Parker, Hasan A. Poonawala, Marielle Stoelinga, Nils Jansen

This iMDP is, with a user-specified confidence probability, robust against uncertainty in the transition probabilities, and the tightness of the probability intervals can be controlled through the number of samples.

Continuous Control

Lexicographic Multi-Objective Reinforcement Learning

1 code implementation • 28 Dec 2022 • Joar Skalse, Lewis Hammond, Charlie Griffin, Alessandro Abate

In this work we introduce reinforcement learning techniques for solving lexicographic multi-objective problems.

Multi-Objective Reinforcement Learning • reinforcement-learning

Misspecification in Inverse Reinforcement Learning

no code implementations • 6 Dec 2022 • Joar Skalse, Alessandro Abate

In this paper, we provide a mathematical analysis of how robust different IRL models are to misspecification, and answer precisely how the demonstrator policy may differ from each of the standard models before that model leads to faulty inferences about the reward function $R$.

reinforcement-learning • Reinforcement Learning (RL)

Data-driven memory-dependent abstractions of dynamical systems

no code implementations • 4 Dec 2022 • Adrien Banse, Licio Romao, Alessandro Abate, Raphaël M. Jungers

We propose a sample-based, sequential method to abstract a (potentially black-box) dynamical system with a sequence of memory-dependent Markov chains of increasing size.

Formal Controller Synthesis for Markov Jump Linear Systems with Uncertain Dynamics

no code implementations • 1 Dec 2022 • Luke Rickard, Thom Badings, Licio Romao, Alessandro Abate

We consider the cases where the transition probabilities of this MDP are either known up to an interval or completely unknown.

Probabilities Are Not Enough: Formal Controller Synthesis for Stochastic Dynamical Models with Epistemic Uncertainty

1 code implementation • 12 Oct 2022 • Thom Badings, Licio Romao, Alessandro Abate, Nils Jansen

Stochastic noise causes aleatoric uncertainty, whereas imprecise knowledge of model parameters leads to epistemic uncertainty.

Bounded Robustness in Reinforcement Learning via Lexicographic Objectives

no code implementations • 30 Sep 2022 • Daniel Jarne Ornia, Licio Romao, Lewis Hammond, Manuel Mazo Jr., Alessandro Abate

Policy robustness in Reinforcement Learning may not be desirable at any cost: the alterations caused by robustness requirements from otherwise optimal policies should be explainable, quantifiable and formally verifiable.

reinforcement-learning • Reinforcement Learning (RL)

LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning

1 code implementation • 21 Sep 2022 • Hosein Hasanbeig, Daniel Kroening, Alessandro Abate

LCRL is a software tool that implements model-free Reinforcement Learning (RL) algorithms over unknown Markov Decision Processes (MDPs), synthesising policies that satisfy a given linear temporal specification with maximal probability.

reinforcement-learning • Reinforcement Learning (RL)

Learning Task Automata for Reinforcement Learning using Hidden Markov Models

no code implementations • 25 Aug 2022 • Alessandro Abate, Yousif Almulla, James Fox, David Hyland, Michael Wooldridge

Second, we propose a novel method for distilling the task automaton (assumed to be a deterministic finite automaton) from the learnt product MDP.

reinforcement-learning • Reinforcement Learning (RL) • +1

Low Emission Building Control with Zero-Shot Reinforcement Learning

no code implementations • 12 Aug 2022 • Scott R. Jeen, Alessandro Abate, Jonathan M. Cullen

Heating and cooling systems in buildings account for 31% of global energy use, much of which is regulated by Rule Based Controllers (RBCs) that neither maximise energy efficiency nor minimise emissions by interacting optimally with the grid.

reinforcement-learning • Reinforcement Learning (RL)

Low Emission Building Control with Zero-Shot Reinforcement Learning

1 code implementation • 28 Jun 2022 • Scott R. Jeen, Alessandro Abate, Jonathan M. Cullen

Heating and cooling systems in buildings account for 31% of global energy use, much of which is regulated by Rule Based Controllers (RBCs) that neither maximise energy efficiency nor minimise emissions by interacting optimally with the grid.

reinforcement-learning • Reinforcement Learning (RL)

Invariance in Policy Optimisation and Partial Identifiability in Reward Learning

no code implementations • 14 Mar 2022 • Joar Skalse, Matthew Farrugia-Roberts, Stuart Russell, Alessandro Abate, Adam Gleave

It is often very challenging to manually design reward functions for complex, real-world tasks.

Sampling-Based Robust Control of Autonomous Systems with Non-Gaussian Noise

no code implementations • 25 Oct 2021 • Thom S. Badings, Alessandro Abate, Nils Jansen, David Parker, Hasan A. Poonawala, Marielle Stoelinga

We use state-of-the-art verification techniques to provide guarantees on the iMDP, and compute a controller for which these guarantees carry over to the autonomous system.

Certification of Iterative Predictions in Bayesian Neural Networks

1 code implementation • 21 May 2021 • Matthew Wicker, Luca Laurenti, Andrea Patane, Nicola Paoletti, Alessandro Abate, Marta Kwiatkowska

We consider the problem of computing reach-avoid probabilities for iterative predictions made with Bayesian neural network (BNN) models.

Reinforcement Learning (RL)

Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic

1 code implementation • 24 Feb 2021 • Mingyu Cai, Mohammadhosein Hasanbeig, Shaoping Xiao, Alessandro Abate, Zhen Kan

This paper investigates the motion planning of autonomous dynamical systems modeled by Markov decision processes (MDPs) with unknown transition probabilities over continuous state and action spaces.

Motion Planning • OpenAI Gym • +2

Equilibrium Refinements for Multi-Agent Influence Diagrams: Theory and Practice

1 code implementation • 9 Feb 2021 • Lewis Hammond, James Fox, Tom Everitt, Alessandro Abate, Michael Wooldridge

Multi-agent influence diagrams (MAIDs) are a popular form of graphical model that, for certain classes of games, have been shown to offer key complexity and explainability advantages over traditional extensive form game (EFG) representations.

Multi-Agent Reinforcement Learning with Temporal Logic Specifications

1 code implementation • 1 Feb 2021 • Lewis Hammond, Alessandro Abate, Julian Gutierrez, Michael Wooldridge

In this paper, we study the problem of learning to satisfy temporal logic specifications with a group of agents in an unknown environment, which may exhibit probabilistic behaviour.

Multi-agent Reinforcement Learning • reinforcement-learning • +1

SafePILCO: a software tool for safe and data-efficient policy synthesis

1 code implementation • 7 Aug 2020 • Kyriakos Polymenakos, Nikitas Rontsis, Alessandro Abate, Stephen Roberts

SafePILCO is a software tool for safe and data-efficient policy search with reinforcement learning.

reinforcement-learning • Reinforcement Learning (RL)

Automated and Sound Synthesis of Lyapunov Functions with SMT Solvers

no code implementations • 21 Jul 2020 • Daniele Ahmed, Andrea Peruffo, Alessandro Abate

In this paper we employ SMT solvers to soundly synthesise Lyapunov functions that assert the stability of a given dynamical model.

Automated and Formal Synthesis of Neural Barrier Certificates for Dynamical Models

no code implementations • 7 Jul 2020 • Andrea Peruffo, Daniele Ahmed, Alessandro Abate

We introduce an automated, formal, counterexample-based approach to synthesise Barrier Certificates (BC) for the safety verification of continuous and hybrid dynamical models.

Jump Operator Planning: Goal-Conditioned Policy Ensembles and Zero-Shot Transfer

no code implementations • 6 Jul 2020 • Thomas J. Ringstrom, Mohammadhosein Hasanbeig, Alessandro Abate

In Hierarchical Control, compositionality, abstraction, and task-transfer are crucial for designing versatile algorithms which can solve a variety of problems with maximal representational reuse.

A Randomized Algorithm to Reduce the Support of Discrete Measures

1 code implementation • NeurIPS 2020 • Francesco Cosentino, Harald Oberhauser, Alessandro Abate

Given a discrete probability measure supported on $N$ atoms and a set of $n$ real-valued functions, there exists a probability measure that is supported on a subset of $n+1$ of the original $N$ atoms and has the same mean when integrated against each of the $n$ functions.

Carathéodory Sampling for Stochastic Gradient Descent

1 code implementation • 2 Jun 2020 • Francesco Cosentino, Harald Oberhauser, Alessandro Abate

Various flavours of Stochastic Gradient Descent (SGD) replace the expensive summation that computes the full gradient with a small sum over a randomly selected subsample of the data set, an approximation that in turn suffers from high variance.

Formal Synthesis of Lyapunov Neural Networks

no code implementations • 19 Mar 2020 • Alessandro Abate, Daniele Ahmed, Mirco Giacobbe, Andrea Peruffo

We employ a counterexample-guided approach where a numerical learner and a symbolic verifier interact to construct provably correct Lyapunov neural networks (LNNs).

Cautious Reinforcement Learning with Logical Constraints

no code implementations • 26 Feb 2020 • Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

This paper presents the concept of an adaptive safe padding that forces Reinforcement Learning (RL) to synthesise optimal control policies while ensuring safety during the learning process.

reinforcement-learning • Reinforcement Learning (RL)

Safety Guarantees for Planning Based on Iterative Gaussian Processes

no code implementations • 29 Nov 2019 • Kyriakos Polymenakos, Luca Laurenti, Andrea Patane, Jan-Peter Calliess, Luca Cardelli, Marta Kwiatkowska, Alessandro Abate, Stephen Roberts

Gaussian Processes (GPs) are widely employed in control and learning because of their principled treatment of uncertainty.

Gaussian Processes • Safe Reinforcement Learning

DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning

1 code implementation • 22 Nov 2019 • Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate, Tom Melham, Daniel Kroening

This paper proposes DeepSynth, a method for effective training of deep Reinforcement Learning (RL) agents when the reward is sparse and non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives.

Hierarchical Reinforcement Learning • Montezuma's Revenge • +4

Modular Deep Reinforcement Learning with Temporal Logic Specifications

2 code implementations • 23 Sep 2019 • Lim Zun Yuan, Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

We propose an actor-critic, model-free, and online Reinforcement Learning (RL) framework for continuous-state continuous-action Markov Decision Processes (MDPs) when the reward is highly sparse but encompasses a high-level temporal structure.

reinforcement-learning • Reinforcement Learning (RL)

Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees

1 code implementation • 11 Sep 2019 • Mohammadhosein Hasanbeig, Yiannis Kantaros, Alessandro Abate, Daniel Kroening, George J. Pappas, Insup Lee

Reinforcement Learning (RL) has emerged as an efficient method of choice for solving complex sequential decision making problems in automatic control, computer science, economics, and biology.

Decision Making • Decision Making Under Uncertainty • +4

Certified Reinforcement Learning with Logic Guidance

1 code implementation • 2 Feb 2019 • Hosein Hasanbeig, Daniel Kroening, Alessandro Abate

Reinforcement Learning (RL) is a widely employed machine learning architecture that has been applied to a variety of control problems.

Decision Making • Decision Making Under Uncertainty • +4

Logically-Constrained Neural Fitted Q-Iteration

no code implementations • 20 Sep 2018 • Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

We propose a method for efficient training of Q-functions for continuous-state Markov Decision Processes (MDPs) such that the traces of the resulting policies satisfy a given Linear Temporal Logic (LTL) property.

Logically-Constrained Reinforcement Learning

1 code implementation • 24 Jan 2018 • Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

With this reward function, the policy synthesis procedure is "constrained" by the given specification.

Decision Making • Decision Making Under Uncertainty • +4

Safe Policy Search with Gaussian Process Models

1 code implementation • 15 Dec 2017 • Kyriakos Polymenakos, Alessandro Abate, Stephen Roberts

We propose a method to optimise the parameters of a policy which will be used to safely perform a given task in a data-efficient manner.

Automated Experiment Design for Data-Efficient Verification of Parametric Markov Decision Processes

no code implementations • 5 Jul 2017 • Elizabeth Polgreen, Viraj Wijesuriya, Sofie Haesaert, Alessandro Abate

We present a new method for statistical verification of quantitative properties over a partially unknown system with actions, utilising a parameterised model (in this work, a parametric Markov decision process) and data collected from experiments performed on the underlying system.

Sampling-based Approximations with Quantitative Performance for the Probabilistic Reach-Avoid Problem over General Markov Processes

no code implementations • 1 Sep 2014 • Sofie Haesaert, Robert Babuska, Alessandro Abate

This article deals with stochastic processes endowed with the Markov (memoryless) property and evolving over general (uncountable) state spaces.
