Search Results for author: Sven Schewe

Found 15 papers, 4 papers with code

Omega-Regular Decision Processes

no code implementations • 14 Dec 2023 • Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

Regular decision processes (RDPs) are a subclass of non-Markovian decision processes where the transition and reward functions are guarded by some regular property of the past (a lookback).

Paper
Add Code

Omega-Regular Reward Machines

no code implementations • 14 Aug 2023 • Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

Reinforcement learning (RL) is a powerful approach for training agents to perform tasks, but designing an appropriate reward mechanism is critical to its success.

Reinforcement Learning (RL)

Paper
Add Code

Recursive Reinforcement Learning

no code implementations • 23 Jun 2022 • Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

Recursion is the fundamental paradigm to finitely describe potentially infinite objects.

Q-Learning reinforcement-learning +1

Paper
Add Code

Alternating Good-for-MDP Automata

no code implementations • 6 May 2022 • Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

The surprising answer is that we have to pay significantly less when we instead expand the good-for-MDP property to alternating automata: like the nondeterministic GFM automata obtained from deterministic Rabin automata, the alternating good-for-MDP automata we produce from deterministic Streett automata are bi-linear in the the size of the deterministic automaton and its index, and can therefore be exponentially more succinct than minimal nondeterministic B\"uchi automata.

Reinforcement Learning (RL) Translation

Paper
Add Code

Enhancing Adversarial Training with Second-Order Statistics of Weights

1 code implementation • CVPR 2022 • Gaojie Jin, Xinping Yi, Wei Huang, Sven Schewe, Xiaowei Huang

In this paper, we show that treating model weights as random variables allows for enhancing adversarial training through \textbf{S}econd-Order \textbf{S}tatistics \textbf{O}ptimization (S$^2$O) with respect to the weights.

Paper
Code

Weight Expansion: A New Perspective on Dropout and Generalization

no code implementations • 23 Jan 2022 • Gaojie Jin, Xinping Yi, Pengfei Yang, Lijun Zhang, Sven Schewe, Xiaowei Huang

While dropout is known to be a successful regularization technique, insights into the mechanisms that lead to this success are still lacking.

Paper
Add Code

Reliability Assessment and Safety Arguments for Machine Learning Components in System Assurance

no code implementations • 30 Nov 2021 • Yi Dong, Wei Huang, Vibhav Bharti, Victoria Cox, Alec Banks, Sen Wang, Xingyu Zhao, Sven Schewe, Xiaowei Huang

The increasing use of Machine Learning (ML) components embedded in autonomous systems -- so-called Learning-Enabled Systems (LESs) -- has resulted in the pressing need to assure their functional safety.

Paper
Add Code

Mungojerrie: Reinforcement Learning of Linear-Time Objectives

no code implementations • 16 Jun 2021 • Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

Reinforcement learning synthesizes controllers without prior knowledge of the system.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Model-free Reinforcement Learning for Branching Markov Decision Processes

no code implementations • 12 Jun 2021 • Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

We study reinforcement learning for the optimal control of Branching Markov Decision Processes (BMDPs), a natural extension of (multitype) Branching Markov Chains (BMCs).

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Assessing the Reliability of Deep Learning Classifiers Through Robustness Evaluation and Operational Profiles

1 code implementation • 2 Jun 2021 • Xingyu Zhao, Wei Huang, Alec Banks, Victoria Cox, David Flynn, Sven Schewe, Xiaowei Huang

The utilisation of Deep Learning (DL) is advancing into increasingly more sophisticated applications.

Paper
Code

Detecting Operational Adversarial Examples for Reliable Deep Learning

no code implementations • 13 Apr 2021 • Xingyu Zhao, Wei Huang, Sven Schewe, Yi Dong, Xiaowei Huang

The utilisation of Deep Learning (DL) raises new challenges regarding its dependability in critical applications.

Paper
Add Code

Abstraction and Symbolic Execution of Deep Neural Networks with Bayesian Approximation of Hidden Features

1 code implementation • 5 Mar 2021 • Nicolas Berthier, Amany Alshareef, James Sharp, Sven Schewe, Xiaowei Huang

Intensive research has been conducted on the verification and validation of deep neural networks (DNNs), aiming to understand if, and how, DNNs can be applied to safety critical applications.

Dimensionality Reduction

115

Paper
Code

Simple Stochastic Games with Almost-Sure Energy-Parity Objectives are in NP and coNP

no code implementations • 18 Jan 2021 • Richard Mayr, Sven Schewe, Patrick Totzke, Dominik Wojtczak

We study stochastic games with energy-parity objectives, which combine quantitative rewards with a qualitative $\omega$-regular condition: The maximizer aims to avoid running out of energy while simultaneously satisfying a parity condition.

Computer Science and Game Theory Logic in Computer Science

Paper
Add Code

How does Weight Correlation Affect Generalisation Ability of Deep Neural Networks?

no code implementations • NeurIPS 2020 • Gaojie Jin, Xinping Yi, Liang Zhang, Lijun Zhang, Sven Schewe, Xiaowei Huang

This paper studies the novel concept of weight correlation in deep neural networks and discusses its impact on the networks' generalisation ability.

Paper
Add Code

How does Weight Correlation Affect the Generalisation Ability of Deep Neural Networks

1 code implementation • 12 Oct 2020 • Gaojie Jin, Xinping Yi, Liang Zhang, Lijun Zhang, Sven Schewe, Xiaowei Huang

This paper studies the novel concept of weight correlation in deep neural networks and discusses its impact on the networks' generalisation ability.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.