Search Results for author: Sven Schewe

Found 15 papers, 4 papers with code

Omega-Regular Decision Processes

no code implementations14 Dec 2023 Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

Regular decision processes (RDPs) are a subclass of non-Markovian decision processes where the transition and reward functions are guarded by some regular property of the past (a lookback).

Omega-Regular Reward Machines

no code implementations14 Aug 2023 Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

Reinforcement learning (RL) is a powerful approach for training agents to perform tasks, but designing an appropriate reward mechanism is critical to its success.

Reinforcement Learning (RL)

Alternating Good-for-MDP Automata

no code implementations6 May 2022 Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

The surprising answer is that we have to pay significantly less when we instead expand the good-for-MDP property to alternating automata: like the nondeterministic GFM automata obtained from deterministic Rabin automata, the alternating good-for-MDP automata we produce from deterministic Streett automata are bi-linear in the the size of the deterministic automaton and its index, and can therefore be exponentially more succinct than minimal nondeterministic B\"uchi automata.

Reinforcement Learning (RL) Translation

Enhancing Adversarial Training with Second-Order Statistics of Weights

1 code implementation CVPR 2022 Gaojie Jin, Xinping Yi, Wei Huang, Sven Schewe, Xiaowei Huang

In this paper, we show that treating model weights as random variables allows for enhancing adversarial training through \textbf{S}econd-Order \textbf{S}tatistics \textbf{O}ptimization (S$^2$O) with respect to the weights.

Weight Expansion: A New Perspective on Dropout and Generalization

no code implementations23 Jan 2022 Gaojie Jin, Xinping Yi, Pengfei Yang, Lijun Zhang, Sven Schewe, Xiaowei Huang

While dropout is known to be a successful regularization technique, insights into the mechanisms that lead to this success are still lacking.

Reliability Assessment and Safety Arguments for Machine Learning Components in System Assurance

no code implementations30 Nov 2021 Yi Dong, Wei Huang, Vibhav Bharti, Victoria Cox, Alec Banks, Sen Wang, Xingyu Zhao, Sven Schewe, Xiaowei Huang

The increasing use of Machine Learning (ML) components embedded in autonomous systems -- so-called Learning-Enabled Systems (LESs) -- has resulted in the pressing need to assure their functional safety.

Model-free Reinforcement Learning for Branching Markov Decision Processes

no code implementations12 Jun 2021 Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

We study reinforcement learning for the optimal control of Branching Markov Decision Processes (BMDPs), a natural extension of (multitype) Branching Markov Chains (BMCs).

reinforcement-learning Reinforcement Learning (RL)

Detecting Operational Adversarial Examples for Reliable Deep Learning

no code implementations13 Apr 2021 Xingyu Zhao, Wei Huang, Sven Schewe, Yi Dong, Xiaowei Huang

The utilisation of Deep Learning (DL) raises new challenges regarding its dependability in critical applications.

Abstraction and Symbolic Execution of Deep Neural Networks with Bayesian Approximation of Hidden Features

1 code implementation5 Mar 2021 Nicolas Berthier, Amany Alshareef, James Sharp, Sven Schewe, Xiaowei Huang

Intensive research has been conducted on the verification and validation of deep neural networks (DNNs), aiming to understand if, and how, DNNs can be applied to safety critical applications.

Dimensionality Reduction

Simple Stochastic Games with Almost-Sure Energy-Parity Objectives are in NP and coNP

no code implementations18 Jan 2021 Richard Mayr, Sven Schewe, Patrick Totzke, Dominik Wojtczak

We study stochastic games with energy-parity objectives, which combine quantitative rewards with a qualitative $\omega$-regular condition: The maximizer aims to avoid running out of energy while simultaneously satisfying a parity condition.

Computer Science and Game Theory Logic in Computer Science

How does Weight Correlation Affect Generalisation Ability of Deep Neural Networks?

no code implementations NeurIPS 2020 Gaojie Jin, Xinping Yi, Liang Zhang, Lijun Zhang, Sven Schewe, Xiaowei Huang

This paper studies the novel concept of weight correlation in deep neural networks and discusses its impact on the networks' generalisation ability.

How does Weight Correlation Affect the Generalisation Ability of Deep Neural Networks

1 code implementation12 Oct 2020 Gaojie Jin, Xinping Yi, Liang Zhang, Lijun Zhang, Sven Schewe, Xiaowei Huang

This paper studies the novel concept of weight correlation in deep neural networks and discusses its impact on the networks' generalisation ability.

Cannot find the paper you are looking for? You can Submit a new open access paper.