Search Results for author: Murat Cubuktepe

Found 17 papers, 3 papers with code

Formal Methods for Autonomous Systems

no code implementations2 Nov 2023 Tichakorn Wongpiromsarn, Mahsa Ghasemi, Murat Cubuktepe, Georgios Bakirtzis, Steven Carr, Mustafa O. Karabag, Cyrus Neary, Parham Gohari, Ufuk Topcu

Formal methods refer to rigorous, mathematical approaches to system development and have played a key role in establishing the correctness of safety-critical systems.

Verifiable Reinforcement Learning Systems via Compositionality

no code implementations9 Sep 2023 Cyrus Neary, Aryaman Singh Samyal, Christos Verginis, Murat Cubuktepe, Ufuk Topcu

We propose a framework for verifiable and compositional reinforcement learning (RL) in which a collection of RL subsystems, each of which learns to accomplish a separate subtask, are composed to achieve an overall task.

reinforcement-learning Reinforcement Learning (RL)

Task-Guided IRL in POMDPs that Scales

1 code implementation30 Dec 2022 Franck Djeumou, Christian Ellis, Murat Cubuktepe, Craig Lennon, Ufuk Topcu

First, they require an excessive amount of data due to the information asymmetry between the expert and the learner.

Unity

Convex Optimization for Parameter Synthesis in MDPs

1 code implementation30 Jun 2021 Murat Cubuktepe, Nils Jansen, Sebastian Junges, Joost-Pieter Katoen, Ufuk Topcu

The parameter synthesis problem is to compute an instantiation of these unspecified parameters such that the resulting MDP satisfies the temporal logic specification.

Collision Avoidance

Probabilistic Control of Heterogeneous Swarms Subject to Graph Temporal Logic Specifications: A Decentralized and Scalable Approach

no code implementations29 Jun 2021 Franck Djeumou, Zhe Xu, Murat Cubuktepe, Ufuk Topcu

Specifically, we study a setting in which the agents move along the nodes of a graph, and the high-level task specifications for the swarm are expressed in a recently-proposed language called graph temporal logic (GTL).

Verifiable and Compositional Reinforcement Learning Systems

1 code implementation7 Jun 2021 Cyrus Neary, Christos Verginis, Murat Cubuktepe, Ufuk Topcu

We propose a framework for verifiable and compositional reinforcement learning (RL) in which a collection of RL subsystems, each of which learns to accomplish a separate subtask, are composed to achieve an overall task.

reinforcement-learning Reinforcement Learning (RL)

Task-Guided Inverse Reinforcement Learning Under Partial Information

no code implementations28 May 2021 Franck Djeumou, Murat Cubuktepe, Craig Lennon, Ufuk Topcu

Nevertheless, the resulting formulation is still nonconvex due to the intrinsic nonconvexity of the so-called forward problem, i. e., computing an optimal policy given a reward function, in POMDPs.

reinforcement-learning Reinforcement Learning (RL)

Polynomial-Time Algorithms for Multi-Agent Minimal-Capacity Planning

no code implementations4 May 2021 Murat Cubuktepe, František Blahoudek, Ufuk Topcu

We develop an algorithm that solves this graph problem in time that is \emph{polynomial} in the number of agents, target locations, and size of the consumption Markov decision process.

Robust Finite-State Controllers for Uncertain POMDPs

no code implementations24 Sep 2020 Murat Cubuktepe, Nils Jansen, Sebastian Junges, Ahmadreza Marandi, Marnix Suilen, Ufuk Topcu

(3) We linearize this dual problem and (4) solve the resulting finite linear program to obtain locally optimal solutions to the original problem.

Collision Avoidance Motion Planning

Synthesis of Provably Correct Autonomy Protocols for Shared Control

no code implementations15 May 2019 Murat Cubuktepe, Nils Jansen, Mohammed Alsiekh, Ufuk Topcu

We design the autonomy protocol to ensure that the resulting robot behavior satisfies given safety and performance specifications in probabilistic temporal logic.

Reward-Based Deception with Cognitive Bias

no code implementations25 Apr 2019 Bo Wu, Murat Cubuktepe, Suda Bharadwaj, Ufuk Topcu

In this paper, we consider deceiving adversaries with bounded rationality and in terms of expected rewards.

The Partially Observable Games We Play for Cyber Deception

no code implementations28 Sep 2018 Mohamadreza Ahmadi, Murat Cubuktepe, Nils Jansen, Sebastian Junges, Joost-Pieter Katoen, Ufuk Topcu

Then, the deception problem is to compute a strategy for the deceiver that minimizes the expected cost of deception against all strategies of the infiltrator.

Entropy Maximization for Markov Decision Processes Under Temporal Logic Constraints

no code implementations9 Jul 2018 Yagiz Savas, Melkior Ornik, Murat Cubuktepe, Mustafa O. Karabag, Ufuk Topcu

Such a policy minimizes the predictability of the paths it generates, or dually, maximizes the exploration of different paths in an MDP while ensuring the satisfaction of a temporal logic specification.

Motion Planning

Synthesis in pMDPs: A Tale of 1001 Parameters

no code implementations5 Mar 2018 Murat Cubuktepe, Nils Jansen, Sebastian Junges, Joost-Pieter Katoen, Ufuk Topcu

This paper considers parametric Markov decision processes (pMDPs) whose transitions are equipped with affine functions over a finite set of parameters.

Verification of Markov Decision Processes with Risk-Sensitive Measures

no code implementations28 Feb 2018 Murat Cubuktepe, Ufuk Topcu

We develop a method for computing policies in Markov decision processes with risk-sensitive measures subject to temporal logic constraints.

Synthesis of Shared Control Protocols with Provable Safety and Performance Guarantees

no code implementations26 Oct 2016 Nils Jansen, Murat Cubuktepe, Ufuk Topcu

We formalize synthesis of shared control protocols with correctness guarantees for temporal logic specifications.

Cannot find the paper you are looking for? You can Submit a new open access paper.