no code implementations • 19 Mar 2025 • David E. J. van Wijk, Ersin Das, Anil Alan, Samuel Coogan, Tamas G. Molnar, Joel W. Burdick, Manoranjan Majji, Kerianne L. Hobbs
To tackle this shortcoming, we integrate backup control barrier functions with a disturbance observer and estimate the flow under a reconstruction of the disturbance while refining this estimate over time.
no code implementations • 19 Feb 2025 • Vince Kurtz, Joel W. Burdick
In particular, we introduce generative predictive control, a supervised learning framework for tasks with fast dynamics that are easy to simulate but difficult to demonstrate.
no code implementations • 26 Nov 2024 • Yitaek Kim, Ersin Das, Jeeseop Kim, Aaron D. Ames, Joel W. Burdick, Christoffer Sloth
The main idea is to reduce the maximum delay estimation error bound so that the state prediction error bound is monotonically non-increasing.
1 code implementation • 26 Nov 2024 • Xiao Tan, Ersin Das, Aaron D. Ames, Joel W. Burdick
We propose a novel zero-order control barrier function (ZOCBF) for sampled-data systems to ensure system safety.
no code implementations • 9 Sep 2024 • Vince Kurtz, Joel W. Burdick
In this paper, we show how to automatically generate supervised learning data for a class of continuous-time nonlinear stochastic optimal control problems.
no code implementations • 13 Mar 2024 • Ersin Das, Aaron D. Ames, Joel W. Burdick
This paper develops rollover prevention guarantees for mobile robots using control barrier function (CBF) theory, and demonstrates the method experimentally.
no code implementations • 3 Jan 2024 • Ersin Das, Joel W. Burdick
Our constructive framework couples control barrier function (CBF) theory with a new uncertainty estimator to ensure robust safety.
no code implementations • 12 Dec 2022 • Prithvi Akella, Skylar X. Wei, Joel W. Burdick, Aaron D. Ames
Recent advances in safety-critical risk-aware control are predicated on apriori knowledge of the disturbances a system might face.
no code implementations • 21 Apr 2022 • Prithvi Akella, Anushri Dixit, Mohamadreza Ahmadi, Joel W. Burdick, Aaron D. Ames
The dramatic increase of autonomous systems subject to variable environments has given rise to the pressing need to consider risk in both the synthesis and verification of policies for these systems.
no code implementations • 20 Apr 2022 • Anushri Dixit, Mohamadreza Ahmadi, Joel W. Burdick
This paper studies the problem of risk-averse receding horizon motion planning for agents with uncertain dynamics, in the presence of stochastic, dynamic obstacles.
no code implementations • 22 Mar 2022 • Anushri Dixit, Mohamadreza Ahmadi, Joel W. Burdick
This paper studies the problem of distributionally robust model predictive control (MPC) using total variation distance ambiguity sets.
no code implementations • 26 Mar 2021 • Mohamadreza Ahmadi, Anushri Dixit, Joel W. Burdick, Aaron D. Ames
We consider the stochastic shortest path planning problem in MDPs, i. e., the problem of designing policies that ensure reaching a goal state from a given initial state with minimum accrued cost.
no code implementations • 18 Feb 2021 • Yidan Qin, Max Allan, Yisong Yue, Joel W. Burdick, Mahdi Azizian
The combination of high diversity and limited data calls for new learning methods that are robust and invariant to operating conditions and surgical techniques.
no code implementations • 23 Nov 2020 • Anushri Dixit, Mohamadreza Ahmadi, Joel W. Burdick
We consider the problem of risk-sensitive motion planning in the presence of randomly moving obstacles.
1 code implementation • 9 Nov 2020 • Kejun Li, Maegan Tucker, Erdem Biyik, Ellen Novoseller, Joel W. Burdick, Yanan Sui, Dorsa Sadigh, Yisong Yue, Aaron D. Ames
ROIAL learns Bayesian posteriors that predict each exoskeleton user's utility landscape across four exoskeleton gait parameters.
no code implementations • 24 Sep 2020 • Yidan Qin, Seyedshams Feyzabadi, Max Allan, Joel W. Burdick, Mahdi Azizian
We propose daVinciNet - an end-to-end dual-task model for robot motion and surgical state predictions.
1 code implementation • 11 Apr 2020 • Richard Cheng, Mohammad Javad Khojasteh, Aaron D. Ames, Joel W. Burdick
Robots operating in real world settings must navigate and maintain safety while interacting with many heterogeneous agents and obstacles.
1 code implementation • 13 Mar 2020 • Maegan Tucker, Myra Cheng, Ellen Novoseller, Richard Cheng, Yisong Yue, Joel W. Burdick, Aaron D. Ames
Optimizing lower-body exoskeleton walking gaits for user comfort requires understanding users' preferences over a high-dimensional gait parameter space.
no code implementations • 7 Feb 2020 • Yidan Qin, Sahba Aghajani Pedram, Seyedshams Feyzabadi, Max Allan, A. Jonathan McLeod, Joel W. Burdick, Mahdi Azizian
A crucial step towards the automation of such surgical tasks is the temporal perception of the current surgical scene, which requires a real-time estimation of the states in the FSMs.
no code implementations • 21 Jan 2020 • Mohamadreza Ahmadi, Rangoli Sharan, Joel W. Burdick
Partially observable Markov decision processes (POMDPs) provide a modeling framework for autonomous decision making under uncertainty and imperfect sensing, e. g. robot manipulation and self-driving cars.
1 code implementation • 4 Aug 2019 • Ellen R. Novoseller, Yibing Wei, Yanan Sui, Yisong Yue, Joel W. Burdick
In preference-based reinforcement learning (RL), an agent interacts with the environment while receiving preferences instead of absolute feedback.
1 code implementation • 14 May 2019 • Richard Cheng, Abhinav Verma, Gabor Orosz, Swarat Chaudhuri, Yisong Yue, Joel W. Burdick
We show that functional regularization yields a bias-variance trade-off, and propose an adaptive tuning strategy to optimize this trade-off.
1 code implementation • 21 Mar 2019 • Richard Cheng, Gabor Orosz, Richard M. Murray, Joel W. Burdick
Reinforcement Learning (RL) algorithms have found limited success beyond simulated applications, and one main reason is the absence of safety guarantees during the learning process.
no code implementations • ICML 2018 • Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue
We provide theoretical guarantees for both the satisfaction of safety constraints as well as convergence to the optimal utility value.
no code implementations • 21 Nov 2017 • Yanan Sui, Kun Ho Kim, Joel W. Burdick
Spinal cord stimulation has enabled humans with motor complete spinal cord injury (SCI) to independently stand and recover some lost autonomic function.
no code implementations • 7 Oct 2017 • Kun Li, Joel W. Burdick
Observing that each demonstrator has an inherent reward for each state and the task-specific behaviors mainly depend on a small number of key states, we propose a meta IRL algorithm that first models the reward function for each task as a distribution conditioned on a baseline reward function shared by all tasks and dependent only on the demonstrator, and then finds the most likely reward function in the distribution that explains the task-specific behaviors.
no code implementations • 23 Aug 2017 • Kun Li, Joel W. Burdick
This works handles the inverse reinforcement learning problem in high-dimensional state spaces, which relies on an efficient solution of model-based high-dimensional reinforcement learning problems.
no code implementations • 28 Jul 2017 • Kun Li, Joel W. Burdick
We also show that the proposed method can extend many existing methods to high-dimensional state spaces.
no code implementations • 24 Jul 2017 • Kun Li, Yanan Sui, Joel W. Burdick
We introduce a strategy to flexibly handle different types of actions with two approximations of the Bellman Optimality Equation, and a Bellman Gradient Iteration method to compute the gradient of the Q-value with respect to the reward function.
no code implementations • 22 Jul 2017 • Kun Li, Joel W. Burdick
This paper develops a method to use RGB-D cameras to track the motions of a human spinal cord injury patient undergoing spinal stimulation and physical rehabilitation.
no code implementations • 8 Jul 2017 • Yanan Sui, Yisong Yue, Joel W. Burdick
This problem can be formulated as a $K$-armed Dueling Bandits problem where $K$ is the total number of decisions.
no code implementations • 29 Apr 2017 • Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue
The dueling bandits problem is an online learning framework for learning from pairwise preference feedback, and is particularly well-suited for modeling settings that elicit subjective or implicit human feedback.
no code implementations • 15 Jan 2014 • Matanya B. Horowitz, Nikolai Matni, Joel W. Burdick
The method is a convex relaxation of the classical pose estimation problem, and is based on explicit linear matrix inequality (LMI) representations for the convex hulls of $SE(2)$ and $SE(3)$.