Search Results for author: Joel W. Burdick

Found 28 papers, 6 papers with code

Rollover Prevention for Mobile Robots with Control Barrier Functions: Differentiator-Based Adaptation and Projection-to-State Safety

no code implementations • 13 Mar 2024 • Ersin Das, Aaron D. Ames, Joel W. Burdick

To this end, we consider a safety measure based on the zero moment point to provide conditions on the control input through the lens of CBFs.

Paper
Add Code

Robust Control Barrier Functions using Uncertainty Estimation with Application to Mobile Robots

no code implementations • 3 Jan 2024 • Ersin Das, Joel W. Burdick

Then, we robustify existing CBF constraints with this uncertainty estimate and the estimation error bounds to ensure robust safety via a quadratic program (CBF-QP).

Paper
Add Code

Learning Disturbances Online for Risk-Aware Control: Risk-Aware Flight with Less Than One Minute of Data

no code implementations • 12 Dec 2022 • Prithvi Akella, Skylar X. Wei, Joel W. Burdick, Aaron D. Ames

Recent advances in safety-critical risk-aware control are predicated on apriori knowledge of the disturbances a system might face.

Paper
Add Code

Sample-Based Bounds for Coherent Risk Measures: Applications to Policy Synthesis and Verification

no code implementations • 21 Apr 2022 • Prithvi Akella, Anushri Dixit, Mohamadreza Ahmadi, Joel W. Burdick, Aaron D. Ames

The dramatic increase of autonomous systems subject to variable environments has given rise to the pressing need to consider risk in both the synthesis and verification of policies for these systems.

Paper
Add Code

Risk-Averse Receding Horizon Motion Planning for Obstacle Avoidance using Coherent Risk Measures

no code implementations • 20 Apr 2022 • Anushri Dixit, Mohamadreza Ahmadi, Joel W. Burdick

This paper studies the problem of risk-averse receding horizon motion planning for agents with uncertain dynamics, in the presence of stochastic, dynamic obstacles.

Model Predictive Control Motion Planning

Paper
Add Code

Distributionally Robust Model Predictive Control with Total Variation Distance

no code implementations • 22 Mar 2022 • Anushri Dixit, Mohamadreza Ahmadi, Joel W. Burdick

This paper studies the problem of distributionally robust model predictive control (MPC) using total variation distance ambiguity sets.

Computational Efficiency Model Predictive Control

Paper
Add Code

Risk-Averse Stochastic Shortest Path Planning

no code implementations • 26 Mar 2021 • Mohamadreza Ahmadi, Anushri Dixit, Joel W. Burdick, Aaron D. Ames

We consider the stochastic shortest path planning problem in MDPs, i. e., the problem of designing policies that ensure reaching a goal state from a given initial state with minimum accrued cost.

Paper
Add Code

Learning Invariant Representation of Tasks for Robust Surgical State Estimation

no code implementations • 18 Feb 2021 • Yidan Qin, Max Allan, Yisong Yue, Joel W. Burdick, Mahdi Azizian

The combination of high diversity and limited data calls for new learning methods that are robust and invariant to operating conditions and surgical techniques.

Paper
Add Code

Risk-Sensitive Motion Planning using Entropic Value-at-Risk

no code implementations • 23 Nov 2020 • Anushri Dixit, Mohamadreza Ahmadi, Joel W. Burdick

We consider the problem of risk-sensitive motion planning in the presence of randomly moving obstacles.

Model Predictive Control Motion Planning

Paper
Add Code

ROIAL: Region of Interest Active Learning for Characterizing Exoskeleton Gait Preference Landscapes

1 code implementation • 9 Nov 2020 • Kejun Li, Maegan Tucker, Erdem Biyik, Ellen Novoseller, Joel W. Burdick, Yanan Sui, Dorsa Sadigh, Yisong Yue, Aaron D. Ames

ROIAL learns Bayesian posteriors that predict each exoskeleton user's utility landscape across four exoskeleton gait parameters.

Active Learning

Paper
Code

daVinciNet: Joint Prediction of Motion and Surgical State in Robot-Assisted Surgery

no code implementations • 24 Sep 2020 • Yidan Qin, Seyedshams Feyzabadi, Max Allan, Joel W. Burdick, Mahdi Azizian

We propose daVinciNet - an end-to-end dual-task model for robot motion and surgical state predictions.

Trajectory Prediction

Paper
Add Code

Safe Multi-Agent Interaction through Robust Control Barrier Functions with Learned Uncertainties

1 code implementation • 11 Apr 2020 • Richard Cheng, Mohammad Javad Khojasteh, Aaron D. Ames, Joel W. Burdick

Robots operating in real world settings must navigate and maintain safety while interacting with many heterogeneous agents and obstacles.

Navigate

Paper
Code

Human Preference-Based Learning for High-dimensional Optimization of Exoskeleton Walking Gaits

1 code implementation • 13 Mar 2020 • Maegan Tucker, Myra Cheng, Ellen Novoseller, Richard Cheng, Yisong Yue, Joel W. Burdick, Aaron D. Ames

Optimizing lower-body exoskeleton walking gaits for user comfort requires understanding users' preferences over a high-dimensional gait parameter space.

Paper
Code

Temporal Segmentation of Surgical Sub-tasks through Deep Learning with Multiple Data Sources

no code implementations • 7 Feb 2020 • Yidan Qin, Sahba Aghajani Pedram, Seyedshams Feyzabadi, Max Allan, A. Jonathan McLeod, Joel W. Burdick, Mahdi Azizian

A crucial step towards the automation of such surgical tasks is the temporal perception of the current surgical scene, which requires a real-time estimation of the states in the FSMs.

Paper
Add Code

Stochastic Finite State Control of POMDPs with LTL Specifications

no code implementations • 21 Jan 2020 • Mohamadreza Ahmadi, Rangoli Sharan, Joel W. Burdick

Partially observable Markov decision processes (POMDPs) provide a modeling framework for autonomous decision making under uncertainty and imperfect sensing, e. g. robot manipulation and self-driving cars.

Decision Making Decision Making Under Uncertainty +3

Paper
Add Code

Dueling Posterior Sampling for Preference-Based Reinforcement Learning

1 code implementation • 4 Aug 2019 • Ellen R. Novoseller, Yibing Wei, Yanan Sui, Yisong Yue, Joel W. Burdick

In preference-based reinforcement learning (RL), an agent interacts with the environment while receiving preferences instead of absolute feedback.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Control Regularization for Reduced Variance Reinforcement Learning

1 code implementation • 14 May 2019 • Richard Cheng, Abhinav Verma, Gabor Orosz, Swarat Chaudhuri, Yisong Yue, Joel W. Burdick

We show that functional regularization yields a bias-variance trade-off, and propose an adaptive tuning strategy to optimize this trade-off.

Continuous Control reinforcement-learning +1

Paper
Code

End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks

1 code implementation • 21 Mar 2019 • Richard Cheng, Gabor Orosz, Richard M. Murray, Joel W. Burdick

Reinforcement Learning (RL) algorithms have found limited success beyond simulated applications, and one main reason is the absence of safety guarantees during the learning process.

Continuous Control Gaussian Processes +2

116

Paper
Code

Stagewise Safe Bayesian Optimization with Gaussian Processes

no code implementations • ICML 2018 • Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue

We provide theoretical guarantees for both the satisfaction of safety constraints as well as convergence to the optimal utility value.

Bayesian Optimization Decision Making +2

Paper
Add Code

Quantifying Performance of Bipedal Standing with Multi-channel EMG

no code implementations • 21 Nov 2017 • Yanan Sui, Kun Ho Kim, Joel W. Burdick

Spinal cord stimulation has enabled humans with motor complete spinal cord injury (SCI) to independently stand and recover some lost autonomic function.

Electromyography (EMG)

Paper
Add Code

Meta Inverse Reinforcement Learning via Maximum Reward Sharing for Human Motion Analysis

no code implementations • 7 Oct 2017 • Kun Li, Joel W. Burdick

Observing that each demonstrator has an inherent reward for each state and the task-specific behaviors mainly depend on a small number of key states, we propose a meta IRL algorithm that first models the reward function for each task as a distribution conditioned on a baseline reward function shared by all tasks and dependent only on the demonstrator, and then finds the most likely reward function in the distribution that explains the task-specific behaviors.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

A Function Approximation Method for Model-based High-Dimensional Inverse Reinforcement Learning

no code implementations • 23 Aug 2017 • Kun Li, Joel W. Burdick

This works handles the inverse reinforcement learning problem in high-dimensional state spaces, which relies on an efficient solution of model-based high-dimensional reinforcement learning problems.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Inverse Reinforcement Learning in Large State Spaces via Function Approximation

no code implementations • 28 Jul 2017 • Kun Li, Joel W. Burdick

We also show that the proposed method can extend many existing methods to high-dimensional state spaces.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Bellman Gradient Iteration for Inverse Reinforcement Learning

no code implementations • 24 Jul 2017 • Kun Li, Yanan Sui, Joel W. Burdick

We introduce a strategy to flexibly handle different types of actions with two approximations of the Bellman Optimality Equation, and a Bellman Gradient Iteration method to compute the gradient of the Q-value with respect to the reward function.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Clinical Patient Tracking in the Presence of Transient and Permanent Occlusions via Geodesic Feature

no code implementations • 22 Jul 2017 • Kun Li, Joel W. Burdick

This paper develops a method to use RGB-D cameras to track the motions of a human spinal cord injury patient undergoing spinal stimulation and physical rehabilitation.

Paper
Add Code

Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces

no code implementations • 8 Jul 2017 • Yanan Sui, Yisong Yue, Joel W. Burdick

This problem can be formulated as a $K$-armed Dueling Bandits problem where $K$ is the total number of decisions.

Decision Making Decision Making Under Uncertainty

Paper
Add Code

Multi-dueling Bandits with Dependent Arms

no code implementations • 29 Apr 2017 • Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue

The dueling bandits problem is an online learning framework for learning from pairwise preference feedback, and is particularly well-suited for modeling settings that elicit subjective or implicit human feedback.

Thompson Sampling

Paper
Add Code

Convex Relaxations of SE(2) and SE(3) for Visual Pose Estimation

no code implementations • 15 Jan 2014 • Matanya B. Horowitz, Nikolai Matni, Joel W. Burdick

The method is a convex relaxation of the classical pose estimation problem, and is based on explicit linear matrix inequality (LMI) representations for the convex hulls of $SE(2)$ and $SE(3)$.

Pose Estimation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.