Search Results for author: Shengbo Wang

Found 13 papers, 2 papers with code

Constrained Bayesian Optimization Under Partial Observations: Balanced Improvements and Provable Convergence

1 code implementation6 Dec 2023 Shengbo Wang, Ke Li

We endeavor to design an efficient and provable method for expensive POCOPs under the framework of constrained Bayesian optimization.

Bayesian Optimization

On the Foundation of Distributionally Robust Reinforcement Learning

no code implementations15 Nov 2023 Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

This is accomplished through a comprehensive modeling framework centered around distributionally robust Markov decision processes (DRMDPs).

reinforcement-learning

Optimal Sample Complexity for Average Reward Markov Decision Processes

no code implementations13 Oct 2023 Shengbo Wang, Jose Blanchet, Peter Glynn

In this context, the existing literature provides a sample complexity upper bound of $\widetilde O(|S||A|t_{\text{mix}}^2 \epsilon^{-2})$ and a lower bound of $\Omega(|S||A|t_{\text{mix}} \epsilon^{-2})$.

Intelligent machines work in unstructured environments by differential neuromorphic computing

no code implementations16 Sep 2023 Shengbo Wang, Shuo Gao, Chenyu Tang, Edoardo Occhipinti, Cong Li, Shurui Wang, Jiaqi Wang, Hubin Zhao, Guohua Hu, Arokia Nathan, Ravinder Dahiya, Luigi Occhipinti

By mimicking the intrinsic nature of human low-level perception mechanisms, the electronic memristive neuromorphic circuit-based method, presented here shows the potential for adapting to diverse sensing technologies and helping intelligent machines generate smart high-level decisions in the real world.

Autonomous Driving Decision Making

Model-Assisted Probabilistic Safe Adaptive Control With Meta-Bayesian Learning

no code implementations3 Jul 2023 Shengbo Wang, Ke Li, Yin Yang, Yuting Cao, TingWen Huang, Shiping Wen

Specifically, with the help of CBF method, we learn the inherent and external uncertainties by a unified adaptive Bayesian linear regression (ABLR) model, which consists of a forward neural network (NN) and a Bayesian output layer.

Meta-Learning Safe Exploration

Sample Complexity of Variance-reduced Distributionally Robust Q-learning

no code implementations28 May 2023 Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

Further, the variance-reduced distributionally robust Q-learning combines the synchronous Q-learning with variance-reduction techniques to enhance its performance.

Decision Making Q-Learning

A Finite Sample Complexity Bound for Distributionally Robust Q-learning

no code implementations26 Feb 2023 Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

We consider a reinforcement learning setting in which the deployment environment is different from the training environment.

Q-Learning

Optimal Sample Complexity of Reinforcement Learning for Mixing Discounted Markov Decision Processes

no code implementations15 Feb 2023 Shengbo Wang, Jose Blanchet, Peter Glynn

We consider the optimal sample complexity theory of tabular reinforcement learning (RL) for maximizing the infinite horizon discounted reward in a Markov decision process (MDP).

reinforcement-learning Reinforcement Learning (RL)

Suboptimal Safety-Critical Control for Continuous Systems Using Prediction-Correction Online Optimization

no code implementations29 Mar 2022 Shengbo Wang, Shiping Wen, Yin Yang, Yuting Cao, Kaibo Shi, TingWen Huang

This paper investigates the control barrier function (CBF) based safety-critical control for continuous nonlinear control affine systems using the more efficient online algorithms through time-varying optimization.

Robust Adaptive Safety-Critical Control for Unknown Systems with Finite-Time Element-Wise Parameter Estimation

no code implementations27 Nov 2021 Shengbo Wang, Bo Lyu, Shiping Wen, Kaibo Shi, Song Zhu, TingWen Huang

On the one hand, it is shown that the proposed control scheme can always guarantee the safety in the identification process with noised signal injection excitation, which was not considered in the previous study.

AutoGMap: Learning to Map Large-scale Sparse Graphs on Memristive Crossbars

1 code implementation15 Nov 2021 Bo Lyu, Shengbo Wang, Shiping Wen, Kaibo Shi, Yin Yang, Lingfang Zeng, TingWen Huang

But the exploration of large-scale sparse graph computing on processing-in-memory (PIM) platforms (typically with memristive crossbars) is still in its infancy.

Decision Making Knowledge Graphs +1

Cannot find the paper you are looking for? You can Submit a new open access paper.