no code implementations • 17 Jun 2024 • Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou
We explore the control of stochastic systems with potentially continuous state and action spaces, characterized by the state dynamics $X_{t+1} = f(X_t, A_t, W_t)$.
no code implementations • 1 Jun 2024 • Shengbo Wang, Cong Li, Tongming Pu, Jian Zhang, Weihao Ma, Luigi Occhipinti, Arokia Nathan, Shuo Gao
Memristive neuromorphic systems are designed to emulate human perception and cognition, where the memristor states represent essential historical information to perform both low-level and high-level tasks.
no code implementations • 25 Feb 2024 • Lekai Song, Pengyu Liu, Jingfang Pei, Yang Liu, Songwei Liu, Shengbo Wang, Leonard W. T. Ng, Tawfique Hasan, Kong-Pang Pun, Shuo Gao, Guohua Hu
The demand for efficient edge vision has spurred the interest in developing stochastic computing approaches for performing image processing tasks.
1 code implementation • 6 Dec 2023 • Shengbo Wang, Ke Li
We endeavor to design an efficient and provable method for expensive POCOPs under the framework of constrained Bayesian optimization.
no code implementations • 15 Nov 2023 • Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou
This is accomplished through a comprehensive modeling framework centered around distributionally robust Markov decision processes (DRMDPs).
no code implementations • 13 Oct 2023 • Shengbo Wang, Jose Blanchet, Peter Glynn
In this context, the existing literature provides a sample complexity upper bound of $\widetilde O(|S||A|t_{\text{mix}}^2 \epsilon^{-2})$ and a lower bound of $\Omega(|S||A|t_{\text{mix}} \epsilon^{-2})$.
no code implementations • 16 Sep 2023 • Shengbo Wang, Shuo Gao, Chenyu Tang, Edoardo Occhipinti, Cong Li, Shurui Wang, Jiaqi Wang, Hubin Zhao, Guohua Hu, Arokia Nathan, Ravinder Dahiya, Luigi Occhipinti
By mimicking the intrinsic nature of human low-level perception mechanisms, the electronic memristive neuromorphic circuit-based method, presented here shows the potential for adapting to diverse sensing technologies and helping intelligent machines generate smart high-level decisions in the real world.
no code implementations • 3 Jul 2023 • Shengbo Wang, Ke Li, Yin Yang, Yuting Cao, TingWen Huang, Shiping Wen
Specifically, with the help of CBF method, we learn the inherent and external uncertainties by a unified adaptive Bayesian linear regression (ABLR) model, which consists of a forward neural network (NN) and a Bayesian output layer.
no code implementations • 28 May 2023 • Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou
Dynamic decision-making under distributional shifts is of fundamental interest in theory and applications of reinforcement learning: The distribution of the environment in which the data is collected can differ from that of the environment in which the model is deployed.
no code implementations • 26 Feb 2023 • Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou
We consider a reinforcement learning setting in which the deployment environment is different from the training environment.
no code implementations • 15 Feb 2023 • Shengbo Wang, Jose Blanchet, Peter Glynn
We consider the optimal sample complexity theory of tabular reinforcement learning (RL) for maximizing the infinite horizon discounted reward in a Markov decision process (MDP).
no code implementations • 29 Mar 2022 • Shengbo Wang, Shiping Wen, Yin Yang, Yuting Cao, Kaibo Shi, TingWen Huang
This paper investigates the control barrier function (CBF) based safety-critical control for continuous nonlinear control affine systems using the more efficient online algorithms through time-varying optimization.
no code implementations • 27 Nov 2021 • Shengbo Wang, Bo Lyu, Shiping Wen, Kaibo Shi, Song Zhu, TingWen Huang
On the one hand, it is shown that the proposed control scheme can always guarantee the safety in the identification process with noised signal injection excitation, which was not considered in the previous study.
no code implementations • 27 Nov 2021 • Shengbo Wang, Shiping Wen, Kaibo Shi, Song Zhu, TingWen Huang
This paper introduces this method to tracking problem for heterogeneous linear systems.
1 code implementation • 15 Nov 2021 • Bo Lyu, Shengbo Wang, Shiping Wen, Kaibo Shi, Yin Yang, Lingfang Zeng, TingWen Huang
But the exploration of large-scale sparse graph computing on processing-in-memory (PIM) platforms (typically with memristive crossbars) is still in its infancy.