no code implementations • 5 Apr 2024 • Zehui Lu, Hao Tu, Huazhen Fang, Yebin Wang, Shaoshuai Mou
A state-feedback model predictive control algorithm is then developed for integrated fast charging and active thermal management.
no code implementations • 7 Dec 2023 • Ayush Rai, Shaoshuai Mou
In the proposed approach, the agents sample their individual local functions in a way that benefits the whole network by utilizing a running consensus to estimate the upper confidence bound on the global function.
no code implementations • 24 May 2023 • Wenjian Hao, Zehui Lu, Zihao Liang, Tianyu Zhou, Shaoshuai Mou
This paper develops a policy learning method for tuning a pre-trained policy to adapt to additional tasks without altering the original task.
no code implementations • 24 May 2023 • Wenjian Hao, Paulo C. Heredia, Bowen Huang, Zehui Lu, Zihao Liang, Shaoshuai Mou
This paper proposes a policy learning algorithm based on the Koopman operator theory and policy gradient approach, which seeks to approximate an unknown dynamical system and search for optimal policy simultaneously, using the observations gathered through interaction with the environment.
no code implementations • 31 Mar 2023 • Zihao Liang, Wenjian Hao, Shaoshuai Mou
By assuming the objective function to be learned is parameterized as a linear combination of features with unknown weights, the proposed approach for IOC is able to achieve a Koopman representation of the unknown dynamics and the unknown weights in objective function together.
no code implementations • 20 Jan 2023 • Zehui Lu, Shaoshuai Mou
Designing control inputs for a system that involves dynamical responses in multiple timescales is nontrivial.
no code implementations • 12 Oct 2022 • Wenjian Hao, Bowen Huang, Wei Pan, Di wu, Shaoshuai Mou
This paper presents a data-driven approach to approximate the dynamics of a nonlinear time-varying system (NTVS) by a linear time-varying system (LTVS), which is resulted from the Koopman operator and deep neural networks.
1 code implementation • 24 Sep 2022 • Zehui Lu, Wanxin Jin, Shaoshuai Mou, Brian D. O. Anderson
Different from classical techniques for tuning parameters in a controller, we allow tunable parameters appearing in both the system dynamics and the objective functions of each agent.
no code implementations • 12 Jun 2022 • Xuan Wang, Shaoshuai Mou, Shreyas Sundaram
By applying this new device to multi-agent systems, we introduce network and constraint redundancy conditions under which resilient constrained consensus can be achieved with an exponential convergence rate.
1 code implementation • NeurIPS 2021 • Wanxin Jin, Shaoshuai Mou, George J. Pappas
We propose a Safe Pontryagin Differentiable Programming (Safe PDP) methodology, which establishes a theoretical and algorithmic framework to solve a broad class of safety-critical learning and control tasks -- problems that require the guarantee of safety constraint satisfaction at any stage of the learning and control progress.
no code implementations • 7 Apr 2021 • Yijing Xie, Shaoshuai Mou, Shreyas Sundaram
This paper considers the multi-agent reinforcement learning (MARL) problem for a networked (peer-to-peer) system in the presence of Byzantine agents.
1 code implementation • 30 Nov 2020 • Wanxin Jin, Todd D. Murphey, Zehui Lu, Shaoshuai Mou
This paper proposes a novel approach that enables a robot to learn an objective function incrementally from human directional corrections.
no code implementations • 28 Oct 2020 • Wanxin Jin, Zihao Liang, Shaoshuai Mou
This paper proposes an inverse optimal control method which enables a robot to incrementally learn a control objective function from a collection of trajectory segments.
Robotics
2 code implementations • 5 Aug 2020 • Wanxin Jin, Todd D. Murphey, Dana Kulić, Neta Ezer, Shaoshuai Mou
The time stamps of the keyframes can be different from the time of the robot's actual execution.
no code implementations • 15 Jun 2020 • Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou
This paper develops an approach to learn a policy of a dynamical system that is guaranteed to be both provably safe and goal-reaching.
1 code implementation • NeurIPS 2020 • Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou
This paper develops a Pontryagin Differentiable Programming (PDP) methodology, which establishes a unified framework to solve a broad class of learning and control tasks.
2 code implementations • 21 Mar 2018 • Wanxin Jin, Dana Kulić, Shaoshuai Mou, Sandra Hirche
We handle the problem by proposing the recovery matrix, which establishes a relationship between available observations of the trajectory and weights of given candidate features.
Robotics Systems and Control