Search Results for author: Shaoshuai Mou

Found 17 papers, 6 papers with code

Integrated Optimal Control for Fast Charging and Active Thermal Management of Lithium-Ion Batteries in Extreme Ambient Temperatures

no code implementations • 5 Apr 2024 • Zehui Lu, Hao Tu, Huazhen Fang, Yebin Wang, Shaoshuai Mou

A state-feedback model predictive control algorithm is then developed for integrated fast charging and active thermal management.

Management Model Predictive Control

Paper
Add Code

Distributed Optimization via Kernelized Multi-armed Bandits

no code implementations • 7 Dec 2023 • Ayush Rai, Shaoshuai Mou

In the proposed approach, the agents sample their individual local functions in a way that benefits the whole network by utilizing a running consensus to estimate the upper confidence bound on the global function.

Decision Making Distributed Optimization +1

Paper
Add Code

Adaptive Policy Learning to Additional Tasks

no code implementations • 24 May 2023 • Wenjian Hao, Zehui Lu, Zihao Liang, Tianyu Zhou, Shaoshuai Mou

This paper develops a policy learning method for tuning a pre-trained policy to adapt to additional tasks without altering the original task.

Policy Gradient Methods

Paper
Add Code

Policy Learning based on Deep Koopman Representation

no code implementations • 24 May 2023 • Wenjian Hao, Paulo C. Heredia, Bowen Huang, Zehui Lu, Zihao Liang, Shaoshuai Mou

This paper proposes a policy learning algorithm based on the Koopman operator theory and policy gradient approach, which seeks to approximate an unknown dynamical system and search for optimal policy simultaneously, using the observations gathered through interaction with the environment.

Paper
Add Code

A Data-Driven Approach for Inverse Optimal Control

no code implementations • 31 Mar 2023 • Zihao Liang, Wenjian Hao, Shaoshuai Mou

By assuming the objective function to be learned is parameterized as a linear combination of features with unknown weights, the proposed approach for IOC is able to achieve a Koopman representation of the unknown dynamics and the unknown weights in objective function together.

Paper
Add Code

Variable Sampling MPC via Differentiable Time-Warping Function

no code implementations • 20 Jan 2023 • Zehui Lu, Shaoshuai Mou

Designing control inputs for a system that involves dynamical responses in multiple timescales is nontrivial.

Paper
Add Code

Deep Koopman Learning of Nonlinear Time-Varying Systems

no code implementations • 12 Oct 2022 • Wenjian Hao, Bowen Huang, Wei Pan, Di wu, Shaoshuai Mou

This paper presents a data-driven approach to approximate the dynamics of a nonlinear time-varying system (NTVS) by a linear time-varying system (LTVS), which is resulted from the Koopman operator and deep neural networks.

Computational Efficiency

Paper
Add Code

Cooperative Tuning of Multi-Agent Optimal Control Systems

1 code implementation • 24 Sep 2022 • Zehui Lu, Wanxin Jin, Shaoshuai Mou, Brian D. O. Anderson

Different from classical techniques for tuning parameters in a controller, we allow tunable parameters appearing in both the system dynamics and the objective functions of each agent.

Distributed Optimization

Paper
Code

Resilience for Distributed Consensus with Constraints

no code implementations • 12 Jun 2022 • Xuan Wang, Shaoshuai Mou, Shreyas Sundaram

By applying this new device to multi-agent systems, we introduce network and constraint redundancy conditions under which resilient constrained consensus can be achieved with an exponential convergence rate.

Paper
Add Code

Safe Pontryagin Differentiable Programming

1 code implementation • NeurIPS 2021 • Wanxin Jin, Shaoshuai Mou, George J. Pappas

We propose a Safe Pontryagin Differentiable Programming (Safe PDP) methodology, which establishes a theoretical and algorithmic framework to solve a broad class of safety-critical learning and control tasks -- problems that require the guarantee of safety constraint satisfaction at any stage of the learning and control progress.

Motion Planning

Paper
Code

Towards Resilience for Multi-Agent $QD$-Learning

no code implementations • 7 Apr 2021 • Yijing Xie, Shaoshuai Mou, Shreyas Sundaram

This paper considers the multi-agent reinforcement learning (MARL) problem for a networked (peer-to-peer) system in the presence of Byzantine agents.

Multi-agent Reinforcement Learning Q-Learning

Paper
Add Code

Learning from Human Directional Corrections

1 code implementation • 30 Nov 2020 • Wanxin Jin, Todd D. Murphey, Zehui Lu, Shaoshuai Mou

This paper proposes a novel approach that enables a robot to learn an objective function incrementally from human directional corrections.

Motion Planning

Paper
Code

Learning Objective Functions Incrementally by Inverse Optimal Control

no code implementations • 28 Oct 2020 • Wanxin Jin, Zihao Liang, Shaoshuai Mou

This paper proposes an inverse optimal control method which enables a robot to incrementally learn a control objective function from a collection of trajectory segments.

Robotics

Paper
Add Code

Learning from Sparse Demonstrations

2 code implementations • 5 Aug 2020 • Wanxin Jin, Todd D. Murphey, Dana Kulić, Neta Ezer, Shaoshuai Mou

The time stamps of the keyframes can be different from the time of the robot's actual execution.

Motion Planning

144

Paper
Code

Neural Certificates for Safe Control Policies

no code implementations • 15 Jun 2020 • Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou

This paper develops an approach to learn a policy of a dynamical system that is guaranteed to be both provably safe and goal-reaching.

Paper
Add Code

Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework

1 code implementation • NeurIPS 2020 • Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou

This paper develops a Pontryagin Differentiable Programming (PDP) methodology, which establishes a unified framework to solve a broad class of learning and control tasks.

144

Paper
Code

Inverse Optimal Control with Incomplete Observations

2 code implementations • 21 Mar 2018 • Wanxin Jin, Dana Kulić, Shaoshuai Mou, Sandra Hirche

We handle the problem by proposing the recovery matrix, which establishes a relationship between available observations of the trajectory and weights of given candidate features.

Robotics Systems and Control

144

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.