Search Results for author: Shaoshuai Mou

Found 17 papers, 6 papers with code

Distributed Optimization via Kernelized Multi-armed Bandits

no code implementations7 Dec 2023 Ayush Rai, Shaoshuai Mou

In the proposed approach, the agents sample their individual local functions in a way that benefits the whole network by utilizing a running consensus to estimate the upper confidence bound on the global function.

Decision Making Distributed Optimization +1

Adaptive Policy Learning to Additional Tasks

no code implementations24 May 2023 Wenjian Hao, Zehui Lu, Zihao Liang, Tianyu Zhou, Shaoshuai Mou

This paper develops a policy learning method for tuning a pre-trained policy to adapt to additional tasks without altering the original task.

Policy Gradient Methods

Policy Learning based on Deep Koopman Representation

no code implementations24 May 2023 Wenjian Hao, Paulo C. Heredia, Bowen Huang, Zehui Lu, Zihao Liang, Shaoshuai Mou

This paper proposes a policy learning algorithm based on the Koopman operator theory and policy gradient approach, which seeks to approximate an unknown dynamical system and search for optimal policy simultaneously, using the observations gathered through interaction with the environment.

A Data-Driven Approach for Inverse Optimal Control

no code implementations31 Mar 2023 Zihao Liang, Wenjian Hao, Shaoshuai Mou

By assuming the objective function to be learned is parameterized as a linear combination of features with unknown weights, the proposed approach for IOC is able to achieve a Koopman representation of the unknown dynamics and the unknown weights in objective function together.

Variable Sampling MPC via Differentiable Time-Warping Function

no code implementations20 Jan 2023 Zehui Lu, Shaoshuai Mou

Designing control inputs for a system that involves dynamical responses in multiple timescales is nontrivial.

Deep Koopman Learning of Nonlinear Time-Varying Systems

no code implementations12 Oct 2022 Wenjian Hao, Bowen Huang, Wei Pan, Di wu, Shaoshuai Mou

This paper presents a data-driven approach to approximate the dynamics of a nonlinear time-varying system (NTVS) by a linear time-varying system (LTVS), which is resulted from the Koopman operator and deep neural networks.

Computational Efficiency

Cooperative Tuning of Multi-Agent Optimal Control Systems

1 code implementation24 Sep 2022 Zehui Lu, Wanxin Jin, Shaoshuai Mou, Brian D. O. Anderson

Different from classical techniques for tuning parameters in a controller, we allow tunable parameters appearing in both the system dynamics and the objective functions of each agent.

Distributed Optimization

Resilience for Distributed Consensus with Constraints

no code implementations12 Jun 2022 Xuan Wang, Shaoshuai Mou, Shreyas Sundaram

By applying this new device to multi-agent systems, we introduce network and constraint redundancy conditions under which resilient constrained consensus can be achieved with an exponential convergence rate.

Safe Pontryagin Differentiable Programming

1 code implementation NeurIPS 2021 Wanxin Jin, Shaoshuai Mou, George J. Pappas

We propose a Safe Pontryagin Differentiable Programming (Safe PDP) methodology, which establishes a theoretical and algorithmic framework to solve a broad class of safety-critical learning and control tasks -- problems that require the guarantee of safety constraint satisfaction at any stage of the learning and control progress.

Motion Planning

Towards Resilience for Multi-Agent $QD$-Learning

no code implementations7 Apr 2021 Yijing Xie, Shaoshuai Mou, Shreyas Sundaram

This paper considers the multi-agent reinforcement learning (MARL) problem for a networked (peer-to-peer) system in the presence of Byzantine agents.

Multi-agent Reinforcement Learning Q-Learning

Learning from Human Directional Corrections

1 code implementation30 Nov 2020 Wanxin Jin, Todd D. Murphey, Zehui Lu, Shaoshuai Mou

This paper proposes a novel approach that enables a robot to learn an objective function incrementally from human directional corrections.

Motion Planning

Learning Objective Functions Incrementally by Inverse Optimal Control

no code implementations28 Oct 2020 Wanxin Jin, Zihao Liang, Shaoshuai Mou

This paper proposes an inverse optimal control method which enables a robot to incrementally learn a control objective function from a collection of trajectory segments.

Robotics

Learning from Sparse Demonstrations

2 code implementations5 Aug 2020 Wanxin Jin, Todd D. Murphey, Dana Kulić, Neta Ezer, Shaoshuai Mou

The time stamps of the keyframes can be different from the time of the robot's actual execution.

Motion Planning

Neural Certificates for Safe Control Policies

no code implementations15 Jun 2020 Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou

This paper develops an approach to learn a policy of a dynamical system that is guaranteed to be both provably safe and goal-reaching.

Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework

1 code implementation NeurIPS 2020 Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou

This paper develops a Pontryagin Differentiable Programming (PDP) methodology, which establishes a unified framework to solve a broad class of learning and control tasks.

Inverse Optimal Control with Incomplete Observations

2 code implementations21 Mar 2018 Wanxin Jin, Dana Kulić, Shaoshuai Mou, Sandra Hirche

We handle the problem by proposing the recovery matrix, which establishes a relationship between available observations of the trajectory and weights of given candidate features.

Robotics Systems and Control

Cannot find the paper you are looking for? You can Submit a new open access paper.