Search Results for author: Boyi Liu

Found 17 papers, 0 papers with code

BooVI: Provably Efficient Bootstrapped Value Iteration

no code implementations NeurIPS 2021 Boyi Liu, Qi Cai, Zhuoran Yang, Zhaoran Wang

Despite the tremendous success of reinforcement learning (RL) with function approximation, efficient exploration remains a significant challenge, both practically and theoretically.

Efficient Exploration

Inducing Equilibria via Incentives: Simultaneous Design-and-Play Finds Global Optima

no code implementations 4 Oct 2021 Boyi Liu, Jiayang Li, Zhuoran Yang, Hoi-To Wai, Mingyi Hong, Yu Marco Nie, Zhaoran Wang

To regulate a social system comprised of self-interested agents, economic incentives (e.g., taxes, tolls, and subsidies) are often required to induce a desirable outcome.

Multi-Level Features Contrastive Networks for Unsupervised Domain Adaptation

no code implementations 14 Sep 2021 Le Liu, Jieren Cheng, Boyi Liu, Yue Yang, Ke Zhou, Qiaobo Da

As a result, it needs to reduce the data distribution difference between the two domains to improve the model's generalization ability.

Unsupervised Domain Adaptation

Policy Optimization in Zero-Sum Markov Games: Fictitious Self-Play Provably Attains Nash Equilibria

no code implementations 1 Jan 2021 Boyi Liu, Zhuoran Yang, Zhaoran Wang

Specifically, in each iteration, each player infers the policy of the opponent implicitly via policy evaluation and improves its current policy by taking the smoothed best-response via a proximal policy optimization (PPO) step.

A Real-time Contribution Measurement Method for Participants in Federated Learning

no code implementations 28 Sep 2020 Bingjie Yan, Yize Zhou, Boyi Liu, Jun Wang, Yuhan Zhang, Li Liu, Xiaolan Nie, Zhiwei Fan, Zhixuan Liang

However, there is a lack of a sufficiently reasonable contribution measurement mechanism to distribute the reward for each agent.

Federated Learning

Experiments of Federated Learning for COVID-19 Chest X-ray Images

no code implementations 5 Jul 2020 Boyi Liu, Bingjie Yan, Yize Zhou, Yifan Yang, Yixian Zhang

However, for the protection and respect of the privacy of patients, the hospital's specific medical-related data did not allow leakage and sharing without permission.

Federated Learning

Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy

no code implementations NeurIPS 2019 Boyi Liu, Qi Cai, Zhuoran Yang, Zhaoran Wang

Proximal policy optimization and trust region policy optimization (PPO and TRPO) with actor and critic parametrized by neural networks achieve significant empirical success in deep reinforcement learning.

Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy

no code implementations 25 Jun 2019 Boyi Liu, Qi Cai, Zhuoran Yang, Zhaoran Wang

Proximal policy optimization and trust region policy optimization (PPO and TRPO) with actor and critic parametrized by neural networks achieve significant empirical success in deep reinforcement learning.

Traffic Flow Combination Forecasting Method Based on Improved LSTM and ARIMA

no code implementations 25 Jun 2019 Boyi Liu, Xiangyan Tang, Jieren Cheng, Pengchao Shi

In this paper, we define the traffic data time singularity ratio in the dropout module and propose a combination prediction method based on the improved long short-term memory neural network and time series autoregressive integrated moving average model (SDLSTM-ARIMA), which is derived from the Recurrent Neural Networks (RNN) model.

Time Series, Traffic Prediction

Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems

no code implementations 19 Jan 2019 Boyi Liu, Lujia Wang, Ming Liu

To address the problem, we present a learning architecture for navigation in cloud robotic systems: Lifelong Federated Reinforcement Learning (LFRL).

Robot Navigation, Transfer Learning
