Search Results for author: Yizheng Zhang

Found 5 papers, 0 papers with code

Relative Policy-Transition Optimization for Fast Policy Transfer

no code implementations13 Jun 2022 Jiawei Xu, Cheng Zhou, Yizheng Zhang, Baoxiang Wang, Lei Han

Integrating the two algorithms results in the complete Relative Policy-Transition Optimization (RPTO) algorithm, in which the policy interacts with the two environments simultaneously, such that data collections from two environments, policy and transition updates are completed in one closed loop to form a principled learning framework for policy transfer.

Continuous Control LEMMA +1

A General Theory of Relativity in Reinforcement Learning

no code implementations29 Sep 2021 Lei Han, Cheng Zhou, Yizheng Zhang

We propose a new general theory measuring the relativity between two arbitrary Markov Decision Processes (MDPs) from the perspective of reinforcement learning (RL).

reinforcement-learning Reinforcement Learning (RL)

Trade-off on Sim2Real Learning: Real-world Learning Faster than Simulations

no code implementations21 Jul 2020 Jingyi Huang, Yizheng Zhang, Fabio Giardina, Andre Rosendo

While considering Sim and Real learning, our experiments show that the sample-efficient Deep Bayesian RL performance is better than DRL even when computation time (as opposed to number of iterations) is taken in consideration.

Q-Learning

Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals

no code implementations8 Oct 2019 Yizheng Zhang, Andre Rosendo

Deep Reinforcement Learning (DRL) has shown its promising capabilities to learn optimal policies directly from trial and error.

Q-Learning reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.