Search Results for author: Zhaoyi Zhou

Found 2 papers, 2 papers with code

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

1 code implementation30 Oct 2023 Zhaoyi Zhou, Chuning Zhu, Runlong Zhou, Qiwen Cui, Abhishek Gupta, Simon Shaolei Du

Off-policy dynamic programming (DP) techniques such as $Q$-learning have proven to be important in sequential decision-making problems.

Decision Making Offline RL +1

Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games

1 code implementation8 Mar 2023 Zhaoyi Zhou, Zaiwei Chen, Yiheng Lin, Adam Wierman

The algorithm is scalable since each agent uses only local information and does not need access to the global state.

Cannot find the paper you are looking for? You can Submit a new open access paper.