Search Results for author: Haozhi Wang

Found 6 papers, 2 papers with code

Private Wasserstein Distance with Random Noises

1 code implementation · 10 Apr 2024 · Wenqian Li, Haozhi Wang, Zhe Huang, Yan Pang

Wasserstein distance is a principal measure of data divergence from a distributional standpoint.

Meta Generative Flow Networks with Personalization for Task-Specific Adaptation

no code implementations · 16 Jun 2023 · Xinyuan Ji, Xu Zhang, Wei Xi, Haozhi Wang, Olga Gadyatskaya, Yinchuan Li

Multi-task reinforcement learning and meta-reinforcement learning have been developed to quickly adapt to new tasks, but they tend to focus on tasks with higher rewards and more frequent occurrences, leading to poor performance on tasks with sparse rewards.

Meta-Learning · Meta Reinforcement Learning · +1

Multi-agent Policy Reciprocity with Theoretical Guarantee

no code implementations · 12 Apr 2023 · Haozhi Wang, Yinchuan Li, Qing Wang, Yunfeng Shao, Jianye Hao

We then define an adjacency space for mismatched states and design a plug-and-play module for value iteration, which enables agents to infer more precise returns.

Continuous Control · Multi-agent Reinforcement Learning · +1

CFlowNets: Continuous Control with Generative Flow Networks

1 code implementation · 4 Mar 2023 · Yinchuan Li, Shuang Luo, Haozhi Wang, Jianye Hao

Generative flow networks (GFlowNets), as an emerging technique, can be used as an alternative to reinforcement learning for exploratory control tasks.

Active Learning · Continuous Control · +2

On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies

no code implementations · 21 Sep 2022 · Haozhi Wang, Qing Wang, Yunfeng Shao, Dong Li, Jianye Hao, Yinchuan Li

Modern meta-reinforcement learning (Meta-RL) methods are mainly developed based on model-agnostic meta-learning, which performs policy gradient steps across tasks to maximize policy performance.

Continuous Control · Meta-Learning · +3

Multiple Intelligent Reflecting Surface aided Multi-user Weighted Sum-Rate Maximization using Manifold Optimization

no code implementations · 29 Sep 2021 · Liyue Zhang, Qing Wang, Haozhi Wang

Intelligent reflecting surfaces (IRSs) are able to improve radio propagation conditions owing to their ability to adjust phase shifts.
