Search Results for author: Haozhi Wang

Found 6 papers, 2 papers with code

Private Wasserstein Distance with Random Noises

1 code implementation • 10 Apr 2024 • Wenqian Li, Haozhi Wang, Zhe Huang, Yan Pang

Wasserstein distance is a principle measure of data divergence from a distributional standpoint.

Paper
Code

Meta Generative Flow Networks with Personalization for Task-Specific Adaptation

no code implementations • 16 Jun 2023 • Xinyuan Ji, Xu Zhang, Wei Xi, Haozhi Wang, Olga Gadyatskaya, Yinchuan Li

Multi-task reinforcement learning and meta-reinforcement learning have been developed to quickly adapt to new tasks, but they tend to focus on tasks with higher rewards and more frequent occurrences, leading to poor performance on tasks with sparse rewards.

Meta-Learning Meta Reinforcement Learning +1

Paper
Add Code

Multi-agent Policy Reciprocity with Theoretical Guarantee

no code implementations • 12 Apr 2023 • Haozhi Wang, Yinchuan Li, Qing Wang, Yunfeng Shao, Jianye Hao

We then define an adjacency space for mismatched states and design a plug-and-play module for value iteration, which enables agents to infer more precise returns.

Continuous Control Multi-agent Reinforcement Learning +1

Paper
Add Code

CFlowNets: Continuous Control with Generative Flow Networks

1 code implementation • 4 Mar 2023 • Yinchuan Li, Shuang Luo, Haozhi Wang, Jianye Hao

Generative flow networks (GFlowNets), as an emerging technique, can be used as an alternative to reinforcement learning for exploratory control tasks.

Active Learning Continuous Control +2

Paper
Code

On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies

no code implementations • 21 Sep 2022 • Haozhi Wang, Qing Wang, Yunfeng Shao, Dong Li, Jianye Hao, Yinchuan Li

Modern meta-reinforcement learning (Meta-RL) methods are mainly developed based on model-agnostic meta-learning, which performs policy gradient steps across tasks to maximize policy performance.

Continuous Control Meta-Learning +3

Paper
Add Code

Multiple Intelligent Reflecting Surface aided Multi-user Weighted Sum-Rate Maximization using Manifold Optimization

no code implementations • 29 Sep 2021 • Liyue Zhang, Qing Wang, Haozhi Wang

Intelligent reflecting surface (IRS) are able to amend radio propagation condition tasks on account of its functional properties in phase shift optimizing.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.