1 code implementation • 10 Apr 2024 • Wenqian Li, Haozhi Wang, Zhe Huang, Yan Pang
Wasserstein distance is a principle measure of data divergence from a distributional standpoint.
no code implementations • 16 Jun 2023 • Xinyuan Ji, Xu Zhang, Wei Xi, Haozhi Wang, Olga Gadyatskaya, Yinchuan Li
Multi-task reinforcement learning and meta-reinforcement learning have been developed to quickly adapt to new tasks, but they tend to focus on tasks with higher rewards and more frequent occurrences, leading to poor performance on tasks with sparse rewards.
no code implementations • 12 Apr 2023 • Haozhi Wang, Yinchuan Li, Qing Wang, Yunfeng Shao, Jianye Hao
We then define an adjacency space for mismatched states and design a plug-and-play module for value iteration, which enables agents to infer more precise returns.
1 code implementation • 4 Mar 2023 • Yinchuan Li, Shuang Luo, Haozhi Wang, Jianye Hao
Generative flow networks (GFlowNets), as an emerging technique, can be used as an alternative to reinforcement learning for exploratory control tasks.
no code implementations • 21 Sep 2022 • Haozhi Wang, Qing Wang, Yunfeng Shao, Dong Li, Jianye Hao, Yinchuan Li
Modern meta-reinforcement learning (Meta-RL) methods are mainly developed based on model-agnostic meta-learning, which performs policy gradient steps across tasks to maximize policy performance.
no code implementations • 29 Sep 2021 • Liyue Zhang, Qing Wang, Haozhi Wang
Intelligent reflecting surface (IRS) are able to amend radio propagation condition tasks on account of its functional properties in phase shift optimizing.