Search Results for author: Jiawei Xu

Found 11 papers, 4 papers with code

TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning

1 code implementation25 Nov 2020 Peng Sun, Jiechao Xiong, Lei Han, Xinghai Sun, Shuxing Li, Jiawei Xu, Meng Fang, Zhengyou Zhang

This poses non-trivial difficulties for researchers or engineers and prevents the application of MARL to a broader range of real-world problems.

Dota 2 Multi-agent Reinforcement Learning +4

Superior Performance with Diversified Strategic Control in FPS Games Using General Reinforcement Learning

no code implementations29 Sep 2021 Shuxing Li, Jiawei Xu, Chun Yuan, Peng Sun, Zhuobin Zheng, Zhengyou Zhang, Lei Han

We provide comprehensive analysis and experiments to elaborate the effect of each component in affecting the agent performance, and demonstrate that the proposed and adopted techniques are important to achieve superior performance in general end-to-end FPS games.

FPS Games General Reinforcement Learning +2

Bootstrapped Hindsight Experience replay with Counterintuitive Prioritization

no code implementations29 Sep 2021 Jiawei Xu, Shuxing Li, Chun Yuan, Zhengyou Zhang, Lei Han

In this paper, inspired by Bootstrapped DQN, we use multiple heads in DDPG and take advantage of the diversity and uncertainty among multiple heads to improve the data efficiency with relabeled goals.

Q-Learning

Infrared and Visible Image Fusion via Interactive Compensatory Attention Adversarial Learning

1 code implementation29 Mar 2022 Zhishe Wang, Wenyu Shao, Yanlin Chen, Jiawei Xu, Xiaoqin Zhang

The existing generative adversarial fusion methods generally concatenate source images and extract local features through convolution operation, without considering their global characteristics, which tends to produce an unbalanced result and is biased towards the infrared image or visible image.

Infrared And Visible Image Fusion

SeqNet: An Efficient Neural Network for Automatic Malware Detection

no code implementations8 May 2022 Jiawei Xu, Wenxuan Fu, Haoyu Bu, Zhi Wang, Lingyun Ying

We demonstrate the effectiveness of our methods and the low training cost requirement of SeqNet in our experiments.

Efficient Neural Network Feature Engineering +2

Relative Policy-Transition Optimization for Fast Policy Transfer

no code implementations13 Jun 2022 Jiawei Xu, Cheng Zhou, Yizheng Zhang, Baoxiang Wang, Lei Han

Integrating the two algorithms results in the complete Relative Policy-Transition Optimization (RPTO) algorithm, in which the policy interacts with the two environments simultaneously, such that data collections from two environments, policy and transition updates are completed in one closed loop to form a principled learning framework for policy transfer.

Continuous Control LEMMA +1

LLA-FLOW: A Lightweight Local Aggregation on Cost Volume for Optical Flow Estimation

no code implementations17 Apr 2023 Jiawei Xu, Zongqing Lu, Qingmin Liao

Lack of texture often causes ambiguity in matching, and handling this issue is an important challenge in optical flow estimation.

Optical Flow Estimation

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

2 code implementations19 Oct 2023 Rui Yang, Han Zhong, Jiawei Xu, Amy Zhang, Chongjie Zhang, Lei Han, Tong Zhang

Offline reinforcement learning (RL) presents a promising approach for learning reinforced policies from offline datasets without the need for costly or unsafe interactions with the environment.

Offline RL Q-Learning +2

HyperAgent: A Simple, Scalable, Efficient and Provable Reinforcement Learning Framework for Complex Environments

no code implementations5 Feb 2024 Yingru Li, Jiawei Xu, Lei Han, Zhi-Quan Luo

To solve complex tasks under resource constraints, reinforcement learning (RL) agents need to be simple, efficient, and scalable, addressing (1) large state spaces and (2) the continuous accumulation of interaction data.

LEMMA Reinforcement Learning (RL)

Thought Graph: Generating Thought Process for Biological Reasoning

no code implementations11 Mar 2024 Chi-Yang Hsu, Kyle Cox, Jiawei Xu, Zhen Tan, Tianhua Zhai, Mengzhou Hu, Dexter Pratt, Tianlong Chen, Ziniu Hu, Ying Ding

We present the Thought Graph as a novel framework to support complex reasoning and use gene set analysis as an example to uncover semantic relationships between biological processes.

Cannot find the paper you are looking for? You can Submit a new open access paper.