Search Results for author: Chao Yu

Found 21 papers, 5 papers with code

Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning

no code implementations15 Jun 2022 Wei Fu, Chao Yu, Zelai Xu, Jiaqi Yang, Yi Wu

Despite all the advantages, we revisit these two principles and show that in certain scenarios, e. g., environments with a highly multi-modal reward landscape, value decomposition, and parameter sharing can be problematic and lead to undesired outcomes.

Multi-agent Reinforcement Learning reinforcement-learning +1

ESCM$^2$: Entire Space Counterfactual Multi-Task Model for Post-Click Conversion Rate Estimation

1 code implementation3 Apr 2022 Hao Wang, Tai-Wei Chang, Tianqiao Liu, Jianmin Huang, Zhichao Chen, Chao Yu, Ruopeng Li, Wei Chu

In this paper, we theoretically demonstrate that ESMM suffers from the following two problems: (1) Inherent Estimation Bias (IEB), where the estimated CVR of ESMM is inherently higher than the ground truth; (2) Potential Independence Priority (PIP) for CTCVR estimation, where there is a risk that the ESMM overlooks the causality from click to conversion.

Recommendation Systems Selection bias

Constrained Sequence-to-Tree Generation for Hierarchical Text Classification

no code implementations2 Apr 2022 Chao Yu, Yi Shen, Yue Mao, Longjun Cai

Hierarchical Text Classification (HTC) is a challenging task where a document can be assigned to multiple hierarchically structured categories within a taxonomy.

Benchmark Classification +2

Passive Motion Detection via mmWave Communication System

no code implementations28 Mar 2022 Jie Li, Chao Yu, Yan Luo, Yifei Sun, Rui Wang

Relying on the passive sensing system, a dataset of received signals, where three types of hand gestures are sensed, is collected by using Line-of-Sight (LoS) and Non-Line-of-Sight (NLoS) paths as the reference channel respectively.

Hand Gesture Recognition Hand-Gesture Recognition +1

Creativity of AI: Automatic Symbolic Option Discovery for Facilitating Deep Reinforcement Learning

no code implementations18 Dec 2021 Mu Jin, Zhihao Ma, Kebing Jin, Hankz Hankui Zhuo, Chen Chen, Chao Yu

Despite of achieving great success in real life, Deep Reinforcement Learning (DRL) is still suffering from three critical issues, which are data efficiency, lack of the interpretability and transferability.

Montezuma's Revenge reinforcement-learning

Multi-Agent Vulnerability Discovery for Autonomous Driving with Hazard Arbitration Reward

no code implementations12 Dec 2021 Weilin Liu, Ye Mu, Chao Yu, Xuefei Ning, Zhong Cao, Yi Wu, Shuang Liang, Huazhong Yang, Yu Wang

These scenarios indeed correspond to the vulnerabilities of the under-test driving policies, thus are meaningful for their further improvements.

Autonomous Driving Multi-agent Reinforcement Learning

Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines

no code implementations18 Nov 2021 Xuejing Zheng, Chao Yu, Chen Chen, Jianye Hao, Hankz Hankui Zhuo

In this paper, we propose Lifelong reinforcement learning with Sequential linear temporal logic formulas and Reward Machines (LSRM), which enables an agent to leverage previously learned knowledge to fasten learning of logically specified tasks.

reinforcement-learning Transfer Learning

Coordinated Proximal Policy Optimization

1 code implementation NeurIPS 2021 Zifan Wu, Chao Yu, Deheng Ye, Junge Zhang, Haiyin Piao, Hankz Hankui Zhuo

We present Coordinated Proximal Policy Optimization (CoPPO), an algorithm that extends the original Proximal Policy Optimization (PPO) to the multi-agent setting.

Starcraft Starcraft II

Learning Efficient Multi-Agent Cooperative Visual Exploration

no code implementations12 Oct 2021 Chao Yu, Xinyi Yang, Jiaxuan Gao, Huazhong Yang, Yu Wang, Yi Wu

In this paper, we extend the state-of-the-art single-agent visual navigation method, Active Neural SLAM (ANS), to the multi-agent setting by introducing a novel RL-based planning module, Multi-agent Spatial Planner (MSP). MSP leverages a transformer-based architecture, Spatial-TeamFormer, which effectively captures spatial relations and intra-agent interactions via hierarchical spatial self-attentions.

Visual Navigation

Reinforcement Learning with Expert Trajectory For Quantitative Trading

no code implementations9 May 2021 Sihang Chen, Weiqi Luo, Chao Yu

In recent years, quantitative investment methods combined with artificial intelligence have attracted more and more attention from investors and researchers.

Q-Learning reinforcement-learning

Single-photon imaging over 200 km

no code implementations10 Mar 2021 Zheng-Ping Li, Jun-Tian Ye, Xin Huang, Peng-Yu Jiang, Yuan Cao, Yu Hong, Chao Yu, Jun Zhang, Qiang Zhang, Cheng-Zhi Peng, Feihu Xu, Jian-Wei Pan

Long-range active imaging has widespread applications in remote sensing and target recognition.

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

2 code implementations ICLR 2021 Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Du, Yu Wang, Yi Wu

We propose a simple, general and effective technique, Reward Randomization for discovering diverse strategic policies in complex multi-agent games.

The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games

6 code implementations2 Mar 2021 Chao Yu, Akash Velu, Eugene Vinitsky, Yu Wang, Alexandre Bayen, Yi Wu

Proximal Policy Optimization (PPO) is a popular on-policy reinforcement learning algorithm but is significantly less utilized than off-policy learning algorithms in multi-agent settings.

Starcraft Starcraft II

A Joint Training Dual-MRC Framework for Aspect Based Sentiment Analysis

no code implementations4 Jan 2021 Yue Mao, Yi Shen, Chao Yu, Longjun Cai

Some recent work focused on solving a combination of two subtasks, e. g., extracting aspect terms along with sentiment polarities or extracting the aspect and opinion terms pair-wisely.

Aspect-oriented Opinion Extraction Aspect Sentiment Triplet Extraction +3

Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms

no code implementations1 Jan 2021 Chao Yu, Akash Velu, Eugene Vinitsky, Yu Wang, Alexandre Bayen, Yi Wu

We benchmark commonly used multi-agent deep reinforcement learning (MARL) algorithms on a variety of cooperative multi-agent games.

reinforcement-learning Starcraft

Deep Learning-based Modulation Detection for NOMA Systems

no code implementations24 May 2020 Wenwu Xie, Jian Xiao, Jinxia Yang, Xin Peng, Chao Yu, Peng Zhu

Since the signal with strong power should be demodulated first for successive interference cancellation (SIC) demodulation in non-orthogonal multiple access (NOMA) systems, the base station (BS) should inform the near user terminal (UT), which has allocated higher power, of modulation mode of the far user terminal.


Symmetrical Gaussian Error Linear Units (SGELUs)

no code implementations10 Nov 2019 Chao Yu, Zhiguo Su

In this paper, a novel neural network activation function, called Symmetrical Gaussian Error Linear Unit (SGELU), is proposed to obtain high performance.

Reinforcement Learning in Healthcare: A Survey

no code implementations22 Aug 2019 Chao Yu, Jiming Liu, Shamim Nemati

As a subfield of machine learning, reinforcement learning (RL) aims at empowering one's capabilities in behavioural decision making by using interaction experience with the world and an evaluative feedback.

Decision Making Medical Diagnosis +1

Learning Shaping Strategies in Human-in-the-loop Interactive Reinforcement Learning

no code implementations10 Nov 2018 Chao Yu, Tianpei Yang, Wenxuan Zhu, Dongxu Wang, Guangliang Li

Providing reinforcement learning agents with informationally rich human knowledge can dramatically improve various aspects of learning.


The Price of Governance: A Middle Ground Solution to Coordination in Organizational Control

no code implementations9 Nov 2018 Chao Yu

We then propose a hierarchical supervision framework to explicitly model the PoG, and define step by step how to realize the core principle of the framework and compute the optimal PoG for a control problem.

DS-SLAM: A Semantic Visual SLAM towards Dynamic Environments

2 code implementations22 Sep 2018 Chao Yu, Zuxin Liu, Xinjun Liu, Fugui Xie, Yi Yang, Qi Wei, Qiao Fei

It is one of the state-of-the-art SLAM systems in high-dynamic environments.


Cannot find the paper you are looking for? You can Submit a new open access paper.