1 code implementation • 22 Dec 2021 • Rui Zhao, Jinming Song, Yufeng Yuan, Hu Haifeng, Yang Gao, Yi Wu, Zhongqian Sun, Yang Wei
We study the problem of training a Reinforcement Learning (RL) agent that is collaborative with humans without using any human data.
1 code implementation • 12 Oct 2021 • Xiaohui Wang, Yang Wei, Ying Xiong, Guyue Huang, Xian Qian, Yufei Ding, Mingxuan Wang, Lei Li
In this paper, we present LightSeq2, a system to accelerate training for a general family of Transformer models on GPUs.
no code implementations • ICLR 2022 • Haobo Fu, Weiming Liu, Shuang Wu, Yijia Wang, Tao Yang, Kai Li, Junliang Xing, Bin Li, Bo Ma, Qiang Fu, Yang Wei
The deep policy gradient method has demonstrated promising results in many large-scale games, where the agent learns purely from its own experience.
no code implementations • 29 Sep 2021 • Jingwei Yang, Qingchun Hou, Xiaoqing Wang, Yang Wei, Yuming Deng, Hongyang Jia, Ning Zhang
Such a problem is computationally hard to address by exact mathematical programming methods.
no code implementations • 8 Feb 2021 • Yang Wei, Yuanbin Wu, Man Lan
We propose a novel in-order chart-based model for constituent parsing.
1 code implementation • NAACL 2021 • Xiaohui Wang, Ying Xiong, Yang Wei, Mingxuan Wang, Lei Li
Transformer, BERT and their variants have achieved great success in natural language processing.
1 code implementation • 2 Sep 2020 • Jiuniu Wang, Wenjia Xu, Xingyu Fu, Yang Wei, Li Jin, Ziyan Chen, Guangluan Xu, Yirong Wu
This model enhances the question answering system in the multi-document scenario from three aspects: model structure, optimization goal, and training method, corresponding to Multilayer Attention (MA), Cross Evidence (CE), and Adversarial Training (AT) respectively.
1 code implementation • ACL 2020 • Yang Wei, Yuanbin Wu, Man Lan
We propose a novel linearization of a constituent tree, together with a new locally normalized model.
1 code implementation • CVPR 2019 Workshop • Xiuli Bi, Yang Wei, Bin Xiao, Weisheng Li
The core idea of RRU-Net is to strengthen how the CNN learns, inspired by the recall and consolidation mechanisms of the human brain and implemented through the propagation and feedback of residuals in the CNN.
1 code implementation • 4 Apr 2019 • Xin Chen, Anqi Pang, Yang Wei, Lan Xu, Jingyi Yu
In this paper, we present TightCap, a data-driven scheme to capture both the human shape and dressed garments accurately with only a single 3D human scan, which enables numerous applications such as virtual try-on, biometrics and body evaluation.
no code implementations • 3 Sep 2018 • Jiuniu Wang, Xingyu Fu, Guangluan Xu, Yirong Wu, Ziyan Chen, Yang Wei, Li Jin
Meanwhile, we construct A3Net for the WebQA dataset.
no code implementations • 9 Feb 2018 • Shuo Chen, Chen Gong, Jian Yang, Xiang Li, Yang Wei, Jun Li
In the distinguishment stage, a metric is learned exhaustively so that it distinguishes both the adversarial pairs and the original training pairs as well as possible.