Search Results for author: Wenhao Li

Found 36 papers, 11 papers with code

HiNet: Novel Multi-Scenario & Multi-Task Learning with Hierarchical Information Extraction

1 code implementation 10 Mar 2023 Jie Zhou, Xianshuai Cao, Wenhao Li, Lin Bo, Kun Zhang, Chuan Luo, Qian Yu

Multi-scenario & multi-task learning has been widely applied to many recommendation systems in industrial applications, wherein an effective and practical approach is to carry out multi-scenario transfer learning on the basis of the Mixture-of-Expert (MoE) architecture.

Multi-Task Learning Recommendation Systems

Feature Completion Transformer for Occluded Person Re-identification

no code implementations 3 Mar 2023 Tao Wang, Hong Liu, Wenhao Li, Miaoju Ban, Tuanyu Guo, Yidi Li

In this paper, different from most previous works that discard the occluded region, we propose a Feature Completion Transformer (FCFormer) to implicitly complement the semantic information of occluded parts in the feature space.

Person Re-Identification

Diverse Policy Optimization for Structured Action Space

1 code implementation 23 Feb 2023 Wenhao Li, Baoxiang Wang, Shanchao Yang, Hongyuan Zha

We propose a simple and effective RL method, Diverse Policy Optimization (DPO), to model the policies in structured action space as energy-based models (EBMs) by following the probabilistic RL framework.

Reinforcement Learning (RL)

HTNet: Human Topology Aware Network for 3D Human Pose Estimation

1 code implementation 20 Feb 2023 Jialun Cai, Hong Liu, Runwei Ding, Wenhao Li, Jianbing Wu, Miaoju Ban

3D human pose estimation errors would propagate along the human body topology and accumulate at the end joints of limbs.

3D Human Pose Estimation

Learning Roles with Emergent Social Value Orientations

no code implementations 31 Jan 2023 Wenhao Li, Xiangfeng Wang, Bo Jin, Jingyi Lu, Hongyuan Zha

Social dilemmas can be considered situations where individual rationality leads to collective irrationality.

Multi-agent Reinforcement Learning Role Embedding

Algorithmic Decision-Making Safeguarded by Human Knowledge

no code implementations 20 Nov 2022 Ningyuan Chen, Ming Hu, Wenhao Li

In view of such a conflict, we provide a general analytical framework to study the augmentation of algorithmic decisions with human knowledge: the analyst uses the knowledge to set a guardrail by which the algorithmic decision is clipped if the algorithmic output is out of bounds and seems unreasonable.

Decision Making

Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention

1 code implementation 14 Nov 2022 Wenhao Li, Xiaoyuan Yi, Jinyi Hu, Maosong Sun, Xing Xie

In this work, we dig into the intrinsic mechanism of this problem and find that sparser attention values in Transformer could improve diversity.

Text Generation

Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense

no code implementations 10 Nov 2022 Zhecan Wang, Haoxuan You, Yicheng He, Wenhao Li, Kai-Wei Chang, Shih-Fu Chang

Visual commonsense understanding requires Vision Language (VL) models to not only understand image and text but also cross-reference in-between to fully integrate and achieve comprehension of the visual scene described.

Recurrence Boosts Diversity! Revisiting Recurrent Latent Variable in Transformer-Based Variational AutoEncoder for Diverse Text Generation

no code implementations 22 Oct 2022 Jinyi Hu, Xiaoyuan Yi, Wenhao Li, Maosong Sun, Xing Xie

We demonstrate that TRACE could enhance the entanglement of each segment and preceding latent variables and deduce a non-zero lower bound of the KL term, providing a theoretical guarantee of generation diversity.

Text Generation

GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation

no code implementations 13 Jun 2022 Wenhao Li, Hong Liu, Tianyu Guo, Hao Tang, Runwei Ding

To the best of our knowledge, this is the first MLP-Like architecture for 3D human pose estimation in a single frame and a video sequence.

3D Human Pose Estimation Representation Learning

StyleBERT: Chinese pretraining by font style information

no code implementations 21 Feb 2022 Chao Lv, Han Zhang, Xinkai Du, Yunhao Zhang, Ying Huang, Wenhao Li, Jia Han, Shanshan Gu

With the success of downstream tasks using English pre-trained language models, pre-trained Chinese language models are also necessary to achieve better performance on Chinese NLP tasks.

Language Modelling

Multi-Agent Path Finding with Prioritized Communication Learning

1 code implementation 8 Feb 2022 Wenhao Li, Hongjun Chen, Bo Jin, Wenzhe Tan, Hongyuan Zha, Xiangfeng Wang

The learning-based, fully decentralized framework has been introduced to alleviate real-time problems and simultaneously pursue optimal planning policy.

Multi-Agent Path Finding Multi-agent Reinforcement Learning +1

VMAgent: Scheduling Simulator for Reinforcement Learning

2 code implementations 9 Dec 2021 Junjie Sheng, Shengliang Cai, Haochuan Cui, Wenhao Li, Yun Hua, Bo Jin, Wenli Zhou, Yiqiu Hu, Lei Zhu, Qian Peng, Hongyuan Zha, Xiangfeng Wang

A novel simulator called VMAgent is introduced to help RL researchers better explore new methods, especially for virtual machine scheduling.

reinforcement-learning Reinforcement Learning (RL) +1

Interactive Medical Image Segmentation with Self-Adaptive Confidence Calibration

no code implementations 15 Nov 2021 Wenhao Li, Qisen Xu, Chuyun Shen, Bin Hu, Fengping Zhu, Yuxin Li, Bo Jin, Xiangfeng Wang

Based on the confidence information, a self-adaptive reward function is designed to provide more detailed feedback, and a simulated label generation mechanism is proposed on unsupervised data to reduce over-reliance on labeled data.

Image Segmentation Interactive Segmentation +3

CCPM: A Chinese Classical Poetry Matching Dataset

1 code implementation 3 Jun 2021 Wenhao Li, Fanchao Qi, Maosong Sun, Xiaoyuan Yi, Jiarui Zhang

We hope this dataset can further enhance the study on incorporating deep semantics into the understanding and generation system of Chinese classical poetry.

Translation

Dealing with Non-Stationarity in MARL via Trust-Region Decomposition

no code implementations ICLR 2022 Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Hongyuan Zha

In this paper, we introduce a novel notion, the $\delta$-measurement, to explicitly measure the non-stationarity of a policy sequence, which can be further proved to be bounded by the KL-divergence of consecutive joint policies.

Multi-agent Reinforcement Learning

Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning

no code implementations 9 Feb 2021 Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Yun Hua, Hongyuan Zha

In order to improve the efficiency of cooperation and exploration, we propose a structured diversification emergence MARL framework named Rochico based on reinforced organization control and hierarchical consensus learning.

Multi-agent Reinforcement Learning

Malicious Requests Detection with Improved Bidirectional Long Short-term Memory Neural Networks

no code implementations 26 Oct 2020 Wenhao Li, Bincheng Zhang, Jiajie Zhang

Detecting and intercepting malicious requests is one of the most widely used defenses against attacks in network security.

Few-Shot Learning Metric Learning +1

Using Information to Amplify Competition

no code implementations 11 Oct 2020 Wenhao Li

I characterize the consumer-optimal market segmentation in competitive markets where multiple firms sell differentiated products to consumers with unit demand.

Dimension Reduction in Contextual Online Learning via Nonparametric Variable Selection

no code implementations 17 Sep 2020 Wenhao Li, Ningyuan Chen, L. Jeff Hong

Our algorithm achieves the regret $\tilde{O}(T^{(d_x^*+d_y+1)/(d_x^*+d_y+2)})$, where $d_x^*$ is the effective covariate dimension.

Dimensionality Reduction Variable Selection

F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning

no code implementations 17 Apr 2020 Wenhao Li, Bo Jin, Xiangfeng Wang, Junchi Yan, Hongyuan Zha

Traditional centralized multi-agent reinforcement learning (MARL) algorithms are sometimes impractical in complicated applications, due to non-interactivity between agents, the curse of dimensionality, and computational complexity.

Multi-agent Reinforcement Learning Reinforcement Learning (RL) +2

MixPoet: Diverse Poetry Generation via Learning Controllable Mixed Latent Space

no code implementations 13 Mar 2020 Xiaoyuan Yi, Ruoyu Li, Cheng Yang, Wenhao Li, Maosong Sun

Though recent neural models make prominent progress in some criteria of poetry quality, generated poems still suffer from the problem of poor diversity.

HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning Problem

no code implementations 11 Feb 2020 Yun Hua, Xiangfeng Wang, Bo Jin, Wenhao Li, Junchi Yan, Xiaofeng He, Hongyuan Zha

In spite of the success of existing meta reinforcement learning methods, they still have difficulty in learning a meta policy effectively for RL problems with sparse reward.

Meta-Learning Meta Reinforcement Learning +2

Iteratively-Refined Interactive 3D Medical Image Segmentation with Multi-Agent Reinforcement Learning

no code implementations CVPR 2020 Xuan Liao, Wenhao Li, Qisen Xu, Xiangfeng Wang, Bo Jin, Xiaoyun Zhang, Ya Zhang, Yan-Feng Wang

We here propose to model the dynamic process of iterative interactive image segmentation as a Markov decision process (MDP) and solve it with reinforcement learning (RL).

Image Segmentation Medical Image Segmentation +4

A Dimension-free Algorithm for Contextual Continuum-armed Bandits

no code implementations 15 Jul 2019 Wenhao Li, Ningyuan Chen, L. Jeff Hong

The literature has shown that for Lipschitz-continuous functions, the optimal regret is $\tilde{O}(T^{\frac{d_x+d_y+1}{d_x+d_y+2}})$, where $d_x$ and $d_y$ are the dimensions of contexts and arms, and thus suffers from the curse of dimensionality.

Jiuge: A Human-Machine Collaborative Chinese Classical Poetry Generation System

no code implementations ACL 2019 Guo Zhipeng, Xiaoyuan Yi, Maosong Sun, Wenhao Li, Cheng Yang, Jiannan Liang, Huimin Chen, Yuhui Zhang, Ruoyu Li

By exposing the options of poetry genres, styles and revision modes, Jiuge, acting as a professional assistant, allows constant and active participation of users in poetic creation.

Stylistic Chinese Poetry Generation via Unsupervised Style Disentanglement

no code implementations EMNLP 2018 Cheng Yang, Maosong Sun, Xiaoyuan Yi, Wenhao Li

The ability to write diverse poems in different styles under the same poetic imagery is an important characteristic of human poetry writing.

Disentanglement Machine Translation +1
