Search Results for author: Wenhao Li

Found 60 papers, 21 papers with code

InterFusion: Text-Driven Generation of 3D Human-Object Interaction

no code implementations 22 Mar 2024 Sisi Dai, Wenhao Li, Haowen Sun, Haibin Huang, Chongyang Ma, Hui Huang, Kai Xu, Ruizhen Hu

In this study, we tackle the complex task of generating 3D human-object interactions (HOI) from textual descriptions in a zero-shot text-to-3D manner.

3D Generation Human-Object Interaction Detection +2

Learning Dual-arm Object Rearrangement for Cartesian Robots

no code implementations 21 Feb 2024 Shishun Zhang, Qijin She, Wenhao Li, Chenyang Zhu, Yongjun Wang, Ruizhen Hu, Kai Xu

To achieve the goal, the core idea is to develop an effective object-to-arm task assignment strategy for minimizing the cumulative task execution time and maximizing the dual-arm cooperation efficiency.

Computational Efficiency Object +1

Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation

no code implementations 4 Feb 2024 Ti Wang, Mengyuan Liu, Hong Liu, Bin Ren, Yingxuan You, Wenhao Li, Nicu Sebe, Xia Li

We observe that previous optimization-based methods commonly rely on projection constraint, which only ensures alignment in 2D space, potentially leading to the overfitting problem.

3D Human Pose Estimation

Complementary Information Mutual Learning for Multimodality Medical Image Segmentation

no code implementations 5 Jan 2024 Chuyun Shen, Wenhao Li, Haoqing Chen, Xiaoling Wang, Fengping Zhu, Yuxin Li, Xiangfeng Wang, Bo Jin

CIML adopts the idea of addition and removes inter-modal redundant information through inductive bias-driven task decomposition and message passing-based redundancy filtering.

Image Segmentation Inductive Bias +4

Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym

1 code implementation 6 Dec 2023 Junjie Sheng, Zixiao Huang, Chuyun Shen, Wenhao Li, Yun Hua, Bo Jin, Hongyuan Zha, Xiangfeng Wang

The formidable capacity for zero- or few-shot decision-making in language agents encourages us to pose a compelling question: Can language agents be alternatives to PPO agents in traditional sequential decision-making tasks?

Benchmarking Decision Making +1

Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation

1 code implementation 20 Nov 2023 Wenhao Li, Mengyuan Liu, Hong Liu, Pichao Wang, Jialun Cai, Nicu Sebe

Transformers have been successfully applied in the field of video-based 3D human pose estimation.

3D Human Pose Estimation

Efficient Planning with Latent Diffusion

no code implementations 30 Sep 2023 Wenhao Li

We establish the theoretical equivalence between planning in the latent action space and energy-guided sampling with a pretrained diffusion model and incorporate a novel sequence-level exact sampling method.

Representation Learning

Co-Evolution of Pose and Mesh for 3D Human Body Estimation from Video

1 code implementation ICCV 2023 Yingxuan You, Hong Liu, Ti Wang, Wenhao Li, Runwei Ding, Xia Li

Despite significant progress in single image-based 3D human mesh recovery, accurately and smoothly recovering 3D human motion from a video remains challenging.

3D Human Pose Estimation Decoder +1

FastLLVE: Real-Time Low-Light Video Enhancement with Intensity-Aware Lookup Table

1 code implementation 13 Aug 2023 Wenhao Li, Guangyang Wu, Wenyi Wang, Peiran Ren, Xiaohong Liu

Experimental results on benchmark datasets demonstrate that our method achieves the State-Of-The-Art (SOTA) performance in terms of both image quality and inter-frame brightness consistency.

Video Enhancement

Joint Adversarial and Collaborative Learning for Self-Supervised Action Recognition

1 code implementation 15 Jul 2023 Tianyu Guo, Mengyuan Liu, Hong Liu, Wenhao Li, Jingwen Guo, Tao Wang, Yidi Li

Considering the instance-level discriminative ability, contrastive learning methods, including MoCo and SimCLR, have been adapted from the original image representation learning task to solve the self-supervised skeleton-based action recognition task.

Contrastive Learning Ensemble Learning +4

Allocating Divisible Resources on Arms with Unknown and Random Rewards

no code implementations 28 Jun 2023 Ningyuan Chen, Wenhao Li

We consider a decision maker allocating one unit of renewable and divisible resource in each period on a number of arms.

MCPI: Integrating Multimodal Data for Enhanced Prediction of Compound Protein Interactions

no code implementations 15 Jun 2023 Li Zhang, Wenhao Li, Haotian Guan, Zhiquan He, Mingjun Cheng, Han Wang

The identification of compound-protein interactions (CPI) plays a critical role in drug screening, drug repurposing, and combination therapy studies.

Negotiated Reasoning: On Provably Addressing Relative Over-Generalization

no code implementations 8 Jun 2023 Junjie Sheng, Wenhao Li, Bo Jin, Hongyuan Zha, Jun Wang, Xiangfeng Wang

Recent methods have shown that assigning reasoning ability to agents can mitigate RO algorithmically and empirically, but there has been a lack of theoretical understanding of RO, let alone designing provably RO-free methods.

Multi-agent Reinforcement Learning

LLMs and the Abstraction and Reasoning Corpus: Successes, Failures, and the Importance of Object-based Representations

1 code implementation 26 May 2023 Yudong Xu, Wenhao Li, Pashootan Vaezipoor, Scott Sanner, Elias B. Khalil

Although the state-of-the-art GPT-4 is unable to "reason" perfectly within non-language domains such as the 1D-ARC or a simple ARC subset, our study reveals that the use of object-based representations can significantly improve its reasoning ability.

Language Modelling Large Language Model

Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots

no code implementations 19 May 2023 Jinyi Hu, Xu Han, Xiaoyuan Yi, Yutong Chen, Wenhao Li, Zhiyuan Liu, Maosong Sun

IAP optimizes only a separate Chinese text encoder with all other parameters fixed to align Chinese semantics space to the English one in CLIP.

Cross-Lingual Transfer Image Generation

Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning

no code implementations 18 May 2023 Wenhao Li, Dan Qiao, Baoxiang Wang, Xiangfeng Wang, Bo Jin, Hongyuan Zha

The difficulty of appropriately assigning credit is particularly heightened in cooperative MARL with sparse reward, due to the concurrent time and structural scales involved.

Decision Making Multi-agent Reinforcement Learning +2

Information Design in Multi-Agent Reinforcement Learning

1 code implementation NeurIPS 2023 Yue Lin, Wenhao Li, Hongyuan Zha, Baoxiang Wang

To thrive in those environments, the agent needs to influence other agents so their actions become more helpful and less harmful.

Multi-agent Reinforcement Learning reinforcement-learning +1

Deep Learning for Solving and Estimating Dynamic Macro-Finance Models

no code implementations 5 May 2023 Benjamin Fan, Edward Qiao, Anran Jiao, Zhouzhou Gu, Wenhao Li, Lu Lu

We develop a methodology that utilizes deep learning to simultaneously solve and estimate canonical continuous-time general equilibrium models in financial economics.

Interweaved Graph and Attention Network for 3D Human Pose Estimation

1 code implementation 27 Apr 2023 Ti Wang, Hong Liu, Runwei Ding, Wenhao Li, Yingxuan You, Xia Li

Despite substantial progress in 3D human pose estimation from a single-view image, prior works rarely explore global and local correlations, leading to insufficient learning of human skeleton representations.

3D Human Pose Estimation

HiNet: Novel Multi-Scenario & Multi-Task Learning with Hierarchical Information Extraction

1 code implementation 10 Mar 2023 Jie zhou, Xianshuai Cao, Wenhao Li, Lin Bo, Kun Zhang, Chuan Luo, Qian Yu

Multi-scenario & multi-task learning has been widely applied to many recommendation systems in industrial applications, wherein an effective and practical approach is to carry out multi-scenario transfer learning on the basis of the Mixture-of-Expert (MoE) architecture.

Multi-Task Learning Recommendation Systems

Feature Completion Transformer for Occluded Person Re-identification

no code implementations 3 Mar 2023 Tao Wang, Mengyuan Liu, Hong Liu, Wenhao Li, Miaoju Ban, Tuanyu Guo, Yidi Li

In this paper, different from most previous works that discard the occluded region, we propose a Feature Completion Transformer (FCFormer) to implicitly complement the semantic information of occluded parts in the feature space.

Person Re-Identification

Diverse Policy Optimization for Structured Action Space

1 code implementation 23 Feb 2023 Wenhao Li, Baoxiang Wang, Shanchao Yang, Hongyuan Zha

We propose a simple and effective RL method, Diverse Policy Optimization (DPO), to model the policies in structured action space as the energy-based models (EBM) by following the probabilistic RL framework.

Reinforcement Learning (RL)

HTNet: Human Topology Aware Network for 3D Human Pose Estimation

1 code implementation 20 Feb 2023 Jialun Cai, Hong Liu, Runwei Ding, Wenhao Li, Jianbing Wu, Miaoju Ban

3D human pose estimation errors would propagate along the human body topology and accumulate at the end joints of limbs.

3D Human Pose Estimation

Learning Roles with Emergent Social Value Orientations

no code implementations 31 Jan 2023 Wenhao Li, Xiangfeng Wang, Bo Jin, Jingyi Lu, Hongyuan Zha

Social dilemmas can be considered situations where individual rationality leads to collective irrationality.

Multi-agent Reinforcement Learning Role Embedding

Algorithmic Decision-Making Safeguarded by Human Knowledge

no code implementations 20 Nov 2022 Ningyuan Chen, Ming Hu, Wenhao Li

In view of such a conflict, we provide a general analytical framework to study the augmentation of algorithmic decisions with human knowledge: the analyst uses the knowledge to set a guardrail by which the algorithmic decision is clipped if the algorithmic output is out of bound, and seems unreasonable.

Decision Making

Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention

1 code implementation 14 Nov 2022 Wenhao Li, Xiaoyuan Yi, Jinyi Hu, Maosong Sun, Xing Xie

In this work, we dig into the intrinsic mechanism of this problem and find that sparser attention values in Transformer could improve diversity.

Attribute Text Generation

Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense

no code implementations 10 Nov 2022 Zhecan Wang, Haoxuan You, Yicheng He, Wenhao Li, Kai-Wei Chang, Shih-Fu Chang

Visual commonsense understanding requires Vision Language (VL) models to not only understand image and text but also cross-reference in-between to fully integrate and achieve comprehension of the visual scene described.

Recurrence Boosts Diversity! Revisiting Recurrent Latent Variable in Transformer-Based Variational AutoEncoder for Diverse Text Generation

no code implementations 22 Oct 2022 Jinyi Hu, Xiaoyuan Yi, Wenhao Li, Maosong Sun, Xing Xie

We demonstrate that TRACE could enhance the entanglement of each segment and preceding latent variables and deduce a non-zero lower bound of the KL term, providing a theoretical guarantee of generation diversity.

Text Generation

GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation

1 code implementation 13 Jun 2022 Wenhao Li, Hong Liu, Tianyu Guo, Runwei Ding, Hao Tang

To the best of our knowledge, this is the first MLP-Like architecture for 3D human pose estimation in a single frame and a video sequence.

3D Human Pose Estimation Representation Learning

StyleBERT: Chinese pretraining by font style information

no code implementations 21 Feb 2022 Chao Lv, Han Zhang, Xinkai Du, Yunhao Zhang, Ying Huang, Wenhao Li, Jia Han, Shanshan Gu

With the success of downstream tasks using English pre-trained language models, a pre-trained Chinese language model is also necessary to achieve better performance on Chinese NLP tasks.

Language Modelling

Multi-Agent Path Finding with Prioritized Communication Learning

1 code implementation 8 Feb 2022 Wenhao Li, Hongjun Chen, Bo Jin, Wenzhe Tan, Hongyuan Zha, Xiangfeng Wang

The learning-based, fully decentralized framework has been introduced to alleviate real-time problems and simultaneously pursue optimal planning policy.

Multi-Agent Path Finding Multi-agent Reinforcement Learning +1

VMAgent: Scheduling Simulator for Reinforcement Learning

2 code implementations 9 Dec 2021 Junjie Sheng, Shengliang Cai, Haochuan Cui, Wenhao Li, Yun Hua, Bo Jin, Wenli Zhou, Yiqiu Hu, Lei Zhu, Qian Peng, Hongyuan Zha, Xiangfeng Wang

A novel simulator called VMAgent is introduced to help RL researchers better explore new methods, especially for virtual machine scheduling.

Cloud Computing reinforcement-learning +2

Interactive Medical Image Segmentation with Self-Adaptive Confidence Calibration

no code implementations 15 Nov 2021 Wenhao Li, Qisen Xu, Chuyun Shen, Bin Hu, Fengping Zhu, Yuxin Li, Bo Jin, Xiangfeng Wang

Based on the confidential information, a self-adaptive reward function is designed to provide more detailed feedback, and a simulated label generation mechanism is proposed on unsupervised data to reduce over-reliance on labeled data.

Image Segmentation Interactive Segmentation +4

CCPM: A Chinese Classical Poetry Matching Dataset

1 code implementation 3 Jun 2021 Wenhao Li, Fanchao Qi, Maosong Sun, Xiaoyuan Yi, Jiarui Zhang

We hope this dataset can further enhance the study on incorporating deep semantics into the understanding and generation system of Chinese classical poetry.


Dealing with Non-Stationarity in MARL via Trust-Region Decomposition

no code implementations ICLR 2022 Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Hongyuan Zha

In this paper, we introduce a novel notion, the $\delta$-measurement, to explicitly measure the non-stationarity of a policy sequence, which can be further proved to be bounded by the KL-divergence of consecutive joint policies.

Multi-agent Reinforcement Learning

Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning

no code implementations9 Feb 2021 Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Yun Hua, Hongyuan Zha

In order to improve the efficiency of cooperation and exploration, we propose a structured diversification emergence MARL framework named Rochico based on reinforced organization control and hierarchical consensus learning.

Multi-agent Reinforcement Learning

Malicious Requests Detection with Improved Bidirectional Long Short-term Memory Neural Networks

no code implementations 26 Oct 2020 Wenhao Li, Bincheng Zhang, Jiajie Zhang

Detecting and intercepting malicious requests is one of the most widely used defenses against attacks in network security.

Few-Shot Learning Metric Learning +1

Using Information to Amplify Competition

no code implementations 11 Oct 2020 Wenhao Li

I characterize the consumer-optimal market segmentation in competitive markets where multiple firms sell differentiated products to consumers with unit demand.


Dimension Reduction in Contextual Online Learning via Nonparametric Variable Selection

no code implementations 17 Sep 2020 Wenhao Li, Ningyuan Chen, L. Jeff Hong

Our algorithm achieves the regret $\tilde{O}(T^{(d_x^*+d_y+1)/(d_x^*+d_y+2)})$, where $d_x^*$ is the effective covariate dimension.

Dimensionality Reduction Variable Selection

F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning

no code implementations 17 Apr 2020 Wenhao Li, Bo Jin, Xiangfeng Wang, Junchi Yan, Hongyuan Zha

Traditional centralized multi-agent reinforcement learning (MARL) algorithms are sometimes impractical in complicated applications, due to non-interactivity between agents, the curse of dimensionality, and computational complexity.

Multi-agent Reinforcement Learning Reinforcement Learning (RL) +2

MixPoet: Diverse Poetry Generation via Learning Controllable Mixed Latent Space

no code implementations 13 Mar 2020 Xiaoyuan Yi, Ruoyu Li, Cheng Yang, Wenhao Li, Maosong Sun

Though recent neural models make prominent progress in some criteria of poetry quality, generated poems still suffer from the problem of poor diversity.

HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning Problem

no code implementations 11 Feb 2020 Yun Hua, Xiangfeng Wang, Bo Jin, Wenhao Li, Junchi Yan, Xiaofeng He, Hongyuan Zha

In spite of the success of existing meta reinforcement learning methods, they still have difficulty in learning a meta policy effectively for RL problems with sparse reward.

Meta-Learning Meta Reinforcement Learning +2

Iteratively-Refined Interactive 3D Medical Image Segmentation with Multi-Agent Reinforcement Learning

no code implementations CVPR 2020 Xuan Liao, Wenhao Li, Qisen Xu, Xiangfeng Wang, Bo Jin, Xiaoyun Zhang, Ya zhang, Yan-Feng Wang

We here propose to model the dynamic process of iterative interactive image segmentation as a Markov decision process (MDP) and solve it with reinforcement learning (RL).

Image Segmentation Medical Image Segmentation +5

A Dimension-free Algorithm for Contextual Continuum-armed Bandits

no code implementations 15 Jul 2019 Wenhao Li, Ningyuan Chen, L. Jeff Hong

The literature has shown that for Lipschitz-continuous functions, the optimal regret is $\tilde{O}(T^{\frac{d_x+d_y+1}{d_x+d_y+2}})$, where $d_x$ and $d_y$ are the dimensions of contexts and arms, and thus suffers from the curse of dimensionality.

Jiuge: A Human-Machine Collaborative Chinese Classical Poetry Generation System

no code implementations ACL 2019 Guo Zhipeng, Xiaoyuan Yi, Maosong Sun, Wenhao Li, Cheng Yang, Jiannan Liang, Huimin Chen, Yuhui Zhang, Ruoyu Li

By exposing the options of poetry genres, styles and revision modes, Jiuge, acting as a professional assistant, allows constant and active participation of users in poetic creation.

Cultural Vocal Bursts Intensity Prediction

Stylistic Chinese Poetry Generation via Unsupervised Style Disentanglement

no code implementations EMNLP 2018 Cheng Yang, Maosong Sun, Xiaoyuan Yi, Wenhao Li

The ability to write diverse poems in different styles under the same poetic imagery is an important characteristic of human poetry writing.

Disentanglement Machine Translation +1
