Search Results for author: Shihao Zou

Found 14 papers, 7 papers with code

Multimodal Prompt Transformer with Hybrid Contrastive Learning for Emotion Recognition in Conversation

no code implementations4 Oct 2023 Shihao Zou, Xianying Huang, Xudong Shen

MPT embeds multimodal fusion information into each attention layer of the Transformer, allowing prompt information to participate in encoding textual features and being fused with multi-level textual information to obtain better multimodal fusion features.

Contrastive Learning Emotion Recognition in Conversation

Event-based Human Pose Tracking by Spiking Spatiotemporal Transformer

1 code implementation16 Mar 2023 Shihao Zou, Yuxuan Mu, Xinxin Zuo, Sen Wang, Li Cheng

Motivated by the above mentioned issues, we present in this paper a dedicated end-to-end sparse deep learning approach for event-based pose tracking: 1) to our knowledge this is the first time that 3D human pose tracking is obtained from events only, thus eliminating the need of accessing to any frame-based images as part of input; 2) our approach is based entirely upon the framework of Spiking Neural Networks (SNNs), which consists of Spike-Element-Wise (SEW) ResNet and a novel Spiking Spatiotemporal Transformer; 3) a large-scale synthetic dataset is constructed that features a broad and diverse set of annotated 3D human motions, as well as longer hours of event stream data, named SynEventHPD.

3D Human Pose Estimation 3D Human Pose Tracking

Snipper: A Spatiotemporal Transformer for Simultaneous Multi-Person 3D Pose Estimation Tracking and Forecasting on a Video Snippet

1 code implementation9 Jul 2022 Shihao Zou, Yuanlu Xu, Chao Li, Lingni Ma, Li Cheng, Minh Vo

In this paper, we propose Snipper, a unified framework to perform multi-person 3D pose estimation, tracking, and motion forecasting simultaneously in a single stage.

3D Pose Estimation Motion Forecasting +1

Action2video: Generating Videos of Human 3D Actions

no code implementations12 Nov 2021 Chuan Guo, Xinxin Zuo, Sen Wang, Xinshuang Liu, Shihao Zou, Minglun Gong, Li Cheng

Action2motion stochastically generates plausible 3D pose sequences of a prescribed action category, which are processed and rendered by motion2video to form 2D videos.

Human Pose and Shape Estimation from Single Polarization Images

1 code implementation15 Aug 2021 Shihao Zou, Xinxin Zuo, Sen Wang, Yiming Qian, Chuan Guo, Li Cheng

This paper focuses on a new problem of estimating human pose and shape from single polarization images.

Surface Normal Estimation

EventHPE: Event-based 3D Human Pose and Shape Estimation

1 code implementation ICCV 2021 Shihao Zou, Chuan Guo, Xinxin Zuo, Sen Wang, Pengyu Wang, Xiaoqin Hu, Shoushun Chen, Minglun Gong, Li Cheng

Event camera is an emerging imaging sensor for capturing dynamics of moving objects as events, which motivates our work in estimating 3D human pose and shape from the event signals.

3D human pose and shape estimation Optical Flow Estimation

Action2Motion: Conditioned Generation of 3D Human Motions

1 code implementation30 Jul 2020 Chuan Guo, Xinxin Zuo, Sen Wang, Shihao Zou, Qingyao Sun, Annan Deng, Minglun Gong, Li Cheng

Action recognition is a relatively established task, where givenan input sequence of human motion, the goal is to predict its ac-tion category.

Action Generation

3D Human Shape Reconstruction from a Polarization Image

no code implementations ECCV 2020 Shihao Zou, Xinxin Zuo, Yiming Qian, Sen Wang, Chi Xu, Minglun Gong, Li Cheng

Inspired by the recent advances in human shape estimation from single color images, in this paper, we attempt at estimating human body shapes by leveraging the geometric cues from single polarization images.

Polarization Human Shape and Pose Dataset

no code implementations30 Apr 2020 Shihao Zou, Xinxin Zuo, Yiming Qian, Sen Wang, Chuan Guo, Chi Xu, Minglun Gong, Li Cheng

Polarization images are known to be able to capture polarized reflected lights that preserve rich geometric cues of an object, which has motivated its recent applications in reconstructing detailed surface normal of the objects of interest.

MarlRank: Multi-agent Reinforced Learning to Rank

no code implementations15 Sep 2019 Shihao Zou, Zhonghua Li, Mohammad Akbari, Jun Wang, Peng Zhang

By defining reward as a function of NDCG, we can optimize our model directly on the ranking performance measure.

Document Ranking Learning-To-Rank

A Regularized Opponent Model with Maximum Entropy Objective

1 code implementation17 May 2019 Zheng Tian, Ying Wen, Zhichen Gong, Faiz Punakkath, Shihao Zou, Jun Wang

In a single-agent setting, reinforcement learning (RL) tasks can be cast into an inference problem by introducing a binary random variable o, which stands for the "optimality".

Multi-agent Reinforcement Learning reinforcement-learning +1

Learning to Communicate Implicitly By Actions

no code implementations10 Oct 2018 Zheng Tian, Shihao Zou, Ian Davies, Tim Warr, Lisheng Wu, Haitham Bou Ammar, Jun Wang

The auxiliary reward for communication is integrated into the learning of the policy module.

Cannot find the paper you are looking for? You can Submit a new open access paper.