Search Results for author: Shihao Zou

Found 14 papers, 7 papers with code

Tri-Modal Motion Retrieval by Learning a Joint Embedding Space

no code implementations • 1 Mar 2024 • Kangning Yin, Shihao Zou, Yuxuan Ge, Zheng Tian

Information retrieval is an ever-evolving and crucial research domain.

Cross-Modal Retrieval Information Retrieval +1

Paper
Add Code

Multimodal Prompt Transformer with Hybrid Contrastive Learning for Emotion Recognition in Conversation

no code implementations • 4 Oct 2023 • Shihao Zou, Xianying Huang, Xudong Shen

MPT embeds multimodal fusion information into each attention layer of the Transformer, allowing prompt information to participate in encoding textual features and being fused with multi-level textual information to obtain better multimodal fusion features.

Ranked #2 on Emotion Recognition in Conversation on IEMOCAP

Contrastive Learning Emotion Recognition in Conversation

Paper
Add Code

Event-based Human Pose Tracking by Spiking Spatiotemporal Transformer

1 code implementation • 16 Mar 2023 • Shihao Zou, Yuxuan Mu, Xinxin Zuo, Sen Wang, Li Cheng

Motivated by the above mentioned issues, we present in this paper a dedicated end-to-end sparse deep learning approach for event-based pose tracking: 1) to our knowledge this is the first time that 3D human pose tracking is obtained from events only, thus eliminating the need of accessing to any frame-based images as part of input; 2) our approach is based entirely upon the framework of Spiking Neural Networks (SNNs), which consists of Spike-Element-Wise (SEW) ResNet and a novel Spiking Spatiotemporal Transformer; 3) a large-scale synthetic dataset is constructed that features a broad and diverse set of annotated 3D human motions, as well as longer hours of event stream data, named SynEventHPD.

3D Human Pose Estimation 3D Human Pose Tracking

Paper
Code

Snipper: A Spatiotemporal Transformer for Simultaneous Multi-Person 3D Pose Estimation Tracking and Forecasting on a Video Snippet

1 code implementation • 9 Jul 2022 • Shihao Zou, Yuanlu Xu, Chao Li, Lingni Ma, Li Cheng, Minh Vo

In this paper, we propose Snipper, a unified framework to perform multi-person 3D pose estimation, tracking, and motion forecasting simultaneously in a single stage.

3D Pose Estimation Motion Forecasting +1

Paper
Code

Generating Diverse and Natural 3D Human Motions From Text

1 code implementation • CVPR 2022 • Chuan Guo, Shihao Zou, Xinxin Zuo, Sen Wang, Wei Ji, Xingyu Li, Li Cheng

Automated generation of 3D human motions from text is a challenging problem.

Ranked #6 on Motion Synthesis on InterHuman

Motion Synthesis

394

Paper
Code

Action2video: Generating Videos of Human 3D Actions

no code implementations • 12 Nov 2021 • Chuan Guo, Xinxin Zuo, Sen Wang, Xinshuang Liu, Shihao Zou, Minglun Gong, Li Cheng

Action2motion stochastically generates plausible 3D pose sequences of a prescribed action category, which are processed and rendered by motion2video to form 2D videos.

Paper
Add Code

Human Pose and Shape Estimation from Single Polarization Images

1 code implementation • 15 Aug 2021 • Shihao Zou, Xinxin Zuo, Sen Wang, Yiming Qian, Chuan Guo, Li Cheng

This paper focuses on a new problem of estimating human pose and shape from single polarization images.

Surface Normal Estimation

Paper
Code

EventHPE: Event-based 3D Human Pose and Shape Estimation

1 code implementation • ICCV 2021 • Shihao Zou, Chuan Guo, Xinxin Zuo, Sen Wang, Pengyu Wang, Xiaoqin Hu, Shoushun Chen, Minglun Gong, Li Cheng

Event camera is an emerging imaging sensor for capturing dynamics of moving objects as events, which motivates our work in estimating 3D human pose and shape from the event signals.

3D human pose and shape estimation Optical Flow Estimation

Paper
Code

Action2Motion: Conditioned Generation of 3D Human Motions

1 code implementation • 30 Jul 2020 • Chuan Guo, Xinxin Zuo, Sen Wang, Shihao Zou, Qingyao Sun, Annan Deng, Minglun Gong, Li Cheng

Action recognition is a relatively established task, where givenan input sequence of human motion, the goal is to predict its ac-tion category.

Action Generation

145

Paper
Code

3D Human Shape Reconstruction from a Polarization Image

no code implementations • ECCV 2020 • Shihao Zou, Xinxin Zuo, Yiming Qian, Sen Wang, Chi Xu, Minglun Gong, Li Cheng

Inspired by the recent advances in human shape estimation from single color images, in this paper, we attempt at estimating human body shapes by leveraging the geometric cues from single polarization images.

Paper
Add Code

Polarization Human Shape and Pose Dataset

no code implementations • 30 Apr 2020 • Shihao Zou, Xinxin Zuo, Yiming Qian, Sen Wang, Chuan Guo, Chi Xu, Minglun Gong, Li Cheng

Polarization images are known to be able to capture polarized reflected lights that preserve rich geometric cues of an object, which has motivated its recent applications in reconstructing detailed surface normal of the objects of interest.

Paper
Add Code

MarlRank: Multi-agent Reinforced Learning to Rank

no code implementations • 15 Sep 2019 • Shihao Zou, Zhonghua Li, Mohammad Akbari, Jun Wang, Peng Zhang

By defining reward as a function of NDCG, we can optimize our model directly on the ranking performance measure.

Document Ranking Learning-To-Rank

Paper
Add Code

A Regularized Opponent Model with Maximum Entropy Objective

1 code implementation • 17 May 2019 • Zheng Tian, Ying Wen, Zhichen Gong, Faiz Punakkath, Shihao Zou, Jun Wang

In a single-agent setting, reinforcement learning (RL) tasks can be cast into an inference problem by introducing a binary random variable o, which stands for the "optimality".

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Code

Learning to Communicate Implicitly By Actions

no code implementations • 10 Oct 2018 • Zheng Tian, Shihao Zou, Ian Davies, Tim Warr, Lisheng Wu, Haitham Bou Ammar, Jun Wang

The auxiliary reward for communication is integrated into the learning of the policy module.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.