Search Results for author: Minsi Wang

Found 6 papers, 2 papers with code

3D Human Action Representation Learning via Cross-View Consistency Pursuit

1 code implementation CVPR 2021 Linguo Li, Minsi Wang, Bingbing Ni, Hang Wang, Jiancheng Yang, Wenjun Zhang

In this work, we propose a Cross-view Contrastive Learning framework for unsupervised 3D skeleton-based action Representation (CrosSCLR), by leveraging multi-view complementary supervision signal.

Action Recognition Contrastive Learning +1

Skeleton2Mesh: Kinematics Prior Injected Unsupervised Human Mesh Recovery

no code implementations ICCV 2021 Zhenbo Yu, Junjie Wang, Jingwei Xu, Bingbing Ni, Chenglong Zhao, Minsi Wang, Wenjun Zhang

The challenges of the latter task are two folds: (1) pose failure (i. e., pose mismatching -- different skeleton definitions in dataset and SMPL , and pose ambiguity -- endpoints have arbitrary joint angle configurations for the same 3D joint coordinates).

3D Pose Estimation Human Mesh Recovery

Multiple Granularity Group Interaction Prediction

no code implementations CVPR 2018 Taiping Yao, Minsi Wang, Bingbing Ni, Huawei Wei, Xiaokang Yang

Most human activity analysis works (i. e., recognition or prediction) only focus on a single granularity, i. e., either modelling global motion based on the coarse level movement such as human trajectories or forecasting future detailed action based on body parts’ movement such as skeleton motion.

Crowd Counting via Adversarial Cross-Scale Consistency Pursuit

1 code implementation CVPR 2018 Zan Shen, Yi Xu, Bingbing Ni, Minsi Wang, Jianguo Hu, Xiaokang Yang

Crowd counting or density estimation is a challenging task in computer vision due to large scale variations, perspective distortions and serious occlusions, etc.

Crowd Counting Density Estimation

Fine-Grained Video Captioning for Sports Narrative

no code implementations CVPR 2018 Huanyu Yu, Shuo Cheng, Bingbing Ni, Minsi Wang, Jian Zhang, Xiaokang Yang

First, to facilitate this novel research of fine-grained video caption, we collected a novel dataset called Fine-grained Sports Narrative dataset (FSN) that contains 2K sports videos with ground-truth narratives from YouTube. com.

2k Video Captioning

Recurrent Modeling of Interaction Context for Collective Activity Recognition

no code implementations CVPR 2017 Minsi Wang, Bingbing Ni, Xiaokang Yang

However, most of the previous activity recognition methods do not offer a flexible and scalable scheme to handle the high order context modeling problem.

Descriptive Group Activity Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.