Multi-View Action Recognition Using Contrastive Learning

In this work, we present a method for RGB-based action recognition from multi-view videos. We propose a supervised contrastive learning framework that learns a feature embedding robust to changes in viewpoint by effectively leveraging multi-view data: we use an improved supervised contrastive loss and augment the set of positives with samples from synchronized viewpoints. We further propose a new approach that uses classifier probabilities to guide the selection of hard negatives in the contrastive loss, yielding a more discriminative representation; negative samples from classes the classifier confuses (i.e., classes with high posterior probability) are weighted more heavily. We also show that, when trained on synthetic multi-view data, our method achieves better domain generalization than standard supervised training. Extensive experiments on real (NTU-60, NTU-120, NUMA) and synthetic (RoCoG) data demonstrate the effectiveness of our approach.
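The two key ingredients of the abstract — positives augmented with synchronized viewpoints, and negatives weighted by classifier posteriors — can be illustrated with a minimal sketch. The code below is not the authors' released implementation; the function name, the exact weighting scheme (weighting each negative by the anchor's posterior probability for that negative's class), and all array shapes are illustrative assumptions.

```python
import numpy as np

def view_contrastive_loss(feats, labels, clip_ids, posteriors, temp=0.1):
    """Illustrative supervised contrastive loss (not the paper's exact code).

    feats      : (N, D) L2-normalized embeddings
    labels     : (N,)   action class per sample
    clip_ids   : (N,)   clip identifier; synchronized views share an id
    posteriors : (N, C) classifier softmax outputs, used to up-weight
                 negatives from classes the classifier finds confusing
    """
    n = feats.shape[0]
    sim = feats @ feats.T / temp          # pairwise cosine similarities
    np.fill_diagonal(sim, -np.inf)        # exclude self-pairs (exp -> 0)
    exp_sim = np.exp(sim)

    # Positives: same action label OR another synchronized view of the clip.
    pos_mask = (labels[:, None] == labels[None, :]) | \
               (clip_ids[:, None] == clip_ids[None, :])
    np.fill_diagonal(pos_mask, False)
    neg_mask = ~pos_mask & ~np.eye(n, dtype=bool)

    # Weight negative j for anchor i by i's posterior for j's class, so
    # negatives from confusing classes contribute more to the denominator.
    neg_w = posteriors[:, labels] * neg_mask          # (N, N)
    neg_sum = (neg_w * exp_sim).sum(axis=1)           # per-anchor negatives

    losses = []
    for i in range(n):
        pos = np.where(pos_mask[i])[0]
        if len(pos) == 0:
            continue
        log_prob = sim[i, pos] - np.log(exp_sim[i, pos] + neg_sum[i])
        losses.append(-log_prob.mean())
    return float(np.mean(losses))

# Toy usage: 6 samples, 3 classes; samples 0/1 and 4/5 are synchronized views.
rng = np.random.default_rng(0)
feats = rng.normal(size=(6, 8))
feats /= np.linalg.norm(feats, axis=1, keepdims=True)
labels = np.array([0, 0, 1, 1, 2, 2])
clip_ids = np.array([10, 10, 11, 12, 13, 13])
posteriors = rng.dirichlet(np.ones(3), size=6)
loss = view_contrastive_loss(feats, labels, clip_ids, posteriors)
```

Setting all posterior weights to a uniform constant recovers an ordinary supervised contrastive denominator, which is one way to see the weighting as a drop-in modification.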


Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Action Recognition | NTU RGB+D | ViewCon (RGB + Pose) | Accuracy (CS) | 93.7 | # 12 |
| Action Recognition | NTU RGB+D | ViewCon (RGB + Pose) | Accuracy (CV) | 98.9 | # 3 |
| Action Recognition | NTU RGB+D 120 | ViewCon (RGB) | Accuracy (Cross-Subject) | 85.6 | # 12 |
| Action Recognition | NTU RGB+D 120 | ViewCon (RGB) | Accuracy (Cross-Setup) | 87.5 | # 11 |
