Few-Shot Video Classification via Temporal Alignment

There is growing interest in learning models that can recognize novel classes from only a few labeled examples. In this paper, we propose the Temporal Alignment Module (TAM), a novel few-shot learning framework that learns to classify previously unseen videos. While most prior work neglects long-term temporal ordering information, our proposed model explicitly leverages the temporal ordering of video data through temporal alignment, which leads to strong data efficiency for few-shot learning. Concretely, TAM computes the distance between a query video and each novel-class proxy by averaging the per-frame distances along their alignment path. We introduce a continuous relaxation of TAM so that the model can be trained end-to-end to directly optimize the few-shot learning objective. We evaluate TAM on two challenging real-world datasets, Kinetics and Something-Something-V2, and show that our model yields significant improvements in few-shot video classification over a wide range of competitive baselines.
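The alignment-based distance described in the abstract can be made differentiable with a soft-min relaxation of the dynamic-programming recursion. The PyTorch sketch below illustrates this general idea under assumptions of my own: the function names (`frame_distance_matrix`, `alignment_distance`, `few_shot_logits`), the cosine frame distance, the smoothing temperature `gamma`, and the plain DTW-style recursion are illustrative choices, not the paper's exact ordered temporal alignment (OTAM) formulation or its path constraints.

```python
import torch
import torch.nn.functional as F

def frame_distance_matrix(query_feats, proxy_feats):
    """Cosine distance between every query frame and every proxy frame.

    query_feats: (T_q, D) per-frame embeddings of the query video.
    proxy_feats: (T_p, D) per-frame embeddings of a class proxy.
    """
    q = F.normalize(query_feats, dim=-1)
    p = F.normalize(proxy_feats, dim=-1)
    return 1.0 - q @ p.t()                      # (T_q, T_p)

def soft_min(values, gamma):
    # Continuous relaxation of min(): -gamma * log(sum(exp(-x / gamma))).
    x = torch.stack(values)
    return -gamma * torch.logsumexp(-x / gamma, dim=0)

def alignment_distance(dist, gamma=0.1):
    """Soft-DTW-style cumulative cost over the frame distance matrix.

    R[i][j] holds the (soft) minimum cost of aligning the first i query
    frames with the first j proxy frames while preserving temporal order.
    """
    T_q, T_p = dist.shape
    inf = dist.new_tensor(float("inf"))
    R = [[inf] * (T_p + 1) for _ in range(T_q + 1)]
    R[0][0] = dist.new_tensor(0.0)
    for i in range(1, T_q + 1):
        for j in range(1, T_p + 1):
            R[i][j] = dist[i - 1, j - 1] + soft_min(
                [R[i - 1][j - 1], R[i - 1][j], R[i][j - 1]], gamma)
    # Normalize so the score behaves like an average per-frame distance
    # along the alignment path.
    return R[T_q][T_p] / (T_q + T_p)

def few_shot_logits(query_feats, class_proxies, gamma=0.1):
    # Negative alignment distances act as class logits; cross-entropy on
    # these logits trains the backbone end-to-end, since every step above
    # is differentiable.
    dists = [alignment_distance(frame_distance_matrix(query_feats, p), gamma)
             for p in class_proxies]
    return -torch.stack(dists)
```

In a 5-way episode, `class_proxies` would typically be the per-class averages of the support videos' frame embeddings; applying cross-entropy to `few_shot_logits` then optimizes the episodic few-shot objective directly, as described in the abstract.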

CVPR 2020
Task                        | Dataset                 | Model        | Metric         | Value | Global Rank
Few Shot Action Recognition | Kinetics-100            | OTAM         | Accuracy       | 85.8  | #5
Few Shot Action Recognition | Something-Something-100 | OTAM         | 1:1 Accuracy   | 52.3  | #5
Action Recognition          | Something-Something V2  | TAM (5-shot) | Top-1 Accuracy | 52.3  | #114

Methods


No methods listed for this paper.