Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications

Trained on large datasets, deep learning (DL) models can accurately classify videos into hundreds of diverse classes. However, video data is expensive to annotate. Zero-shot learning (ZSL) offers one solution to this problem: a model is trained once and then generalizes to new tasks whose classes are not present in the training dataset. We propose the first end-to-end algorithm for ZSL in video classification. Our training procedure builds on insights from recent video classification literature and uses a trainable 3D CNN to learn the visual features, in contrast to previous video ZSL methods, which rely on pretrained feature extractors. We also extend the current benchmarking paradigm: previous techniques aim to make the test task unknown at training time but fall short of this goal. We encourage domain shift across training and test data and disallow tailoring a ZSL model to a specific test dataset. We outperform the state of the art by a wide margin. Our code, evaluation procedure, and model weights are available at github.com/bbrattoli/ZeroShotVideoClassification.
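The abstract's core technical idea, a 3D CNN trained end-to-end to map video clips into a semantic embedding space of class names, can be sketched as follows. This is a minimal illustration only: the torchvision r3d_18 backbone, the MSE regression loss, and the 300-dimensional Word2Vec-style class embeddings are assumptions for the sketch, not the paper's exact configuration.

```python
# Minimal sketch of end-to-end zero-shot video classification:
# a trainable 3D CNN regresses clip features onto class-name embeddings.
import torch
import torch.nn as nn
from torchvision.models.video import r3d_18

EMBED_DIM = 300  # typical Word2Vec dimensionality (assumption)

class E2EZeroShotNet(nn.Module):
    def __init__(self, embed_dim=EMBED_DIM):
        super().__init__()
        self.backbone = r3d_18()  # trainable 3D CNN, no pretrained weights
        # Replace the classification head with a projection into the
        # semantic embedding space.
        self.backbone.fc = nn.Linear(self.backbone.fc.in_features, embed_dim)

    def forward(self, clips):          # clips: (B, 3, T, H, W)
        return self.backbone(clips)    # (B, embed_dim)

# Placeholder class embeddings; in practice these would come from a word
# embedding model (e.g., Word2Vec) applied to the training class names.
train_class_embeds = torch.randn(100, EMBED_DIM)

model = E2EZeroShotNet()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3, momentum=0.9)
criterion = nn.MSELoss()  # regress clip features onto class embeddings

clips = torch.randn(4, 3, 16, 112, 112)   # dummy batch of 16-frame clips
labels = torch.randint(0, 100, (4,))

optimizer.zero_grad()
loss = criterion(model(clips), train_class_embeds[labels])
loss.backward()
optimizer.step()
```

Because the visual backbone is optimized jointly with the embedding objective, the features adapt to the semantic space rather than being frozen from a pretraining task, which is the contrast with prior video ZSL methods drawn in the abstract.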

Task                          Dataset      Model  Metric          Value  Global Rank
Zero-Shot Action Recognition  ActivityNet  E2E    Top-1 Accuracy  26.6   #4
Zero-Shot Action Recognition  HMDB51       E2E    Top-1 Accuracy  32.7   #18
Zero-Shot Action Recognition  UCF101       E2E    Top-1 Accuracy  48     #18
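As a rough sketch of how the zero-shot Top-1 accuracy above is typically computed: test clips are embedded by the trained network, and each clip is assigned to the nearest unseen-class embedding. The cosine-similarity matching and all names below are illustrative assumptions, not the paper's released evaluation code.

```python
# Sketch of zero-shot Top-1 accuracy via nearest-neighbor class assignment
# in the shared embedding space.
import torch
import torch.nn.functional as F

def zero_shot_top1(clip_embeds, class_embeds, labels):
    """clip_embeds:  (N, D) model outputs for test clips
    class_embeds: (C, D) embeddings of the unseen test classes
    labels:       (N,)   ground-truth indices into class_embeds"""
    sims = F.normalize(clip_embeds, dim=1) @ F.normalize(class_embeds, dim=1).T
    preds = sims.argmax(dim=1)  # nearest class embedding wins
    return (preds == labels).float().mean().item()

# Dummy example: 8 clips, 51 unseen classes (HMDB51-sized), 300-d space.
acc = zero_shot_top1(torch.randn(8, 300), torch.randn(51, 300),
                     torch.randint(0, 51, (8,)))
print(f"Top-1 accuracy: {acc:.3f}")
```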
