TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Question Answering	ActivityNet-QA	E-VQA	Accuracy	25.1	# 33
Video Question Answering	ActivityNet-QA	E-MN	Accuracy	27.1	# 30
Video Question Answering	ActivityNet-QA	E-SA	Accuracy	31.8	# 29

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/activitynet-qa-a-dataset-for-understanding/video-question-answering-on-activitynet-qa)](https://paperswithcode.com/sota/video-question-answering-on-activitynet-qa?p=activitynet-qa-a-dataset-for-understanding)`

ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering

6 Jun 2019 · Zhou Yu, Dejing Xu, Jun Yu, Ting Yu, Zhou Zhao, Yueting Zhuang, DaCheng Tao ·

Recent developments in modeling language and vision have been successfully applied to image question answering. It is both crucial and natural to extend this research direction to the video domain for video question answering (VideoQA). Compared to the image domain where large scale and fully annotated benchmark datasets exists, VideoQA datasets are limited to small scale and are automatically generated, etc. These limitations restrict their applicability in practice. Here we introduce ActivityNet-QA, a fully annotated and large scale VideoQA dataset. The dataset consists of 58,000 QA pairs on 5,800 complex web videos derived from the popular ActivityNet dataset. We present a statistical analysis of our ActivityNet-QA dataset and conduct extensive experiments on it by comparing existing VideoQA baselines. Moreover, we explore various video representation strategies to improve VideoQA performance, especially for long videos. The dataset is available at https://github.com/MILVLG/activitynet-qa

PDF Abstract

Code

Add Remove Mark official

MILVLG/activitynet-qa official

Tasks

Add Remove

Question Answering

Video Question Answering

Visual Question Answering (VQA)

Zero-Shot Video Question Answer

Datasets

Introduced in the Paper:

ActivityNet-QA

Used in the Paper:

Visual Question Answering

ActivityNet

MovieQA

TGIF-QA

Results from the Paper

Edit

Ranked #29 on Video Question Answering on ActivityNet-QA

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Question Answering	ActivityNet-QA	E-VQA	Accuracy	25.1	# 33	Compare
Video Question Answering	ActivityNet-QA	E-MN	Accuracy	27.1	# 30	Compare
Video Question Answering	ActivityNet-QA	E-SA	Accuracy	31.8	# 29	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove