TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Retrieval	LSMDC	CT-SAN	text-to-video R@1	5.1	# 37
Video Retrieval	LSMDC	CT-SAN	text-to-video R@5	16.3	# 33
Video Retrieval	LSMDC	CT-SAN	text-to-video R@10	25.2	# 32
Video Retrieval	LSMDC	CT-SAN	text-to-video Median Rank	46	# 20
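
The table reports Recall at K (R@K, higher is better) and Median Rank (lower is better) for text-to-video retrieval. As a minimal sketch of how such numbers are typically computed, the snippet below assumes a square similarity matrix in which query `i` has exactly one correct video, at index `i`. The function name `retrieval_metrics` and the toy matrix are illustrative assumptions, not part of the paper or of any benchmark toolkit.

```python
from statistics import median


def retrieval_metrics(sim, ks=(1, 5, 10)):
    """Compute R@K (in percent) and median rank for text-to-video retrieval.

    Assumption: sim[i][j] is the similarity between text query i and video j,
    and the ground-truth video for query i is video i.
    """
    ranks = []
    for i, row in enumerate(sim):
        # Rank of the correct video: 1 + number of videos scored strictly higher.
        rank = 1 + sum(1 for score in row if score > row[i])
        ranks.append(rank)

    n = len(ranks)
    metrics = {f"R@{k}": 100.0 * sum(r <= k for r in ranks) / n for k in ks}
    metrics["Median Rank"] = median(ranks)
    return metrics


# Toy example with three queries and three videos (made-up scores).
toy_sim = [
    [0.9, 0.1, 0.2],  # correct video ranked 1st
    [0.8, 0.3, 0.5],  # correct video ranked 3rd
    [0.2, 0.7, 0.6],  # correct video ranked 2nd
]
print(retrieval_metrics(toy_sim, ks=(1, 2)))
```

Ties are resolved optimistically here (only strictly higher scores push the correct video down); an evaluation script may handle ties differently.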

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/end-to-end-concept-word-detection-for-video/video-retrieval-on-lsmdc)](https://paperswithcode.com/sota/video-retrieval-on-lsmdc?p=end-to-end-concept-word-detection-for-video)`

End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering

CVPR 2017 · Youngjae Yu, Hyungjin Ko, Jongwook Choi, Gunhee Kim

We propose a high-level concept word detector that can be integrated with any video-to-language model. It takes a video as input and generates a list of concept words as useful semantic priors for language generation models. The proposed word detector has two important properties. First, it does not require any external knowledge sources for training. Second, it is trainable in an end-to-end manner jointly with any video-to-language model. To maximize the value of the detected words, we also develop a semantic attention mechanism that selectively focuses on the detected concept words and fuses them with the word encoding and decoding in the language model. To demonstrate that the proposed approach indeed improves the performance of multiple video-to-language tasks, we participate in four tasks of LSMDC 2016. Our approach achieves the best accuracies in three of them: fill-in-the-blank, multiple-choice test, and movie retrieval. We also attain comparable performance on the remaining task, movie description.
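
The abstract does not spell out the semantic attention computation, so the following is only a generic sketch of the idea of "selectively focusing" on detected concept words: plain dot-product attention with a softmax over concept word embeddings. The function name `attend_to_concepts`, the vectors, and the scoring function are all assumptions for illustration; the paper's actual formulation may differ.

```python
import math


def attend_to_concepts(state, concept_vecs):
    """Generic dot-product attention over concept word embeddings.

    Assumption: `state` is the language model's current hidden state and each
    entry of `concept_vecs` is the embedding of one detected concept word,
    all with the same dimensionality.
    Returns (attention weights, weighted sum of concept embeddings).
    """
    # Relevance score of each concept word to the current state.
    scores = [sum(s * c for s, c in zip(state, vec)) for vec in concept_vecs]

    # Numerically stable softmax turns scores into weights that sum to 1.
    top = max(scores)
    exps = [math.exp(x - top) for x in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Context vector: concept embeddings mixed according to the weights.
    dim = len(state)
    context = [
        sum(w * vec[d] for w, vec in zip(weights, concept_vecs))
        for d in range(dim)
    ]
    return weights, context


# Toy example: the state aligns with the first of two hypothetical concepts.
weights, context = attend_to_concepts([1.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])
print(weights, context)
```

In a full model the context vector would be combined with the word encoder or decoder input at each step; that fusion step is omitted here because the abstract gives no details about it.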

Code

No code implementations yet.

Tasks

Language Modelling

Multiple-choice

Question Answering

Retrieval

Text Generation

Video Captioning

Video Retrieval

Datasets

LSMDC

Results from the Paper

Ranked #37 on Video Retrieval on LSMDC

Methods

No methods listed for this paper.
