Global Object Proposals for Improving Multi-Sentence Video Descriptions

There has been significant progress in image captioning in recent years, but the generation of video descriptions is still in its early stages, owing to the complex nature of videos in comparison to images. Generating paragraph descriptions of a video is even more challenging; among the main issues are temporal object dependencies and complex object-object relationships. Recently, many works have been proposed for the generation of multi-sentence video descriptions. The majority of them follow a two-step approach: 1) event proposal and 2) caption generation. While these approaches produce good results, they miss out on globally available information. Here we propose the use of global object proposals while generating video captions. Experimental results on the ActivityNet dataset illustrate that global object proposals can produce more informative and correct captions. We also propose three scores to evaluate the object detection capacity of the generator. A qualitative comparison of captions generated by the proposed method and state-of-the-art techniques demonstrates the efficacy of the proposed method.
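The abstract does not spell out how object proposals are made "global", so the following is only a minimal illustrative sketch, not the paper's method: one simple way to obtain video-level object context is to aggregate per-frame detections across all frames, keeping each label's best confidence, and pass the top-ranked labels to the caption generator as additional context. The function name and data layout here are hypothetical.

```python
from collections import defaultdict

def global_object_proposals(frame_detections, top_k=5):
    """Aggregate per-frame detections into video-level ("global") object
    proposals: keep each label's highest confidence seen in any frame,
    then return the top_k labels ranked by that confidence.

    frame_detections: list (one entry per frame) of lists of
    (label, confidence) pairs. Hypothetical format for illustration.
    """
    best = defaultdict(float)
    for detections in frame_detections:
        for label, conf in detections:
            best[label] = max(best[label], conf)
    ranked = sorted(best.items(), key=lambda kv: kv[1], reverse=True)
    return [label for label, _ in ranked[:top_k]]

# Toy example: detections from three frames of one video.
frames = [
    [("person", 0.9), ("dog", 0.4)],
    [("person", 0.8), ("frisbee", 0.7)],
    [("dog", 0.6)],
]
print(global_object_proposals(frames, top_k=2))  # ['person', 'frisbee']
```

Unlike per-event proposals, this aggregation sees objects from the entire video, so a caption for one event can still mention an object that is only clearly visible elsewhere in the clip.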

Benchmark results

Task: Dense Video Captioning
Dataset: ActivityNet Captions
Model: ADV-INF + Global

Metric   Value   Global Rank
METEOR   16.36   # 2
BLEU-4    9.45   # 1
CIDEr    19.40   # 6
DIV-1     0.60   # 1
DIV-2     0.78   # 1
RE-4      0.05   # 1
