About

Video Captioning is a task of automatic captioning a video by understanding the action and event in the video which can help in the retrieval of the video efficiently through text.

Source: NITS-VC System for VATEX Video Captioning Challenge 2020

Benchmarks

TREND DATASET BEST METHOD PAPER TITLE PAPER CODE COMPARE

Subtasks

Datasets

Latest papers without code

Fill-in-the-blank as a Challenging Video Understanding Evaluation Framework

9 Apr 2021

Work to date on language-informed video understanding has primarily addressed two tasks: (1) video question answering using multiple-choice questions, where models perform relatively well because they exploit the fact that candidate answers are readily available; and (2) video captioning, which relies on an open-ended evaluation framework that is often inaccurate because system answers may be perceived as incorrect if they differ in form from the ground truth.

LANGUAGE MODELLING QUESTION ANSWERING VIDEO CAPTIONING VIDEO QUESTION ANSWERING VIDEO UNDERSTANDING

The Use of Video Captioning for Fostering Physical Activity

7 Apr 2021

With the above in mind, this paper proposes a video captioning framework that aims to describe the activities in a video and estimate a person's daily physical activity level.

ACTION DETECTION OBJECT DETECTION VIDEO CAPTIONING

Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning

7 Apr 2021

The proposed system functions and operates as followed: it reads a video; representative image frames are identified and selected; the image frames are captioned; NLP is applied to all generated captions together with text summarization; and finally, a title and an abstract are generated for the video.

TEXT SUMMARIZATION VIDEO CAPTIONING

Open-book Video Captioning with Retrieve-Copy-Generate Network

9 Mar 2021

Due to the rapid emergence of short videos and the requirement for content understanding and creation, the video captioning task has received increasing attention in recent years.

VIDEO CAPTIONING

Exploration of Visual Features and their weighted-additive fusion for Video Captioning

14 Jan 2021

Video captioning is a popular task that challenges models to describe events in videos using natural language.

TOKENIZATION VIDEO CAPTIONING

A Reinforcement Learning Based Encoder-Decoder Framework for Learning Stock Trading Rules

8 Jan 2021

The proposed model consists of an encoder which is a neural structure responsible for learning informative features from the input sequence, and a decoder which is a DRL model responsible for learning profitable strategies based on the features extracted by the encoder.

MACHINE TRANSLATION TIME SERIES VIDEO CAPTIONING

Video Captioning in Compressed Video

2 Jan 2021

We propose a video captioning method which operates directly on the stored compressed videos.

VIDEO CAPTIONING

Guidance Module Network for Video Captioning

20 Dec 2020

In this paper, we present a novel architecture which introduces a guidance module to encourage the encoder-decoder model to generate words related to the past and future words in a caption.

VIDEO CAPTIONING