Browse > Computer Vision > Video > Video Description

Video Description

15 papers with code · Computer Vision
Subtask of Video

Leaderboards

No evaluation results yet. Help compare methods by submit evaluation metrics.

Greatest papers with code

VizSeq: A Visual Analysis Toolkit for Text Generation Tasks

IJCNLP 2019 facebookresearch/vizseq

Automatic evaluation of text generation tasks (e. g. machine translation, text summarization, image captioning and video description) usually relies heavily on task-specific metrics, such as BLEU and ROUGE.

IMAGE CAPTIONING MACHINE TRANSLATION TEXT GENERATION TEXT SUMMARIZATION VIDEO DESCRIPTION

Describing Videos by Exploiting Temporal Structure

ICCV 2015 yaoli/arctic-capgen-vid

In this context, we propose an approach that successfully takes into account both the local and global temporal structure of videos to produce descriptions.

TEMPORAL ACTION LOCALIZATION VIDEO DESCRIPTION

Grounded Video Description

CVPR 2019 facebookresearch/grounded-video-description

Our dataset, ActivityNet-Entities, augments the challenging ActivityNet Captions dataset with 158k bounding box annotations, each grounding a noun phrase.

VIDEO DESCRIPTION

TGIF: A New Dataset and Benchmark on Animated GIF Description

CVPR 2016 raingo/TGIF-Release

The motivation for this work is to develop a testbed for image sequence description systems, where the task is to generate natural language descriptions for animated GIFs or video clips.

IMAGE CAPTIONING MACHINE TRANSLATION TEXT GENERATION VIDEO DESCRIPTION

Video Description using Bidirectional Recurrent Neural Networks

12 Apr 2016lvapeab/ABiViRNet

Although traditionally used in the machine translation field, the encoder-decoder framework has been recently applied for the generation of video and image descriptions.

TEXT GENERATION VIDEO CAPTIONING VIDEO DESCRIPTION

Predicting Visual Features from Text for Image and Video Caption Retrieval

5 Sep 2017danieljf24/w2vv

This paper strives to find amidst a set of sentences the one best describing the content of a given image or video.

VIDEO DESCRIPTION

Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7

1 Jun 2018hudaAlamri/DSTC7-Audio-Visual-Scene-Aware-Dialog-AVSD-Challenge

Scene-aware dialog systems will be able to have conversations with users about the objects and events around them.

VIDEO DESCRIPTION VISUAL DIALOG

Adversarial Inference for Multi-Sentence Video Description

CVPR 2019 jamespark3922/adv-inf

Among the main issues are the fluency and coherence of the generated descriptions, and their relevance to the video.

IMAGE CAPTIONING VIDEO DESCRIPTION

VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research

ICCV 2019 eric-xw/Video-guided-Machine-Translation

We also introduce two tasks for video-and-language research based on VATEX: (1) Multilingual Video Captioning, aimed at describing a video in various languages with a compact unified captioning model, and (2) Video-guided Machine Translation, to translate a source language description into the target language using the video information as additional spatiotemporal context.

MACHINE TRANSLATION VIDEO CAPTIONING VIDEO DESCRIPTION