Audio-Visual Video Captioning

0 benchmarks • 0 datasets

This task has no description! Would you like to contribute one?

Latest papers with no code

Knowledge Distillation for Efficient Audio-Visual Video Captioning

no code yet • 16 Jun 2023

Automatically describing audio-visual content with texts, namely video captioning, has received significant attention due to its potential applications across diverse fields.

An Attempt towards Interpretable Audio-Visual Video Captioning

no code yet • 7 Dec 2018

To achieve this, we propose a multimodal convolutional neural network-based audio-visual video captioning framework and introduce a modality-aware module for exploring modality selection during sentence generation.