AVE (Audio-Visual Event Localization)

Introduced by Tian et al. in Audio-Visual Event Localization in Unconstrained Videos

To investigate three temporal localization tasks: supervised and weakly-supervised audio-visual event localization, and cross-modality localization.

Source: Audio-Visual Event Localization in Unconstrained Videos

Homepage

No benchmarks yet. Start a new benchmark or link an existing one.

Paper	Code	Results	Date	Stars

No data loaders found. You can submit your data loader here.

VGG-SS