Text to Audio/Video Retrieval

2 papers with code • 1 benchmarks • 1 datasets

This task has no description! Would you like to contribute one?

Datasets


MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration

mugen-org/MUGEN_baseline 17 Apr 2022

Altogether, MUGEN can help progress research in many tasks in multimodal understanding and generation.

38
17 Apr 2022

Audio Retrieval with Natural Language Queries

oncescuandreea/audio-retrieval 5 May 2021

We consider the task of retrieving audio using free-form natural language queries.

28
05 May 2021