Audio to Text Retrieval
5 papers with code • 4 benchmarks • 4 datasets
This task has no description! Would you like to contribute one?
Latest papers with no code
Killing two birds with one stone: Can an audio captioning system also be used for audio-text retrieval?
For ATR, we propose using the standard Cross-Entropy loss values obtained for any audio/caption pair.
On Negative Sampling for Contrastive Audio-Text Retrieval
With a constant training setting on the retrieval system from [1], we study eight sampling strategies, including hard and semi-hard negative sampling.
Exploring Train and Test-Time Augmentations for Audio-Language Learning
In this paper, we aim to unveil the impact of data augmentation in audio-language multi-modal learning, which has not been explored despite its importance.