no code implementations • 6 Oct 2022 • Benno Weck, Miguel Pérez Fernández, Holger Kirchhoff, Xavier Serra
We present an analysis of large-scale pretrained deep learning models used for cross-modal (text-to-audio) retrieval.
Metric Learning, Retrieval