TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Cover song identification	Covers80	MOVE	MAP	0.844	# 4
Cover song identification	YouTube350	MOVE	MAP	0.885	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/accurate-and-scalable-version-identification/cover-song-identification-on-covers80)](https://paperswithcode.com/sota/cover-song-identification-on-covers80?p=accurate-and-scalable-version-identification)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/accurate-and-scalable-version-identification/cover-song-identification-on-youtube350)](https://paperswithcode.com/sota/cover-song-identification-on-youtube350?p=accurate-and-scalable-version-identification)`

Accurate and Scalable Version Identification Using Musically-Motivated Embeddings

28 Oct 2019 · Furkan Yesiler, Joan Serrà, Emilia Gómez ·

The version identification (VI) task deals with the automatic detection of recordings that correspond to the same underlying musical piece. Despite many efforts, VI is still an open problem, with much room for improvement, specially with regard to combining accuracy and scalability. In this paper, we present MOVE, a musically-motivated method for accurate and scalable version identification. MOVE achieves state-of-the-art performance on two publicly-available benchmark sets by learning scalable embeddings in an Euclidean distance space, using a triplet loss and a hard triplet mining strategy. It improves over previous work by employing an alternative input representation, and introducing a novel technique for temporal content summarization, a standardized latent space, and a data augmentation strategy specifically designed for VI. In addition to the main results, we perform an ablation study to highlight the importance of our design choices, and study the relation between embedding dimensionality and model performance.

PDF Abstract