Texts

MIR-1K

Introduced by Chao-Ling Hsu et al. in On the Improvement of Singing Voice Separation for Monaural Recordings Using the MIR-1K Dataset

MIR-1K (Multimedia Information Retrieval lab, 1000 song clips) is a dataset designed for singing voice separation. It contains:

1000 song clips with the music accompaniment and the singing voice recorded as left and right channels, respectively,
Manual annotations of pitch contours in semitone, indices and types for unvoiced frames, lyrics, and vocal/non-vocal segments,
The speech recordings of the lyrics by the same person who sang the songs.

The duration of each clip ranges from 4 to 13 seconds, and the total length of the dataset is 133 minutes. These clips are extracted from 110 karaoke songs which contain a mixture track and a music accompaniment track. These songs are freely selected from 5000 Chinese pop songs and sung by researchers from MIR lab (8 females and 11 males). Most of the singers are amateur and do not have professional music training.

Source: https://sites.google.com/site/unvoicedsoundseparation/mir-1k

Homepage