2 dataset results for Speaker Recognition AND Speech

CN-Celeb is a large-scale speaker recognition dataset collected 'in the wild'. It contains more than 130,000 utterances from 1,000 Chinese celebrities and covers 11 different real-world genres.

63 PAPERS • 1 BENCHMARK

MAVS

MAVS (Multilingual Audio-Visual Smartphone dataset)

MAVS is an audio-visual dataset captured with five recent smartphones. It contains 103 subjects recorded in three sessions covering different real-world scenarios. Recordings were made in three languages so that the language dependency of speaker recognition systems can be studied.

1 PAPER • NO BENCHMARKS YET

