2 dataset results for segmentation AND Speaker Verification

…Since the dataset is collected ‘in the wild’, the speech segments are corrupted with real world noise including laughter, cross-talk, channel effects, music and other sounds.

491 PAPERS • 5 BENCHMARKS

CALLHOME American English Speech

…Transcripts: The transcripts cover contiguous 5 or 10-minute segments from recorded conversations. Speaker Awareness: All speakers were aware that they were being recorded.

11 PAPERS • 7 BENCHMARKS

Datasets

2 dataset results for segmentation AND Speaker Verification