3 dataset results for segmentation AND Speech Recognition AND English

Tilde MODEL Corpus (Tilde Multilingual Open Data for European Languages)

…It contains over 10M segments of multilingual open data. The data has been collected from sites allowing free use and reuse of their content, as well as from public sector websites.

2 PAPERS • NO BENCHMARKS YET

MediaSpeech

…The dataset consists of short speech segments automatically extracted from media videos available on YouTube and manually transcribed, with some pre- and post-processing.

4 PAPERS • 1 BENCHMARK

Open Images V7

…A subset of 1.9M includes diverse annotation types: 15,851,536 boxes on 600 classes; 2,785,498 instance segmentations on 350 classes; 3,284,280 relationship annotations on 1,466 relationships; 675,155…

4 PAPERS • NO BENCHMARKS YET
