The MSP-Podcast corpus contains speech segments from podcast recordings which are perceptually annotated using crowdsourcing. The collection of this corpus is an ongoing process. Most of the segments in a regular podcasts are neutral. We use machine learning techniques trained with available data to retrieve candidate segments. These segments are emotionally annotated with crowdsourcing. This approach allows us to spend our resources on speech segments that are likely to convey emotions.
3 PAPERS • 4 BENCHMARKS
…IDs and URLs of the GIFs and the videos are provided, along with temporal alignment of GIF segments to their source videos.
11 PAPERS • NO BENCHMARKS YET