Deeply Korean read speech

Deeply Korean read speech corpus contains pairs of Korean speakers reading a script with 3 distinct text sentiments (negative, neutral, positive), with 3 distinct voice sentiments (negative, neutral, positive), are recorded. The recordings took place in 3 different types of places, which are an anechoic chamber, studio apartment, and dance studio, of which the level of reverberation differs. And in order to examine the effect of the distance of mic from the source and device, every experiment is recorded at 3 distinct distances with 2 types of smartphone, iPhone X, and Galaxy S7.

This sample dataset consists of about 3 hours(290 hours in the full set) of audio(16 kHz, 16-bit, mono), and one pair of speakers. The dataset is a subset(approximately 1%) of a much bigger dataset which were recorded under the same circumstances as these open-source datasets. Please contact us(contact@deeplyinc.com) for the full set with the research/commercial license.

Papers


Paper Code Results Date Stars

Tasks


License


  • Unknown

Modalities


Languages