FSDD (Free Spoken Digit Dataset)

Free Spoken Digit Dataset (FSDD) is a simple audio/speech dataset consisting of recordings of spoken digits in wav files at 8kHz. The recordings are trimmed so that they have near minimal silence at the beginnings and ends. It contains data from 6 speakers, 3,000 recordings (50 of each digit per speaker), and English pronunciations.

Papers


Paper Code Results Date Stars

Tasks


Similar Datasets


License


  • CC BY-SA 4.0

Modalities


Languages