Vāksañcayaḥ (Sanskrit Speech Corpus by IIT Bombay)

Introduced by Adiga et al. in "Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights"

This Sanskrit speech corpus contains more than 78 hours of audio: recordings of 45,953 sentences sampled at 22 kHz. The content consists mainly of readings of texts spanning various Śāstras of Saṃskṛtam literature, and also includes contemporary stories, radio programs, extempore discourses, etc.
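
As a rough sanity check on these figures, the average recording length per sentence can be estimated from the totals quoted above. This is a minimal sketch: the variable names are illustrative, and because the corpus is described as "more than 78 hours", the result is only a lower bound on the true average.

```python
# Totals as quoted in the corpus description.
TOTAL_HOURS = 78          # "more than 78 hours", so this is a lower bound
NUM_SENTENCES = 45_953    # number of recorded sentences

# Convert hours to seconds, then divide by the sentence count.
total_seconds = TOTAL_HOURS * 3600
avg_seconds = total_seconds / NUM_SENTENCES

print(f"Average recording length: at least {avg_seconds:.2f} s per sentence")
```

An average of roughly six seconds per sentence is consistent with a corpus of short read utterances, though individual recordings may be much shorter or longer.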

Tasks