TIE (https://github.com/raianand1991/TIE)

Introduced by Rai et al. in A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC Videos

Click to add a brief description of the dataset (Markdown and LaTeX enabled). The TIE(Technical Indian English) dataset is a massive speech dataset of ~750 GB, consisting of ~9.8K technical lectures in English, along with their transcripts. The lectures were delivered by instructors from all over India and were sourced from the NPTEL website

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets


License


  • Unknown

Modalities


Languages