ShiftySpeech

Introduced by Garg et al. in Less is More for Synthetic Speech Detection in the Wild

ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts

🔥 Key Features

  • 3000+ hours of synthetic speech
  • Diverse Distribution Shifts: The dataset spans 7 key distribution shifts, including:
  • 📖 Reading Style
  • 🎙️ Podcast
  • 🎥 YouTube
  • 🗣️ Languages (Three different languages)
  • 🌎 Demographics (including variations in age, accent, and gender)
  • Multiple Speech Generation Systems: Includes data synthesized from various TTS models and vocoders.

Dataset can be downloaded from: Hugging Face

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets


License


Modalities


Languages