ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts
🔥 Key Features
- 3000+ hours of synthetic speech
- Diverse Distribution Shifts: The dataset spans 7 key distribution shifts, including:
- 📖 Reading Style
- 🎙️ Podcast
- 🎥 YouTube
- 🗣️ Languages (Three different languages)
- 🌎 Demographics (including variations in age, accent, and gender)
- Multiple Speech Generation Systems: Includes data synthesized from various TTS models and vocoders.
Dataset can be downloaded from: Hugging Face