Search Results for author: Raahil Shah

Found 4 papers, 0 papers with code

On granularity of prosodic representations in expressive text-to-speech

no code implementations • 26 Jan 2023 • Mikolaj Babianski, Kamil Pokora, Raahil Shah, Rafal Sienkiewicz, Daniel Korzekwa, Viacheslav Klimkov

In expressive speech synthesis it is widely adopted to use latent prosody representations to deal with variability of the data during training.

Expressive Speech Synthesis

Paper
Add Code

GaLeNet: Multimodal Learning for Disaster Prediction, Management and Relief

no code implementations • 18 Jun 2022 • Rohit Saha, Mengyi Fang, Angeline Yasodhara, Kyryl Truskovskyi, Azin Asgarian, Daniel Homola, Raahil Shah, Frederik Dieleman, Jack Weatheritt, Thomas Rogers

In this work, we propose a multimodal framework (GaLeNet) for assessing the severity of damage by complementing pre-disaster images with weather data and the trajectory of the hurricane.

Decision Making Management

Paper
Add Code

Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech

no code implementations • 24 Jun 2021 • Raahil Shah, Kamil Pokora, Abdelhamid Ezzerg, Viacheslav Klimkov, Goeric Huybrechts, Bartosz Putrycz, Daniel Korzekwa, Thomas Merritt

In this paper, we present a method for building highly expressive TTS voices with as little as 15 minutes of speech data from the target speaker.

Generative Adversarial Network

Paper
Add Code

Low-resource expressive text-to-speech using data augmentation

no code implementations • 11 Nov 2020 • Goeric Huybrechts, Thomas Merritt, Giulia Comini, Bartek Perz, Raahil Shah, Jaime Lorenzo-Trueba

While recent neural text-to-speech (TTS) systems perform remarkably well, they typically require a substantial amount of recordings from the target speaker reading in the desired speaking style.

Data Augmentation Voice Conversion

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.