Search Results for author: Saiteja Kosgi

Found 5 papers, 0 papers with code

Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems

no code implementations • NAACL 2022 • Saiteja Kosgi, Sarath Sivaprasad, Niranjan Pedanekar, Anil Nelakanti, Vineet Gandhi

We present a method to control the emotional prosody of Text to Speech (TTS) systems by using phoneme-level intermediate features (pitch, energy, and duration) as levers.

Paper
Add Code

MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting

no code implementations • 19 May 2023 • Neil Shah, Vishal Tambrahalli, Saiteja Kosgi, Niranjan Pedanekar, Vineet Gandhi

We present MParrotTTS, a unified multilingual, multi-speaker text-to-speech (TTS) synthesis model that can produce high-quality speech.

Speech Synthesis Text-To-Speech Synthesis

Paper
Add Code

ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations

no code implementations • 1 Mar 2023 • Neil Shah, Saiteja Kosgi, Vishal Tambrahalli, Neha Sahipjohn, Niranjan Pedanekar, Vineet Gandhi

We present ParrotTTS, a modularized text-to-speech synthesis model leveraging disentangled self-supervised speech representations.

Self-Supervised Learning Speech Synthesis +1

Paper
Add Code

Emotional Prosody Control for Speech Generation

no code implementations • 7 Nov 2021 • Sarath Sivaprasad, Saiteja Kosgi, Vineet Gandhi

The proposed TTS system can generate speech from the text in any speaker's style, with fine control of emotion.

Paper
Add Code

Reappraising Domain Generalization in Neural Networks

no code implementations • 15 Oct 2021 • Sarath Sivaprasad, Akshay Goindani, Vaibhav Garg, Ritam Basu, Saiteja Kosgi, Vineet Gandhi

We find that the presence of multiple domains incentivizes domain agnostic learning and is the primary reason for generalization in Tradition DG.

Data Augmentation Domain Generalization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.