Search Results for author: Sudhanshu Srivastava

Found 4 papers, 1 papers with code

Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages

no code implementations • 13 Feb 2023 • Sudhanshu Srivastava, Ishika Gupta, Anusha Prakash, Jom Kuriakose, Hema A. Murthy

Hidden-Markov-model (HMM) based text-to-speech (HTS) offers flexibility in speaking styles along with fast training and synthesis while being computationally less intense.

Speech Synthesis

Paper
Add Code

Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages

no code implementations • 1 Nov 2022 • Anusha Prakash, Arun Kumar, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K V Vikram, Mano Ranjith Kumar M, Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda Sukhadia, Dipti Sharma, Hema Murthy, Pushpak Bhattacharya, S Umesh, Rajeev Sangal

Cross-lingual dubbing of lecture videos requires the transcription of the original audio, correction and removal of disfluencies, domain term discovery, text-to-text translation into the target language, chunking of text using target language rhythm, text-to-speech synthesis followed by isochronous lipsyncing to the original video.

Chunking Speech Synthesis +1

Paper
Add Code

Deep Learning--Based Scene Simplification for Bionic Vision

1 code implementation • 30 Jan 2021 • Nicole Han, Sudhanshu Srivastava, Aiwen Xu, Devi Klein, Michael Beyeler

Retinal degenerative diseases cause profound visual impairment in more than 10 million people worldwide, and retinal prostheses are being developed to restore vision to these individuals.

Monocular Depth Estimation Scene Understanding +1

Paper
Code

Deep Cross-Modal Audio-Visual Generation

no code implementations • 26 Apr 2017 • Lele Chen, Sudhanshu Srivastava, Zhiyao Duan, Chenliang Xu

Being the first to explore this new problem, we compose two new datasets with pairs of images and sounds of musical performances of different instruments.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.