Search Results for author: Sindhu B Hegde

Found 5 papers, 4 papers with code

GestSync: Determining who is speaking without a talking head

1 code implementation · 8 Oct 2023 · Sindhu B Hegde, Andrew Zisserman

In this paper we introduce a new synchronisation task, Gesture-Sync: determining if a person's gestures are correlated with their speech or not.

Active Speaker Detection, Gesture Synchronization, +1 more

Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild

no code implementations · 1 Sep 2022 · Sindhu B Hegde, K R Prajwal, Rudrabha Mukhopadhyay, Vinay P Namboodiri, C. V. Jawahar

With the help of multiple powerful discriminators that guide the training process, our generator learns to synthesize speech sequences in any voice for the lip movements of any person.

Lip to Speech Synthesis, Speech Synthesis

Extreme-scale Talking-Face Video Upsampling with Audio-Visual Priors

1 code implementation · 17 Aug 2022 · Sindhu B Hegde, Rudrabha Mukhopadhyay, Vinay P Namboodiri, C. V. Jawahar

We show that when we process this $8\times8$ video with the right set of audio and image priors, we can obtain a full-length, $256\times256$ video.

Super-Resolution, Video Compression

Towards Automatic Speech to Sign Language Generation

1 code implementation · 24 Jun 2021 · Parul Kapoor, Rudrabha Mukhopadhyay, Sindhu B Hegde, Vinay Namboodiri, C V Jawahar

Since the current datasets are inadequate for generating sign language directly from speech, we collect and release the first Indian sign language dataset comprising speech-level annotations, text transcripts, and the corresponding sign-language videos.

Text Generation
