Search Results for author: Vinay P Namboodiri

Found 10 papers, 4 papers with code

INR-V: A Continuous Representation Space for Video-based Generative Tasks

no code implementations29 Oct 2022 Bipasha Sen, Aditya Agarwal, Vinay P Namboodiri, C. V. Jawahar

In this work, we evaluate the space learned by INR-V on diverse generative tasks such as video interpolation, novel video generation, video inversion, and video inpainting against the existing baselines.

Video Generation Video Inpainting

Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild

no code implementations1 Sep 2022 Sindhu B Hegde, K R Prajwal, Rudrabha Mukhopadhyay, Vinay P Namboodiri, C. V. Jawahar

With the help of multiple powerful discriminators that guide the training process, our generator learns to synthesize speech sequences in any voice for the lip movements of any person.

Lip to Speech Synthesis Speech Synthesis

Extreme-scale Talking-Face Video Upsampling with Audio-Visual Priors

1 code implementation17 Aug 2022 Sindhu B Hegde, Rudrabha Mukhopadhyay, Vinay P Namboodiri, C. V. Jawahar

We show that when we process this $8\times8$ video with the right set of audio and image priors, we can obtain a full-length, $256\times256$ video.

Super-Resolution Video Compression

Collaborative Learning to Generate Audio-Video Jointly

no code implementations1 Apr 2021 Vinod K Kurmi, Vipul Bajaj, Badri N Patro, K S Venkatesh, Vinay P Namboodiri, Preethi Jyothi

Towards this, we propose a method that demonstrates that we are able to generate naturalistic samples of video and audio data by the joint correlated generation of audio and video modalities.

Self Supervision for Attention Networks

1 code implementation6 Jan 2021 Badri N Patro, Kasturi GS, Ansh Jain, Vinay P Namboodiri

In recent years, the attention mechanism has become a fairly popular concept and has proven to be successful in many machine learning applications.

Image Classification Language Modelling +5

Stochastic Talking Face Generation Using Latent Distribution Matching

1 code implementation21 Nov 2020 Ravindra Yadav, Ashish Sardana, Vinay P Namboodiri, Rajesh M Hegde

Indeed, just having the ability to generate a single talking face would make a system almost robotic in nature.

Talking Face Generation Video Generation

Speech Prediction in Silent Videos using Variational Autoencoders

no code implementations14 Nov 2020 Ravindra Yadav, Ashish Sardana, Vinay P Namboodiri, Rajesh M Hegde

Understanding the relationship between the auditory and visual signals is crucial for many different applications ranging from computer-generated imagery (CGI) and video editing automation to assisting people with hearing or visual impairments.

Cannot find the paper you are looking for? You can Submit a new open access paper.