Search Results for author: Soumik Mukhopadhyay

Found 3 papers, 3 papers with code

Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization

1 code implementation • 18 Aug 2023 • Soumik Mukhopadhyay, Saksham Suri, Ravi Teja Gadde, Abhinav Shrivastava

We show results on both reconstruction (same audio-video inputs) as well as cross (different audio-video inputs) settings on Voxceleb2 and LRW datasets.

Diffusion Models Beat GANs on Image Classification

1 code implementation • 17 Jul 2023 • Soumik Mukhopadhyay, Matthew Gwilliam, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Srinidhi Hegde, Tianyi Zhou, Abhinav Shrivastava

We explore optimal methods for extracting and using these embeddings for classification tasks, demonstrating promising results on the ImageNet classification task.

Classification, Denoising (+5 more)
