Search Results for author: Soumik Mukhopadhyay

Found 3 papers, 3 papers with code

Do text-free diffusion models learn discriminative visual representations?

1 code implementation • 29 Nov 2023 • Soumik Mukhopadhyay, Matthew Gwilliam, Yosuke Yamaguchi, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Tianyi Zhou, Abhinav Shrivastava

We find that the intermediate feature maps of the U-Net are diverse, discriminative feature representations.

Image Classification object-detection +3

Paper
Code

Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization

1 code implementation • 18 Aug 2023 • Soumik Mukhopadhyay, Saksham Suri, Ravi Teja Gadde, Abhinav Shrivastava

We show results on both reconstruction (same audio-video inputs) as well as cross (different audio-video inputs) settings on Voxceleb2 and LRW datasets.

229

Paper
Code

Diffusion Models Beat GANs on Image Classification

1 code implementation • 17 Jul 2023 • Soumik Mukhopadhyay, Matthew Gwilliam, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Srinidhi Hegde, Tianyi Zhou, Abhinav Shrivastava

We explore optimal methods for extracting and using these embeddings for classification tasks, demonstrating promising results on the ImageNet classification task.

Classification Denoising +5

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.