Search Results for author: Nithin Gopalakrishnan Nair

Found 14 papers, 7 papers with code

MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models

no code implementations15 Apr 2024 Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal M Patel

Large diffusion-based Text-to-Image (T2I) models have shown impressive generative powers for text-to-image generation as well as spatially conditioned image generation.

Text-to-Image Generation

Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers

no code implementations15 Apr 2024 Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal M. Patel

As these parameters are independent, a single diffusion model with these task-specific parameters can be used to perform multiple tasks simultaneously.

Image Generation Unconditional Image Generation

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

no code implementations ICCV 2023 Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks

To this end, and capitalizing on the powerful fine-grained generative control offered by the recent diffusion-based generative models, we introduce Steered Diffusion, a generalized framework for photorealistic zero-shot conditional image generation using a diffusion model trained for unconditional generation.

Colorization Conditional Image Generation +2

AdaptiveSAM: Towards Efficient Tuning of SAM for Surgical Scene Segmentation

1 code implementation7 Aug 2023 Jay N. Paranjape, Nithin Gopalakrishnan Nair, Shameema Sikder, S. Swaroop Vedula, Vishal M. Patel

However, SAM does not generalize well to the medical domain as is without utilizing a large amount of compute resources for fine-tuning and using task-specific prompts.

Scene Segmentation Segmentation

$CrowdDiff$: Multi-hypothesis Crowd Density Estimation using Diffusion Models

1 code implementation22 Mar 2023 Yasiru Ranasinghe, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

Furthermore, as the intermediate time steps of the diffusion process are noisy, we incorporate a regression branch for direct crowd estimation only during training to improve the feature learning.

Contour Detection Crowd Counting +1

Bi-Noising Diffusion: Towards Conditional Diffusion Models with Generative Restoration Priors

no code implementations14 Dec 2022 Kangfu Mei, Nithin Gopalakrishnan Nair, Vishal M. Patel

The improvements obtained by our method suggest that the priors can be incorporated as a general plugin for improving conditional diffusion models.

Colorization Rain Removal +1

Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models

1 code implementation CVPR 2023 Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

We also introduce a novel reliability parameter that allows using different off-the-shelf diffusion models trained across various datasets during sampling time alone to guide it to the desired outcome satisfying multiple constraints.

Face Generation Face Sketch Synthesis +4

T2V-DDPM: Thermal to Visible Face Translation using Denoising Diffusion Probabilistic Models

1 code implementation19 Sep 2022 Nithin Gopalakrishnan Nair, Vishal M. Patel

In this paper, we propose a Denoising Diffusion Probabilistic Model (DDPM) based solution for T2V translation specifically for facial images.

Face Verification Person Recognition +2

AT-DDPM: Restoring Faces degraded by Atmospheric Turbulence using Denoising Diffusion Probabilistic Models

1 code implementation24 Aug 2022 Nithin Gopalakrishnan Nair, Kangfu Mei, Vishal M. Patel

In recent years, various deep learning-based single image atmospheric turbulence mitigation methods, including CNN-based and GAN inversion-based, have been proposed in the literature which attempt to remove the distortion in the image.

Image Restoration Image Super-Resolution +1

DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change Detection

1 code implementation23 Jun 2022 Wele Gedara Chaminda Bandara, Nithin Gopalakrishnan Nair, Vishal M. Patel

However, in this work, our focus is not on image synthesis but on utilizing it as a pre-trained feature extractor for the downstream application of change detection.

Change Detection Decision Making +2

Image Generation with Multimodal Priors using Denoising Diffusion Probabilistic Models

no code implementations10 Jun 2022 Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M Patel

Based on the fact that the distribution over each time step in the diffusion model is Gaussian, in this work we show that there exists a closed-form expression to the generate the image corresponds to the given modalities.

Denoising Image Generation

SAR Despeckling using a Denoising Diffusion Probabilistic Model

1 code implementation9 Jun 2022 Malsha V. Perera, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

The despeckled image is recovered by a reverse process which iteratively predicts the added noise using a noise predictor which is conditioned on the speckled image.

Change Detection Denoising

A comparison of different atmospheric turbulence simulation methods for image restoration

no code implementations19 Apr 2022 Nithin Gopalakrishnan Nair, Kangfu Mei, Vishal M. Patel

In this paper, we systematically evaluate the effectiveness of various turbulence simulation methods on image restoration.

Face Recognition Image Restoration

Cannot find the paper you are looking for? You can Submit a new open access paper.