Search Results for author: Sayak Paul

Found 10 papers, 7 papers with code

Getting it Right: Improving Spatial Consistency in Text-to-Image Models

1 code implementation1 Apr 2024 Agneet Chatterjee, Gabriela Ben Melech Stan, Estelle Aflalo, Sayak Paul, Dhruba Ghosh, Tejas Gokhale, Ludwig Schmidt, Hannaneh Hajishirzi, Vasudev Lal, Chitta Baral, Yezhou Yang

One of the key shortcomings in current text-to-image (T2I) models is their inability to consistently generate images which faithfully follow the spatial relationships specified in the text prompt.

DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models

no code implementations27 Feb 2024 Shyam Marjit, Harshit Singh, Nityanand Mathur, Sayak Paul, Chia-Mu Yu, Pin-Yu Chen

In the realm of subject-driven text-to-image (T2I) generative models, recent developments like DreamBooth and BLIP-Diffusion have led to impressive results yet encounter limitations due to their intensive fine-tuning demands and substantial parameter requirements.

Image Generation

PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models

1 code implementation10 Jan 2024 Junsong Chen, Yue Wu, Simian Luo, Enze Xie, Sayak Paul, Ping Luo, Hang Zhao, Zhenguo Li

As a state-of-the-art, open-source image generation model, PIXART-{\delta} offers a promising alternative to the Stable Diffusion family of models, contributing significantly to text-to-image synthesis.

Image Generation

Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss

1 code implementation5 Jan 2024 Yatharth Gupta, Vishnu V. Jaddipal, Harish Prabhala, Sayak Paul, Patrick von Platen

In this work, we introduce two scaled-down variants, Segmind Stable Diffusion (SSD-1B) and Segmind-Vega, with 1. 3B and 0. 74B parameter UNets, respectively, achieved through progressive removal using layer-level losses focusing on reducing the model size while preserving generative quality.

Knowledge Distillation

Flood Segmentation on Sentinel-1 SAR Imagery with Semi-Supervised Learning

1 code implementation NeurIPS Workshop AI4Scien 2021 Sayak Paul, Siddha Ganju

Floods wreak havoc throughout the world, causing billions of dollars in damages, and uprooting communities, ecosystems and economies.

Disaster Response Semantic Segmentation

Vision Transformers are Robust Learners

1 code implementation17 May 2021 Sayak Paul, Pin-Yu Chen

Transformers, composed of multiple self-attention layers, hold strong promises toward a generic learning primitive applicable to different data modalities, including the recent breakthroughs in computer vision achieving state-of-the-art (SOTA) standard accuracy.

Anomaly Detection Image Classification +1

G-SimCLR: Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling

1 code implementation28 Sep 2020 Souradip Chakraborty, Aritra Roy Gosthipaty, Sayak Paul

In this work, we propose that, with the normalized temperature-scaled cross-entropy (NT-Xent) loss function (as used in SimCLR), it is beneficial to not have images of the same category in the same batch.

Contrastive Learning Denoising +2

G-SimCLR : Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling

1 code implementation25 Sep 2020 Souradip Chakraborty, Aritra Roy Gosthipaty, Sayak Paul

In this work, we propose that, with the normalized temperature-scaled cross-entropy (NT-Xent) loss function (as used in SimCLR), it is beneficial to not have images of the same category in the same batch.

Contrastive Learning Denoising +1

Cannot find the paper you are looking for? You can Submit a new open access paper.