Search Results for author: Sayak Paul

Found 10 papers, 7 papers with code

Getting it Right: Improving Spatial Consistency in Text-to-Image Models

1 code implementation • 1 Apr 2024 • Agneet Chatterjee, Gabriela Ben Melech Stan, Estelle Aflalo, Sayak Paul, Dhruba Ghosh, Tejas Gokhale, Ludwig Schmidt, Hannaneh Hajishirzi, Vasudev Lal, Chitta Baral, Yezhou Yang

One of the key shortcomings in current text-to-image (T2I) models is their inability to consistently generate images which faithfully follow the spatial relationships specified in the text prompt.

Paper
Code

DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models

no code implementations • 27 Feb 2024 • Shyam Marjit, Harshit Singh, Nityanand Mathur, Sayak Paul, Chia-Mu Yu, Pin-Yu Chen

In the realm of subject-driven text-to-image (T2I) generative models, recent developments like DreamBooth and BLIP-Diffusion have led to impressive results yet encounter limitations due to their intensive fine-tuning demands and substantial parameter requirements.

Image Generation

Paper
Add Code

PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models

1 code implementation • 10 Jan 2024 • Junsong Chen, Yue Wu, Simian Luo, Enze Xie, Sayak Paul, Ping Luo, Hang Zhao, Zhenguo Li

As a state-of-the-art, open-source image generation model, PIXART-{\delta} offers a promising alternative to the Stable Diffusion family of models, contributing significantly to text-to-image synthesis.

Image Generation

2,110

Paper
Code

Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss

1 code implementation • 5 Jan 2024 • Yatharth Gupta, Vishnu V. Jaddipal, Harish Prabhala, Sayak Paul, Patrick von Platen

In this work, we introduce two scaled-down variants, Segmind Stable Diffusion (SSD-1B) and Segmind-Vega, with 1. 3B and 0. 74B parameter UNets, respectively, achieved through progressive removal using layer-level losses focusing on reducing the model size while preserving generative quality.

Knowledge Distillation

154

Paper
Code

Flood Segmentation on Sentinel-1 SAR Imagery with Semi-Supervised Learning

1 code implementation • NeurIPS Workshop AI4Scien 2021 • Sayak Paul, Siddha Ganju

Floods wreak havoc throughout the world, causing billions of dollars in damages, and uprooting communities, ecosystems and economies.

Disaster Response Semantic Segmentation

152

Paper
Code

Vision Transformers are Robust Learners

1 code implementation • 17 May 2021 • Sayak Paul, Pin-Yu Chen

Transformers, composed of multiple self-attention layers, hold strong promises toward a generic learning primitive applicable to different data modalities, including the recent breakthroughs in computer vision achieving state-of-the-art (SOTA) standard accuracy.

Anomaly Detection Image Classification +1

120

Paper
Code

Fast and Accurate Quantized Camera Scene Detection on Smartphones, Mobile AI 2021 Challenge: Report

no code implementations • 17 May 2021 • Andrey Ignatov, Grigory Malivenko, Radu Timofte, Sheng Chen, Xin Xia, Zhaoyan Liu, Yuwei Zhang, Feng Zhu, Jiashi Li, Xuefeng Xiao, Yuan Tian, Xinglong Wu, Christos Kyrkou, Yixin Chen, Zexin Zhang, Yunbo Peng, Yue Lin, Saikat Dutta, Sourya Dipta Das, Nisarg A. Shah, Himanshu Kumar, Chao Ge, Pei-Lin Wu, Jin-Hua Du, Andrew Batutin, Juan Pablo Federico, Konrad Lyda, Levon Khojoyan, Abhishek Thanki, Sayak Paul, Shahid Siddiqui

To address this problem, we introduce the first Mobile AI challenge, where the target is to develop quantized deep learning-based camera scene classification solutions that can demonstrate a real-time performance on smartphones and IoT platforms.

Scene Classification

Paper
Add Code

G-SimCLR: Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling

1 code implementation • 28 Sep 2020 • Souradip Chakraborty, Aritra Roy Gosthipaty, Sayak Paul

In this work, we propose that, with the normalized temperature-scaled cross-entropy (NT-Xent) loss function (as used in SimCLR), it is beneficial to not have images of the same category in the same batch.

Contrastive Learning Denoising +2

Paper
Code

G-SimCLR : Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling

1 code implementation • 25 Sep 2020 • Souradip Chakraborty, Aritra Roy Gosthipaty, Sayak Paul

Contrastive Learning Denoising +1

Paper
Code

A Review of Deep Learning with Special Emphasis on Architectures, Applications and Recent Trends

no code implementations • 30 May 2019 • Saptarshi Sengupta, Sanchita Basak, Pallabi Saikia, Sayak Paul, Vasilios Tsalavoutis, Frederick Atiah, Vadlamani Ravi, Alan Peters

Where does one start?

Fault Detection Time Series Forecasting

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.