Search Results for author: Onkar Susladkar

Found 10 papers, 5 papers with code

MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion

1 code implementation10 Oct 2024 Onkar Susladkar, Jishu Sen Gupta, Chirag Sehgal, Sparsh Mittal, Rekha Singhal

The spatio-temporal complexity of video data presents significant challenges in tasks such as compression, generation, and inpainting.

Denoising parameter-efficient fine-tuning +5

GAFNet: A Global Fourier Self Attention Based Novel Network for multi-modal downstream tasks

no code implementations IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023 Onkar Susladkar, Gayatri Deshmukh, Dhruv Makwana, Sparsh Mittal, R Sai Chandra Teja, Rekha Singhal

We introduce a novel network, GAFNet (Global Attention Fourier Net), which learns through large-scale pre-training over three image-text datasets (COCO, SBU, and CC-3M), for achieving high performance on downstream vision and language tasks.

Image Generation Image-text Retrieval +2

TPFNet: A Novel Text In-painting Transformer for Text Removal

1 code implementation26 Oct 2022 Onkar Susladkar, Dhruv Makwana, Gayatri Deshmukh, Sparsh Mittal, Sai Chandra Teja R, Rekha Singhal

Further, we use a novel multi-headed decoder that generates a high-pass filtered image and a segmentation map, in addition to a text-free image.

Image Generation Segmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.