no code implementations • 6 Aug 2023 • Onkar Susladkar, Prajwal Gatti, Anand Mishra
In this work, we study the task of ``visually" translating scene text from a source language (e. g., English) to a target language (e. g., Chinese).
no code implementations • 26 Jun 2023 • Prashant Kumar, Dhruv Makwana, Onkar Susladkar, Anurag Mittal, Prem Kumar Kalra
In the real world however, LiDAR scans consist of non-stationary dynamic structures - moving and movable objects.
no code implementations • IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023 • Onkar Susladkar, Gayatri Deshmukh, Dhruv Makwana, Sparsh Mittal, R Sai Chandra Teja, Rekha Singhal
We introduce a novel network, GAFNet (Global Attention Fourier Net), which learns through large-scale pre-training over three image-text datasets (COCO, SBU, and CC-3M), for achieving high performance on downstream vision and language tasks.
1 code implementation • 26 Oct 2022 • Onkar Susladkar, Dhruv Makwana, Gayatri Deshmukh, Sparsh Mittal, Sai Chandra Teja R, Rekha Singhal
Further, we use a novel multi-headed decoder that generates a high-pass filtered image and a segmentation map, in addition to a text-free image.
1 code implementation • 13 Jul 2022 • Dhruv Makwana, Subhrajit Nag, Onkar Susladkar, Gayatri Deshmukh, Sai Chandra Teja R, Sparsh Mittal, C Krishna Mohan
We propose a novel deep learning model named ACLNet, for cloud segmentation from ground images.
Ranked #1 on Semantic Segmentation on SWINySEG