no code implementations • ECCV 2020 • Sayan Rakshit, Dipesh Tamboli, Pragati Shuddhodhan Meshram, Biplab Banerjee, Gemma Roig, Subhasis Chaudhuri
Besides, an adversarial learning strategy is followed to model the discriminator between the target-domain known and unknown classes.
no code implementations • 16 Jan 2025 • Izzeddin Teeti, Aniket Thomas, Munish Monga, Sachin Kumar, Uddeshya Singh, Andrew Bradley, Biplab Banerjee, Fabio Cuzzolin
We present ASTRA (A} Scene-aware TRAnsformer-based model for trajectory prediction), a light-weight pedestrian trajectory forecasting model that integrates the scene context, spatial dynamics, social inter-agent interactions and temporal progressions for precise forecasting.
no code implementations • 28 Dec 2024 • Sharath Naganna, Saprativa Bhattacharjee, Pushpak Bhattacharyya, Biplab Banerjee
For the first time, we introduce the task of automatically detecting humblebragging in text.
no code implementations • 12 Dec 2024 • Sai Bhargav Rongali, Mohamad Hassan N C, Ankit Jha, Neha Bhargava, Saurabh Prasad, Biplab Banerjee
This paper tackles the intricate challenge of video question-answering (VideoQA).
no code implementations • 10 Dec 2024 • Aryan Bhosale, Samrat Mukherjee, Biplab Banerjee, Fabio Cuzzolin
This paper explores the utility of diffusion-based models for anomaly detection, focusing on their efficacy in identifying deviations in both compact and high-resolution datasets.
no code implementations • 1 Dec 2024 • Jack Belham, Aryan Bhosale, Samrat Mukherjee, Biplab Banerjee, Fabio Cuzzolin
The aim of this paper is to formalise the task of continual semi-supervised anomaly detection (CSAD), with the aim of highlighting the importance of such a problem formulation which assumes as close to real-world conditions as possible.
no code implementations • 21 Nov 2024 • Sanchar Palit, Biplab Banerjee, Subhasis Chaudhuri
In the continual learning framework employing variational inference, our study introduces a regularization term that specifically targets the dynamics and population of the mean and variance of the parameters.
no code implementations • 4 Nov 2024 • Bhupendra Solanki, Ashwin Nair, Mainak Singha, Souradeep Mukhopadhyay, Ankit Jha, Biplab Banerjee
Generalized Category Discovery (GCD) aims to cluster unlabeled images into known and novel categories using labeled images from known classes.
1 code implementation • 26 Sep 2024 • Sandika Biswas, Qianyi Wu, Biplab Banerjee, Hamid Rezatofighi
Despite advancements in Neural Implicit models for 3D surface reconstruction, handling dynamic environments with interactions between arbitrary rigid, non-rigid, or deformable entities remains challenging.
no code implementations • 31 Aug 2024 • Sayan Rakshit, Hmrishav Bandyopadhyay, Nibaran Das, Biplab Banerjee
In the second step, this pseudo source is adapted to the present target domain.
1 code implementation • 31 Aug 2024 • Munish Monga, Sachin Kumar Giroh, Ankit Jha, Mainak Singha, Biplab Banerjee, Jocelyn Chanussot
To the best of our knowledge, COSMo is the first method to address Open-Set Multi-Target DA (OSMTDA), offering a more realistic representation of real-world scenarios and addressing the challenges of both open-set and multi-target DA.
Ranked #1 on Open-Set Multi-Target Domain Adaptation on Office-31
Multi-target Domain Adaptation Open-Set Multi-Target Domain Adaptation
1 code implementation • 5 Jul 2024 • Mainak Singha, Ankit Jha, Divyam Gupta, Pranav Singla, Biplab Banerjee
SpLIP implements a bi-directional prompt-sharing strategy that enables mutual knowledge exchange between CLIP's visual and textual encoders, fostering a more cohesive and synergistic prompt processing mechanism that significantly reduces the semantic gap between the sketch and photo embeddings.
no code implementations • 2 May 2024 • Anurag Kumar, Chinmay Bharti, Saikat Dutta, Srikrishna Karanam, Biplab Banerjee
Our proposed framework not only empowers the model to embrace novel classes with limited data, but also ensures the preservation of performance on base classes.
class-incremental learning Few-Shot Class-Incremental Learning +2
no code implementations • 8 Apr 2024 • Sai Bhargav Rongali, Sarthak Mehrotra, Ankit Jha, Mohamad Hassan N C, Shirsha Bose, Tanisha Gupta, Mainak Singha, Biplab Banerjee
In Generalized Category Discovery (GCD), we cluster unlabeled samples of known and novel classes, leveraging a training dataset of known classes.
1 code implementation • 31 Mar 2024 • Mainak Singha, Ankit Jha, Shirsha Bose, Ashwin Nair, Moloud Abdar, Biplab Banerjee
Central to our approach is modeling a unique prompt tailored for detecting unknown class samples, and to train this, we employ a readily accessible stable diffusion model, elegantly generating proxy images for the open class.
no code implementations • 27 Mar 2024 • Valay Bundele, Mahesh Bhupati, Biplab Banerjee, Aditya Grover
The study of vision-and-language navigation (VLN) has typically relied on expert trajectories, which may not always be available in real-world situations due to the significant effort required to collect them.
1 code implementation • CVPR 2024 • Mainak Singha, Ankit Jha, Shirsha Bose, Ashwin Nair, Moloud Abdar, Biplab Banerjee
We delve into Open Domain Generalization (ODG) marked by domain and category shifts between training's labeled source and testing's unlabeled target domains.
no code implementations • 27 Nov 2023 • Avigyan Bhattacharya, Mainak Singha, Ankit Jha, Biplab Banerjee
To this end, we introduce C-SAW, a method that complements CLIP with a self-supervised loss in the visual space and a novel prompt learning technique that emphasizes both visual domain and content-specific features.
no code implementations • 5 Nov 2023 • Prathmesh Bele, Valay Bundele, Avigyan Bhattacharya, Ankit Jha, Gemma Roig, Biplab Banerjee
Single-source open-domain generalization (SS-ODG) addresses the challenge of labeled source domains with supervision during training and unlabeled novel target domains during testing.
no code implementations • 23 Sep 2023 • Ankit Jha, Debabrata Pal, Mainak Singha, Naman Agarwal, Biplab Banerjee
Even though joint training of audio-visual modalities improves classification performance in a low-data regime, it has yet to be thoroughly investigated in the RS domain.
1 code implementation • ICCV 2023 • Debabrata Pal, Deeptej More, Sai Bhargav, Dipesh Tamboli, Vaneet Aggarwal, Biplab Banerjee
Few-shot learning has made impressive strides in addressing the crucial challenges of recognizing unknown samples from novel classes in target query sets and managing visual shifts between domains.
Ranked #1 on cross-domain few-shot learning on Office-Home
1 code implementation • 22 Aug 2023 • Mainak Singha, Ankit Jha, Biplab Banerjee
GOPro is trained end-to-end on all three loss objectives, combining the strengths of CLIP and SSL in a principled manner.
1 code implementation • 10 Aug 2023 • Mainak Singha, Harsh Pal, Ankit Jha, Biplab Banerjee
We leverage the frozen vision backbone of CLIP to extract both image style (domain) and content information, which we apply to learn prompt tokens.
no code implementations • 8 Aug 2023 • Izzeddin Teeti, Rongali Sai Bhargav, Vivek Singh, Andrew Bradley, Biplab Banerjee, Fabio Cuzzolin
The emerging field of action prediction plays a vital role in various computer vision applications such as autonomous driving, activity analysis and human-computer interaction.
no code implementations • 27 Jul 2023 • Sandika Biswas, Kejie Li, Biplab Banerjee, Subhasis Chaudhuri, Hamid Rezatofighi
This paper proposes using an implicit feature representation of the scene elements to distinguish a physically plausible alignment of humans and objects from an implausible one.
no code implementations • 25 Jun 2023 • Jayesh Songara, Shivam Pande, Shabnam Choudhury, Biplab Banerjee, Rajbabu Velmurugan
In this research, we deal with the problem of visual question answering (VQA) in remote sensing.
no code implementations • 19 Jun 2023 • Shivam Pande, Nassim Ait Ali Braham, Yi Wang, Conrad M Albrecht, Biplab Banerjee, Xiao Xiang Zhu
Recently, to effectively train the deep learning models with minimal labelled samples, the unlabeled samples are also being leveraged in self-supervised and semi-supervised setting.
no code implementations • 27 May 2023 • Vikrant Rangnekar, Uddeshya Upadhyay, Zeynep Akata, Biplab Banerjee
Dense regression is a widely used approach in computer vision for tasks such as image super-resolution, enhancement, depth estimation, etc.
1 code implementation • 12 Apr 2023 • Mainak Singha, Ankit Jha, Bhupendra Solanki, Shirsha Bose, Biplab Banerjee
APPLeNet emphasizes the importance of multi-scale feature learning in RS scene classification and disentangles visual style and content primitives for domain generalization tasks.
no code implementations • 18 Feb 2023 • Shirsha Bose, Ankit Jha, Enrico Fini, Mainak Singha, Elisa Ricci, Biplab Banerjee
Our method focuses on a domain-agnostic prompt learning strategy, aiming to disentangle the visual style and content information embedded in CLIP's pre-trained vision encoder, enabling effortless adaptation to novel domains during inference.
no code implementations • 18 Feb 2023 • Shirsha Bose, Ritesh Sur Chowdhury, Debabrata Pal, Shivashish Bose, Biplab Banerjee, Subhasis Chaudhuri
Efficient road and building footprint extraction from satellite images are predominant in many remote sensing applications.
no code implementations • 5 Nov 2022 • Sanchar Palit, Biplab Banerjee, Subhasis Chaudhuri
We demonstrate the effectiveness of our method by showing that the embedding space remains intact after training the model with new classes and outperforms existing state-of-the-art algorithms in terms of accuracy across different sessions.
class-incremental learning Few-Shot Class-Incremental Learning +1
1 code implementation • 12 Feb 2022 • Advait Kumar, Dipesh Tamboli, Shivam Pande, Biplab Banerjee
We tackle the problem of image inpainting in the remote sensing domain.
no code implementations • 25 Jan 2022 • Sumrit Gupta, Ushasi Chaudhuri, Biplab Banerjee
To bridge the domain gap between the visual features, we propose minimizing the Wasserstein distance between images and sketches in a learned domain-shared space.
no code implementations • 17 Jan 2022 • Ushasi Chaudhuri, Ruchika Chavan, Biplab Banerjee, Anjan Dutta, Zeynep Akata
The efficacy of zero-shot sketch-based image retrieval (ZS-SBIR) models is governed by two challenges.
no code implementations • 28 Dec 2021 • Sayan Rakshit, Anwesh Mohanty, Ruchika Chavhan, Biplab Banerjee, Gemma Roig, Subhasis Chaudhuri
Inspired by the notion of generative feature replay, we propose a novel framework called Feature Replay based Incremental Domain Adaptation (FRIDA) which leverages a new incremental generative adversarial network (GAN) called domain-generic auxiliary classification GAN (DGAC-GAN) for producing domain-specific feature representations seamlessly.
Generative Adversarial Network Unsupervised Domain Adaptation
no code implementations • 7 Aug 2021 • Hmrishav Bandyopadhyay, Shuvayan Ghosh Dastidar, Bisakh Mondal, Biplab Banerjee, Nibaran Das
Presently, Covid-19 is a serious threat to the world at large.
no code implementations • 24 Jul 2021 • Rupak Bose, Shivam Pande, Biplab Banerjee
The model is composed of stacked auto encoders that harness the cross key-value pairs for HSI and LiDAR, thus establishing a communication between the two modalities, while simultaneously using the CNNs to extract the spectral and spatial information from HSI and LiDAR.
no code implementations • 18 Jul 2021 • Pranjal Jain, Shreyas Goenka, Saurabh Bagchi, Biplab Banerjee, Somali Chaterji
Federated learning allows a large number of devices to jointly learn a model without sharing data.
no code implementations • 20 Apr 2021 • Ushasi Chaudhuri, Biplab Banerjee, Avik Bhattacharya, Mihai Datcu
While we define a cross-modal triplet loss to ensure the discriminative nature of the shared space, an innovative cross-modal attention learning strategy is also proposed to guide feature extraction from the image domain exploiting information from the respective sketch counterpart.
no code implementations • 9 Apr 2021 • Sudipan Saha, Biplab Banerjee, Xiao Xiang Zhu
Deep learning (DL) based supervised change detection (CD) models require large labeled training data.
no code implementations • 25 Oct 2020 • Ayush Bhardwaj, Sakshee Pimpale, Saurabh Kumar, Biplab Banerjee
We demonstrate the effectiveness of the proposed method via various experiments on how it allows us to obtain a much smaller model, which takes a minimal hit in performance while being capable of open set recognition for 3D point cloud data.
no code implementations • 5 Oct 2020 • Ruchika Chavhan, Biplab Banerjee, Xiao Xiang Zhu, Subhasis Chaudhuri
We deal with the problem of generating textual captions from optical remote sensing (RS) images using the notion of deep reinforcement learning.
no code implementations • 10 Sep 2020 • Satyam Mohla, Sidharth Mohla, Anupam Guha, Biplab Banerjee
However, the task to segment burn marks is difficult because of its indistinguishably with similar looking land patterns, severe fragmented nature of burn marks and partially labelled noisy datasets.
1 code implementation • 12 Aug 2020 • Ushasi Chaudhuri, Biplab Banerjee, Avik Bhattacharya, Mihai Datcu
We perform a thorough bench-marking of this dataset and demonstrate that the proposed network outperforms other state-of-the-art methods for zero-shot sketch-based retrieval framework in remote sensing.
no code implementations • 31 Jul 2020 • Sravanti Addepalli, Dipesh Tamboli, R. Venkatesh Babu, Biplab Banerjee
Existing visualization methods develop high confidence images consisting of both background and foreground features.
1 code implementation • 25 Jun 2020 • Satyam Mohla, Anshul Nasery, Biplab Banerjee
Recent experiments in computer vision demonstrate texture bias as the primary reason for supreme results in models employing Convolutional Neural Networks (CNNs), conflicting with early works claiming that these networks identify objects using shape.
no code implementations • 20 May 2020 • Abhinaba Roy, Biplab Banerjee, Amir Hussain, Soujanya Poria
Specifically, we pose the selection of potent local descriptors as filtering based feature selection problem which ranks the local features per category based on a novel measure of distinctiveness.
no code implementations • 9 May 2020 • Omkar Gune, Mainak Pal, Preeti Mukherjee, Biplab Banerjee, Subhasis Chaudhuri
We propose a transductive approach to reduce the effect of domain shift, where we utilize unlabeled visual data from unseen classes to generate corresponding semantic features for unseen class visual samples.
no code implementations • 1 May 2020 • Ravi Kumar Kushawaha, Saurabh Kumar, Biplab Banerjee, Rajbabu Velmurugan
We also present a multi-stage knowledge distillation technique for SNNs using an intermediate network to obtain higher performance from the student network.
no code implementations • 6 Apr 2020 • Mukesh Kumar Vishal, Dipesh Tamboli, Abhijeet Patil, Rohit Saluja, Biplab Banerjee, Amit Sethi, Dhandapani Raju, Sudhir Kumar, R N Sahoo, Viswanathan Chinnusamy, J Adinarayana
The present investigation is carried out for discriminating drought tolerant, and susceptible genotypes.
2 code implementations • 19 Oct 2019 • Megh Shukla, Biplab Banerjee, Krishna Mohan Buddhiraju
Some popular techniques among these falter when applied to Hyperspectral Imagery due to the famed curse of dimensionality.
no code implementations • 28 Aug 2019 • Saurabh Kumar, Biplab Banerjee, Subhasis Chaudhuri
We deal with the problem of information fusion driven satellite image/scene classification and propose a generic hallucination architecture considering that all the available sensor information are present during training while some of the image modalities may be absent while testing.
1 code implementation • 9 Apr 2019 • Ushasi Chaudhuri, Biplab Banerjee, Avik Bhattacharya, Mihai Datcu
In particular, we are interested in two application scenarios: i) cross-modal retrieval between panchromatic (PAN) and multi-spectral imagery, and ii) multi-label image retrieval between very high resolution (VHR) images and speech based label annotations.
no code implementations • 4 Aug 2017 • Akashdeep Goel, Biplab Banerjee, Aleksandra Pizurica
We address the problem of scene classification from optical remote sensing (RS) images based on the paradigm of hierarchical metric learning.