Search Results for author: Vishal Chudasama

Found 8 papers, 1 papers with code

Precise Event Spotting in Sports Videos: Solving Long-Range Dependency and Class Imbalance

no code implementations28 Feb 2025 Sanchayan Santra, Vishal Chudasama, Pankaj Wasnik, Vineeth N. Balasubramanian

Particularly, we propose a network with a convolutional spatial-temporal feature extractor enhanced with our proposed Adaptive Spatio-Temporal Refinement Module (ASTRM) and a long-range temporal module.

Open-Set Object Detection By Aligning Known Class Representations

no code implementations30 Dec 2024 Hiran Sarkar, Vishal Chudasama, Naoyuki Onoe, Pankaj Wasnik, Vineeth N Balasubramanian

Open-Set Object Detection (OSOD) has emerged as a contemporary research direction to address the detection of unknown objects.

Clustering object-detection +1

Cross-Modal Fusion and Attention Mechanism for Weakly Supervised Video Anomaly Detection

no code implementations CVPR 2024 Ayush Ghadiya, Purbayan Kar, Vishal Chudasama, Pankaj Wasnik

Recently, weakly supervised video anomaly detection (WS-VAD) has emerged as a contemporary research direction to identify anomaly events like violence and nudity in videos using only video-level labels.

Anomaly Detection Graph Attention +1

Beyond Few-shot Object Detection: A Detailed Survey

no code implementations26 Aug 2024 Vishal Chudasama, Hiran Sarkar, Pankaj Wasnik, Vineeth N Balasubramanian, Jayateja Kalla

While traditional FSOD methods have been studied before, this survey paper comprehensively reviews FSOD research with a specific focus on covering different FSOD settings such as standard FSOD, generalized FSOD, incremental FSOD, open-set FSOD, and domain adaptive FSOD.

Few-Shot Learning Few-Shot Object Detection +3

Fiducial Focus Augmentation for Facial Landmark Detection

no code implementations23 Feb 2024 Purbayan Kar, Vishal Chudasama, Naoyuki Onoe, Pankaj Wasnik, Vineeth Balasubramanian

To effectively utilize the newly proposed augmentation technique, we employ a Siamese architecture-based training mechanism with a Deep Canonical Correlation Analysis (DCCA)-based loss to achieve collective learning of high-level feature representations from two different views of the input images.

 Ranked #1 on Facial Landmark Detection on WFLW (FR@10 (inter-ocular) metric)

Face Alignment Facial Landmark Detection +1

Cannot find the paper you are looking for? You can Submit a new open access paper.