RGB-T Tracking

12 papers with code • 4 benchmarks • 2 datasets

RGBT tracking, or RGB-Thermal tracking, is a computer vision task in which objects are tracked using both RGB (red, green, blue) and thermal infrared imagery. Combining the two modalities improves object detection and tracking performance, particularly in challenging environments where lighting varies or is limited. Each modality compensates for the other's limitations: RGB cameras cannot capture clear images in low light or adverse weather, while thermal cameras cannot accurately capture object details. RGBT tracking algorithms typically fuse information from the RGB and thermal sensors to track objects robustly and accurately in scenarios ranging from surveillance and security to autonomous vehicles and search and rescue operations.
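As a rough illustration of the fusion idea described above, the sketch below blends an RGB feature vector and a thermal feature vector using softmax-normalized reliability scores. It is a toy example and not the method of any paper listed here: the function names `softmax` and `fuse_rgbt`, the feature values, and the reliability scores are all hypothetical, and real trackers learn both the features and the fusion weights.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that sum to 1 (numerically stable)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_rgbt(rgb_feat, tir_feat, rgb_score, tir_score):
    """Blend aligned RGB and thermal features by modality reliability.

    rgb_score and tir_score are hypothetical per-modality reliability
    scores; a higher score gives that modality more weight.
    """
    if len(rgb_feat) != len(tir_feat):
        raise ValueError("RGB and thermal features must have equal length")
    w_rgb, w_tir = softmax([rgb_score, tir_score])
    fused = [w_rgb * r + w_tir * t for r, t in zip(rgb_feat, tir_feat)]
    return fused, (w_rgb, w_tir)


# Assumed low-light scene: the thermal response is strong while the RGB
# response is weak, so the (made-up) scores favour the thermal modality.
rgb_feat = [0.1, 0.0, 0.2, 0.1]
tir_feat = [0.9, 0.8, 0.7, 1.0]
fused, weights = fuse_rgbt(rgb_feat, tir_feat, rgb_score=0.0, tir_score=2.0)
print("weights (rgb, tir):", weights)
print("fused features:", fused)
```

Because the weights form a convex combination, each fused value lies between the corresponding RGB and thermal values, and the more reliable modality dominates the result.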

Long-term Frame-Event Visual Tracking: Benchmark Dataset and Baseline

event-ahu/eventvot_benchmark 9 Mar 2024

Current event-/frame-event based trackers undergo evaluation on short-term tracking datasets; however, the tracking of real-world scenarios involves long-term tracking, and the performance of existing tracking algorithms in these scenarios remains unclear.

38
09 Mar 2024

Bi-directional Adapter for Multi-modal Tracking

sparktempest/bat 17 Dec 2023

To handle this problem, we propose a novel multi-modal visual prompt tracking model based on a universal bi-directional adapter, cross-prompting multiple modalities mutually.

39
17 Dec 2023

RGB-T Tracking via Multi-Modal Mutual Prompt Learning

husteryoung/mplt 31 Aug 2023

Object tracking based on the fusion of visible and thermal images, known as RGB-T tracking, has gained increasing attention from researchers in recent years.

18
31 Aug 2023

Visual Prompt Multi-Modal Tracking

jiawen-zhu/vipt CVPR 2023

To inherit the powerful representations of the foundation model, a natural modus operandi for multi-modal tracking is full fine-tuning on the RGB-based parameters.

227
20 Mar 2023

Bridging Search Region Interaction With Template for RGB-T Tracking

ryanhtr/tbsi CVPR 2023

To alleviate these limitations, we propose a novel Template-Bridged Search region Interaction (TBSI) module which exploits templates as the medium to bridge the cross-modal interaction between RGB and TIR search regions by gathering and distributing target-relevant object and environment contexts.

33
01 Jan 2023

Visible-Thermal UAV Tracking: A Large-Scale Benchmark and New Baseline

zhang-pengyu/HMFT CVPR 2022

With the popularity of multi-modal sensors, visible-thermal (RGB-T) object tracking aims to achieve robust performance and wider application scenarios with the guidance of objects' temperature information.

36
08 Apr 2022

Attribute-Based Progressive Fusion Network for RGBT Tracking

yangmengmeng1997/APFNet AAAI 2022

RGBT tracking usually suffers from various challenging factors of fast motion, scale variation, illumination variation, thermal crossover and occlusion, to name a few.

35
26 Jan 2022

MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking

wangxiao5791509/MFG_RGBT_Tracking_PyTorch 22 Jul 2021

The visible and thermal filters will be used to conduct a dynamic convolutional operation on their corresponding input feature maps respectively.

15
22 Jul 2021

LasHeR: A Large-scale High-diversity Benchmark for RGBT Tracking

BUGPLEASEOUT/LasHeR 27 Apr 2021

RGBT tracking receives a surge of interest in the computer vision community, but this research field lacks a large-scale and high-diversity benchmark dataset, which is essential for both the training of deep RGBT trackers and the comprehensive evaluation of RGBT tracking methods.

39
27 Apr 2021

Multi-modal Visual Tracking: Review and Experimental Comparison

zhang-pengyu/Multimodal_tracking_survey 8 Dec 2020

Visual object tracking, as a fundamental task in computer vision, has drawn much attention in recent years.

74
08 Dec 2020