Visual Tracking
168 papers with code • 9 benchmarks • 26 datasets
Visual Tracking is an essential and actively researched problem in the field of computer vision with various real-world applications such as robotic services, smart surveillance systems, autonomous driving, and human-computer interaction. It refers to the automatic estimation of the trajectory of an arbitrary target object, usually specified by a bounding box in the first frame, as it moves around in subsequent video frames.
Source: Learning Reinforced Attentional Representation for End-to-End Visual Tracking
Libraries
Use these libraries to find Visual Tracking models and implementationsLatest papers with no code
Learning Tracking Representations from Single Point Annotations
In this paper, we propose to learn tracking representations from single point annotations (i. e., 4. 5x faster to annotate than the traditional bounding box) in a weakly supervised manner.
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL
We evaluate our tracker on several high-fidelity environments with challenging situations, such as distraction and occlusion.
Multi-attention Associate Prediction Network for Visual Tracking
They are capable of fully capturing the category-related semantics for classification and the local spatial contexts for regression, respectively.
Pedestrian Tracking with Monocular Camera using Unconstrained 3D Motion Model
A first-principle single-object model is proposed for pedestrian tracking.
A Spectrum-based Image Denoising Method with Edge Feature Enhancement
Image denoising stands as a critical challenge in image processing and computer vision, aiming to restore the original image from noise-affected versions caused by various intrinsic and extrinsic factors.
Autoregressive Queries for Adaptive Tracking with Spatio-TemporalTransformers
Firstly, we introduce a set of learnable and autoregressive queries to capture the instantaneous target appearance changes in a sliding window fashion.
Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance
The shared embeddings, which describe the absolute coordinates of multi-resolution images (namely, the template and search images), are inherited from the pre-trained backbones.
Motion-Guided Dual-Camera Tracker for Low-Cost Skill Evaluation of Gastric Endoscopy
In this work, a motion-guided dual-camera tracker is proposed to provide reliable endoscope tip position feedback at a low cost inside a mechanical simulator for endoscopy skill evaluation, tackling several unique challenges.
Visual tracking brain computer interface
Brain-computer interfaces (BCIs) offer a way to interact with computers without relying on physical movements.
HIPTrack: Visual Tracking with Historical Prompts
In this paper, we show that by providing a tracker that follows Siamese paradigm with precise and updated historical information, a significant performance improvement can be achieved with completely unchanged parameters.