Video Object Tracking

28 papers with code • 3 benchmarks • 11 datasets

Video Object Detection aims to detect targets in videos using both spatial and temporal information. It's usually deeply integrated with tasks such as Object Detection and Object Tracking.


Video Polyp Segmentation: A Deep Learning Perspective

DengPingFan/PraNet 27 Mar 2022

We present the first comprehensive video polyp segmentation (VPS) study in the deep learning era.

407
27 Mar 2022

BundleTrack: 6D Pose Tracking for Novel Objects without Instance or Category-Level 3D Models

wenbowen123/BundleTrack 1 Aug 2021

Most prior efforts, however, often assume that the target object's CAD model, at least at a category-level, is available for offline training or during online template matching.

581
01 Aug 2021

Do Different Tracking Tasks Require Different Appearance Models?

Zhongdao/UniTrack NeurIPS 2021

We show how most tracking tasks can be solved within this framework, and that the same appearance model can be successfully used to obtain results that are competitive against specialised methods for most of the tasks considered.

336
05 Jul 2021

Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking

594422814/TransformerTrack CVPR 2021

In video object tracking, there exist rich temporal contexts among successive frames, which have been largely overlooked in existing trackers.

256
22 Mar 2021

Hopper: Multi-hop Transformer for Spatiotemporal Reasoning

necla-ml/cater-h ICLR 2021

We evaluate over CATER dataset and find that Hopper achieves 73.2% Top-1 accuracy using just 1 FPS by hopping through just a few critical frames.

6
19 Mar 2021

TDIOT: Target-driven Inference for Deep Video Object Tracking

msprITU/TDIOT 19 Mar 2021

For effective video object tracking, object detection is integrated with a data association step performed by either a custom design inference architecture or an end-to-end joint training for tracking purpose.

1
19 Mar 2021

Attention over learned object embeddings enables complex visual reasoning

deepmind/deepmind-research NeurIPS 2021

Neural networks have achieved success in a wide array of perceptual tasks but often fail at tasks involving both perception and higher-level reasoning.

12,813
15 Dec 2020

Contrastive Transformation for Self-supervised Correspondence Learning

594422814/ContrastCorr 9 Dec 2020

It is worth mentioning that our method also surpasses the fully-supervised affinity representation (e.g., ResNet) and performs competitively against the recent fully-supervised algorithms designed for the specific tasks (e.g., VOT and VOS).

35
09 Dec 2020

ApproxDet: Content and Contention-Aware Approximate Object Detection for Mobiles

StarsThu2016/ApproxDet 21 Oct 2020

In this paper we introduce ApproxDet, an adaptive video object detection framework for mobile devices to meet accuracy-latency requirements in the face of changing content and resource contention scenarios.

11
21 Oct 2020

Fast Template Matching and Update for Video Object Tracking and Segmentation

insomnia94/FTMU CVPR 2020

Specifically, the reinforcement learning agent learns to decide whether to update the target template according to the quality of the predicted result.

22
16 Apr 2020