Video Object Tracking

28 papers with code • 3 benchmarks • 11 datasets

Video Object Detection aims to detect targets in videos using both spatial and temporal information. It's usually deeply integrated with tasks such as Object Detection and Object Tracking.

Benchmarks

Add a Result

These leaderboards are used to track progress in Video Object Tracking

Dataset	Best Model	Compare
CATER	Loci	See all
GOT-10k	TATrack-L-GOT	See all
SoccerNet-v2	CO-MOT	See all

Libraries

Use these libraries to find Video Object Tracking models and implementations

open-mmlab/mmaction2

3 papers

3,876

Datasets

Most implemented papers

Most implemented Social Latest No code

Learning Object Permanence from Video

ofrikleinfeld/ObjectPermanence • • ECCV 2020

The fourth subtask, where a target object is carried by a containing object, is particularly challenging because it requires a system to reason about a moving location of an invisible object.

Paper
Code

Fast Template Matching and Update for Video Object Tracking and Segmentation

insomnia94/FTMU • • CVPR 2020

Specifically, the reinforcement learning agent learns to decide whether to update the target template according to the quality of the predicted result.

Paper
Code

ApproxDet: Content and Contention-Aware Approximate Object Detection for Mobiles

StarsThu2016/ApproxDet • • 21 Oct 2020

In this paper we introduce ApproxDet, an adaptive video object detection framework for mobile devices to meet accuracy-latency requirements in the face of changing content and resource contention scenarios.

Paper
Code

Contrastive Transformation for Self-supervised Correspondence Learning

594422814/ContrastCorr • • 9 Dec 2020

It is worth mentioning that our method also surpasses the fully-supervised affinity representation (e. g., ResNet) and performs competitively against the recent fully-supervised algorithms designed for the specific tasks (e. g., VOT and VOS).

Paper
Code

Attention over learned object embeddings enables complex visual reasoning

deepmind/deepmind-research • • NeurIPS 2021

Neural networks have achieved success in a wide array of perceptual tasks but often fail at tasks involving both perception and higher-level reasoning.

Paper
Code

Hopper: Multi-hop Transformer for Spatiotemporal Reasoning

necla-ml/cater-h • • ICLR 2021

We evaluate over CATER dataset and find that Hopper achieves 73. 2% Top-1 accuracy using just 1 FPS by hopping through just a few critical frames.

Paper
Code

TDIOT: Target-driven Inference for Deep Video Object Tracking

msprITU/TDIOT • • 19 Mar 2021

For effective video object tracking, object detection is integrated with a data association step performed by either a custom design inference architecture or an end-to-end joint training for tracking purpose.

Paper
Code

Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking

594422814/TransformerTrack • • CVPR 2021

In video object tracking, there exist rich temporal contexts among successive frames, which have been largely overlooked in existing trackers.

Paper
Code

Do Different Tracking Tasks Require Different Appearance Models?

Zhongdao/UniTrack • • NeurIPS 2021

We show how most tracking tasks can be solved within this framework, and that the same appearance model can be successfully used to obtain results that are competitive against specialised methods for most of the tasks considered.

Paper
Code

BundleTrack: 6D Pose Tracking for Novel Objects without Instance or Category-Level 3D Models

wenbowen123/BundleTrack • • 1 Aug 2021

Most prior efforts, however, often assume that the target object's CAD model, at least at a category-level, is available for offline training or during online template matching.

Paper
Code

Video Object Tracking

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result