Video Object Tracking

28 papers with code • 3 benchmarks • 11 datasets

Video Object Detection aims to detect targets in videos using both spatial and temporal information. It's usually deeply integrated with tasks such as Object Detection and Object Tracking.

Benchmarks

These leaderboards track progress in Video Object Tracking.

Dataset	Best Model
CATER	Loci
GOT-10k	TATrack-L-GOT
SoccerNet-v2	CO-MOT

Libraries

Use these libraries to find Video Object Tracking models and implementations.

open-mmlab/mmaction2 • 3 papers • 3,888 stars

Most implemented papers

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

open-mmlab/mmaction2 • CVPR 2017

The paucity of videos in current action classification datasets (UCF-101 and HMDB-51) has made it difficult to identify good video architectures, as most methods obtain similar performance on existing small-scale benchmarks.

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

wongkinyiu/yolov7 • CVPR 2023

YOLOv7 surpasses all known object detectors in both speed and accuracy in the range from 5 FPS to 160 FPS and has the highest accuracy, 56.8% AP, among all known real-time object detectors with 30 FPS or higher on GPU V100.

TSM: Temporal Shift Module for Efficient Video Understanding

MIT-HAN-LAB/temporal-shift-module • ICCV 2019

The explosive growth in video streaming gives rise to challenges on performing video understanding at high accuracy and low computation cost.

Video Polyp Segmentation: A Deep Learning Perspective

DengPingFan/PraNet • 27 Mar 2022

We present the first comprehensive video polyp segmentation (VPS) study in the deep learning era.

CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning

rohitgirdhar/CATER • 10 Oct 2019

In this work, we build a video dataset with fully observable and controllable object and scene bias, which truly requires spatiotemporal understanding in order to be solved.

Bridging the Gap Between End-to-end and Non-End-to-end Multi-Object Tracking

BingfengYan/CO-MOT • 22 May 2023

Existing end-to-end Multi-Object Tracking (e2e-MOT) methods have not surpassed non-end-to-end tracking-by-detection methods.

Learning Adaptive Discriminative Correlation Filters via Temporal Consistency Preserving Spatial Feature Selection for Robust Visual Tracking

XU-TIANYANG/LADCF • 30 Jul 2018

The key innovations of the proposed method include adaptive spatial feature selection and temporal consistent constraints, with which the new tracker enables joint spatial-temporal filter learning in a lower dimensional discriminative manifold.

Weakly Supervised Convolutional LSTM Approach for Tool Tracking in Laparoscopic Videos

CAMMA-public/ConvLSTM-Surgical-Tool-Tracker • 4 Dec 2018

Results: We build a baseline tracker on top of the CNN model and demonstrate that our approach based on the ConvLSTM outperforms the baseline in tool presence detection, spatial localization, and motion tracking by over 5.0%, 13.9%, and 12.6%, respectively.

SPARK: Spatial-aware Online Incremental Attack Against Visual Tracking

tsingqguo/AttackTracker • ECCV 2020

We identify that online object tracking poses two new challenges: 1) it is difficult to generate imperceptible perturbations that can transfer across frames, and 2) real-time trackers require the attack to satisfy a certain level of efficiency.

Argus: Efficient Activity Detection System for Extended Video Analysis

JunweiLiang/Object_Detection_Tracking • Proceedings of the IEEE Winter Conference on Applications of Computer Vision Workshops 2020

We propose an Efficient Activity Detection System, Argus, for Extended Video Analysis in the surveillance scenario.