TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	DEVA	Overall	86.2	# 5
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	DEVA	Jaccard (Seen)	85.4	# 4
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	DEVA	Jaccard (Unseen)	89.9	# 2
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	DEVA	F-Measure (Seen)	89.9	# 4
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	DEVA	F-Measure (Unseen)	89.1	# 4
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	DEVA	FPS	25.3	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/tracking-anything-in-high-quality/semi-supervised-video-object-segmentation-on-18)](https://paperswithcode.com/sota/semi-supervised-video-object-segmentation-on-18?p=tracking-anything-in-high-quality)`

Tracking Anything in High Quality

26 Jul 2023 · Jiawen Zhu, Zhenyu Chen, Zeqi Hao, Shijie Chang, Lu Zhang, Dong Wang, Huchuan Lu, Bin Luo, Jun-Yan He, Jin-Peng Lan, Hanyuan Chen, Chenyang Li ·

Visual object tracking is a fundamental video task in computer vision. Recently, the notably increasing power of perception algorithms allows the unification of single/multiobject and box/mask-based tracking. Among them, the Segment Anything Model (SAM) attracts much attention. In this report, we propose HQTrack, a framework for High Quality Tracking anything in videos. HQTrack mainly consists of a video multi-object segmenter (VMOS) and a mask refiner (MR). Given the object to be tracked in the initial frame of a video, VMOS propagates the object masks to the current frame. The mask results at this stage are not accurate enough since VMOS is trained on several closeset video object segmentation (VOS) datasets, which has limited ability to generalize to complex and corner scenes. To further improve the quality of tracking masks, a pretrained MR model is employed to refine the tracking results. As a compelling testament to the effectiveness of our paradigm, without employing any tricks such as test-time data augmentations and model ensemble, HQTrack ranks the 2nd place in the Visual Object Tracking and Segmentation (VOTS2023) challenge. Code and models are available at https://github.com/jiawen-zhu/HQTrack.

PDF Abstract

Code

Add Remove Mark official

jiawen-zhu/hqtrack official

733

Tasks

Add Remove

Object

Object Tracking

Semantic Segmentation

Semi-Supervised Video Object Segmentation

Video Object Segmentation

Video Semantic Segmentation

Visual Object Tracking

Datasets

YouTube-VOS 2018

OVIS

Results from the Paper

Edit

Ranked #5 on Semi-Supervised Video Object Segmentation on YouTube-VOS 2019 (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	DEVA	Overall	86.2	# 5	Compare
			Jaccard (Seen)	85.4	# 4	Compare
			Jaccard (Unseen)	89.9	# 2	Compare
			F-Measure (Seen)	89.9	# 4	Compare
			F-Measure (Unseen)	89.1	# 4	Compare
			FPS	25.3	# 4	Compare

Methods

Add Remove

1-bit Adam • Adam

Edit Social Preview

Tracking Anything in High Quality

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove