TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Object Detection	EPIC KITCHENS-seen splits	Temporal ROI Align	mAP	42.2	# 1
Video Object Detection	EPIC KITCHENS-unseen splits	Temporal ROI Align	mAP	39.6	# 1
Video Object Detection	ImageNet VID	Temporal ROI Align (ResNeXt101)	mAP	84.3	# 15
Video Instance Segmentation	YouTube-VIS	Temporal ROI Align	mask AP	38	# 1

Temporal RoI Align for Video Object Recognition

8 Sep 2021 · Tao Gong, Kai Chen, Xinjiang Wang, Qi Chu, Feng Zhu, Dahua Lin, Nenghai Yu, Huamin Feng

Video object detection is challenging when object appearance deteriorates in certain video frames, so it is natural to aggregate temporal information from other frames of the same video into the current frame. However, RoI Align, one of the core procedures of video detectors, still extracts features for proposals from a single-frame feature map, so the extracted RoI features lack temporal information from the video. In this work, considering that the features of the same object instance are highly similar across frames in a video, a novel Temporal RoI Align operator is proposed that uses feature similarity to extract features from other frames' feature maps for current-frame proposals. The proposed Temporal RoI Align operator can extract temporal information from the entire video for proposals. We integrate it into single-frame video detectors and other state-of-the-art video detectors, and conduct quantitative experiments to demonstrate that the proposed operator consistently and significantly boosts performance. In addition, the proposed Temporal RoI Align can also be applied to video instance segmentation. Code is available at https://github.com/open-mmlab/mmtracking
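The abstract describes the operator only at a high level, so the following NumPy sketch is one plausible reading of the idea, not the authors' implementation. It assumes cosine similarity as the similarity measure, a hypothetical `top_k` parameter for how many support-frame locations feed each RoI cell, softmax weighting of those locations, and a plain mean for the final temporal fusion. None of these choices are stated in the abstract, and the function names are illustrative only.

```python
import numpy as np


def similarity_guided_roi_features(roi_feat, support_maps, top_k=2):
    """Rebuild a current-frame RoI feature from other frames by feature similarity.

    roi_feat:     (C, h, w) RoI feature extracted from the current frame.
    support_maps: (T, C, H, W) feature maps of T other frames.
    Returns:      (T, C, h, w), one similarity-guided RoI feature per support frame.

    Assumptions (not from the abstract): cosine similarity, top_k selection,
    softmax weighting over the selected locations.
    """
    C, h, w = roi_feat.shape
    T, _, H, W = support_maps.shape

    # One query vector per RoI cell, L2-normalised for cosine similarity.
    q = roi_feat.reshape(C, h * w).T
    q = q / (np.linalg.norm(q, axis=1, keepdims=True) + 1e-8)

    out = np.empty((T, C, h, w), dtype=np.float64)
    for t in range(T):
        keys = support_maps[t].reshape(C, H * W).T          # (H*W, C)
        keys_n = keys / (np.linalg.norm(keys, axis=1, keepdims=True) + 1e-8)

        sim = q @ keys_n.T                                   # (h*w, H*W)
        idx = np.argsort(-sim, axis=1)[:, :top_k]            # most similar locations
        top_sim = np.take_along_axis(sim, idx, axis=1)       # (h*w, top_k)

        # Softmax over the selected locations (numerically stabilised).
        wts = np.exp(top_sim - top_sim.max(axis=1, keepdims=True))
        wts /= wts.sum(axis=1, keepdims=True)

        gathered = keys[idx]                                 # (h*w, top_k, C)
        agg = (wts[..., None] * gathered).sum(axis=1)        # (h*w, C)
        out[t] = agg.T.reshape(C, h, w)
    return out


def aggregate_over_time(roi_feat, temporal_feats):
    """Fuse the current-frame RoI feature with the per-frame features.

    A plain mean is used here as a stand-in for whatever temporal
    aggregation the real operator applies.
    """
    stacked = np.concatenate([roi_feat[None], temporal_feats], axis=0)
    return stacked.mean(axis=0)
```

As a sanity check on the sketch: if every support frame contains an exact copy of the RoI's feature vectors and `top_k=1`, each RoI cell selects its own copy and the fused feature equals the original RoI feature.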

Code

open-mmlab/mmtracking official

Tasks

Instance Segmentation

Object

Object Detection

Object Recognition

Semantic Segmentation

Video Instance Segmentation

Video Object Detection

Datasets

YouTube-VIS 2019

ImageNet VID

Results from the Paper

Ranked #1 on Video Instance Segmentation on YouTube-VIS

Methods

ALIGN • Temporal ROIAlign
