TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
2D Object Detection	CeyMo	TOOD	mAP	65.6	# 3
Object Detection	COCO test-dev	TAL + TAP	box mAP	42.5	# 169
Object Detection	COCO test-dev	TAL + TAP	AP50	60.3	# 129
Object Detection	COCO test-dev	TAL + TAP	AP75	46.4	# 110

TOOD: Task-aligned One-stage Object Detection

ICCV 2021 · Chengjian Feng, Yujie Zhong, Yu Gao, Matthew R. Scott, Weilin Huang

One-stage object detection is commonly implemented by optimizing two sub-tasks, object classification and localization, using heads with two parallel branches, which can lead to spatial misalignment between the predictions of the two tasks. In this work, we propose Task-aligned One-stage Object Detection (TOOD), which explicitly aligns the two tasks in a learning-based manner. First, we design a novel Task-aligned Head (T-Head) that offers a better balance between learning task-interactive and task-specific features, as well as greater flexibility to learn the alignment via a task-aligned predictor. Second, we propose Task Alignment Learning (TAL) to explicitly pull closer (or even unify) the optimal anchors for the two tasks during training via a designed sample assignment scheme and a task-aligned loss. Extensive experiments are conducted on MS-COCO, where TOOD achieves 51.1 AP at single-model single-scale testing. This surpasses recent one-stage detectors such as ATSS (47.7 AP), GFL (48.2 AP), and PAA (49.0 AP) by a large margin, with fewer parameters and FLOPs. Qualitative results also demonstrate the effectiveness of TOOD in better aligning the tasks of object classification and localization. Code is available at https://github.com/fcjian/TOOD.
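
The abstract does not spell out the sample assignment rule used by TAL. As a rough illustration only, the sketch below assumes an alignment metric of the form t = s^alpha * u^beta, where s is an anchor's classification score and u is its IoU with the ground-truth box, and then keeps the top-k anchors by t as positives. The metric form, the values alpha = 1 and beta = 6, the toy numbers, and the function names `alignment_metric` and `select_positive_anchors` are all assumptions made for this example; they are not taken from the text above or from the linked repository.

```python
def alignment_metric(cls_score, iou, alpha=1.0, beta=6.0):
    """Assumed metric t = s**alpha * u**beta.

    It is large only when both the classification score (s) and the
    localization quality (u, the IoU) are high for the same anchor.
    """
    return (cls_score ** alpha) * (iou ** beta)


def select_positive_anchors(cls_scores, ious, top_k=2, alpha=1.0, beta=6.0):
    """Rank candidate anchors by the assumed metric and keep the top_k."""
    metrics = [
        alignment_metric(s, u, alpha, beta) for s, u in zip(cls_scores, ious)
    ]
    ranked = sorted(range(len(metrics)), key=lambda i: metrics[i], reverse=True)
    return ranked[:top_k], metrics


# Toy data (hypothetical): one ground-truth box, four candidate anchors.
cls_scores = [0.9, 0.6, 0.8, 0.3]
ious = [0.5, 0.9, 0.8, 0.6]

positives, metrics = select_positive_anchors(cls_scores, ious, top_k=2)
print(positives)  # -> [1, 2]
```

In this toy case anchor 0 has the highest classification score but a poor IoU, so it is not selected. Under the stated assumptions, this is the kind of misaligned anchor that a task-aligned assignment would avoid treating as a positive.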