TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Object Detection	CrowdHuman (full body)	DDQ DETR (R50)	AP	93.8	# 3
Object Detection	CrowdHuman (full body)	DDQ DETR (R50)	mMR	39.7	# 3
Object Detection	CrowdHuman (full body)	DDQ DETR (R50)	Recall	98.7	# 1
Object Detection	CrowdHuman (full body)	DDQ FCN (R50 One-Stage)	AP	92.7	# 6
Object Detection	CrowdHuman (full body)	DDQ FCN (R50 One-Stage)	mMR	41.0	# 7
Object Detection	CrowdHuman (full body)	DDQ FCN (R50 One-Stage)	Recall	98.2	# 3
Object Detection	CrowdHuman (full body)	DDQ R-CNN (R50)	AP	93.5	# 4
Object Detection	CrowdHuman (full body)	DDQ R-CNN (R50)	mMR	40.4	# 5
Object Detection	CrowdHuman (full body)	DDQ R-CNN (R50)	Recall	98.6	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dense-distinct-query-for-end-to-end-object/object-detection-on-crowdhuman-full-body)](https://paperswithcode.com/sota/object-detection-on-crowdhuman-full-body?p=dense-distinct-query-for-end-to-end-object)`

Dense Distinct Query for End-to-End Object Detection

CVPR 2023 · Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang, Chengqi Lyu, Wenwei Zhang, Ping Luo, Kai Chen

One-to-one label assignment in object detection has successfully obviated the need for non-maximum suppression (NMS) as postprocessing and made the pipeline end-to-end. However, it triggers a new dilemma: the widely used sparse queries cannot guarantee a high recall, while dense queries inevitably bring more similar queries and encounter optimization difficulties. Since both sparse and dense queries are problematic, what are the expected queries in end-to-end object detection? This paper shows that the solution should be Dense Distinct Queries (DDQ). Concretely, we first lay dense queries like traditional detectors and then select distinct ones for one-to-one assignments. DDQ blends the advantages of traditional and recent end-to-end detectors and significantly improves the performance of various detectors including FCN, R-CNN, and DETRs. Most impressively, DDQ-DETR achieves 52.1 AP on the MS-COCO dataset within 12 epochs using a ResNet-50 backbone, outperforming all existing detectors in the same setting. DDQ also shares the benefit of end-to-end detectors in crowded scenes and achieves 93.8 AP on CrowdHuman. We hope DDQ can inspire researchers to consider the complementarity between traditional methods and end-to-end detectors. The source code can be found at https://github.com/jshilong/DDQ.
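
The "lay dense queries, then select distinct ones" step can be illustrated with a small sketch. The abstract does not say how distinctness is decided, so the code below assumes a class-agnostic, IoU-based greedy filter over scored boxes. The function name `select_distinct_queries`, the threshold of 0.7, and the cap of 300 queries are hypothetical choices for illustration, not values taken from the paper or its repository.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def select_distinct_queries(
    boxes: List[Box],
    scores: List[float],
    iou_threshold: float = 0.7,  # hypothetical value, chosen for illustration
    max_queries: int = 300,      # hypothetical value, chosen for illustration
) -> List[int]:
    """Return indices of dense queries that are mutually distinct.

    Assumption: a query is "distinct" if its box overlaps every
    higher-scoring kept query by at most `iou_threshold`.
    """
    # Visit queries from most to least confident.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    kept: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in kept):
            kept.append(i)
            if len(kept) == max_queries:
                break
    return kept


# Four dense queries; the second is a near-duplicate of the first.
dense_boxes = [
    (10.0, 10.0, 50.0, 50.0),
    (11.0, 11.0, 51.0, 51.0),
    (100.0, 100.0, 150.0, 150.0),
    (30.0, 10.0, 70.0, 50.0),
]
dense_scores = [0.9, 0.8, 0.7, 0.6]

distinct = select_distinct_queries(dense_boxes, dense_scores)
print(distinct)  # [0, 2, 3]
```

In this toy input the near-duplicate query is dropped, while the partially overlapping fourth box survives because its overlap with the first stays below the threshold. Only the surviving queries would then take part in one-to-one assignment.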

Code

jshilong/ddq official

236

Tasks

Object Detection

Datasets

MS COCO

CrowdHuman

Results from the Paper

Ranked #3 on Object Detection on CrowdHuman (full body)

Methods

Convolution • FCN • Max Pooling
