TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Object Detection	COCO minival	Co-DETR	box AP	65.9	# 1
Object Detection	COCO minival	Co-DETR	Params (M)	348	# 2
Object Detection	COCO minival	Co-DETR (Swin-L)	box AP	64.7	# 4
Object Detection	COCO minival	Co-DETR (Swin-L)	Params (M)	218	# 1
Object Detection	COCO test-dev	Co-DETR	box mAP	66.0	# 1
Object Detection	COCO test-dev	Co-DETR	Params (M)	348	# 5
Object Detection	COCO test-dev	Co-DETR (Swin-L)	box mAP	64.8	# 5
Object Detection	COCO test-dev	Co-DETR (Swin-L)	Params (M)	218	# 6
Object Detection	LVIS v1.0 minival	Co-DETR (single-scale)	box AP	72.0	# 1
Instance Segmentation	LVIS v1.0 val	Co-DETR (single-scale)	mask AP	56.0	# 1
Object Detection	LVIS v1.0 val	Co-DETR (single-scale)	box AP	68.0	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/detrs-with-collaborative-hybrid-assignments/object-detection-on-coco-minival)](https://paperswithcode.com/sota/object-detection-on-coco-minival?p=detrs-with-collaborative-hybrid-assignments)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/detrs-with-collaborative-hybrid-assignments/object-detection-on-coco)](https://paperswithcode.com/sota/object-detection-on-coco?p=detrs-with-collaborative-hybrid-assignments)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/detrs-with-collaborative-hybrid-assignments/object-detection-on-lvis-v1-0-minival)](https://paperswithcode.com/sota/object-detection-on-lvis-v1-0-minival?p=detrs-with-collaborative-hybrid-assignments)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/detrs-with-collaborative-hybrid-assignments/instance-segmentation-on-lvis-v1-0-val)](https://paperswithcode.com/sota/instance-segmentation-on-lvis-v1-0-val?p=detrs-with-collaborative-hybrid-assignments)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/detrs-with-collaborative-hybrid-assignments/object-detection-on-lvis-v1-0-val)](https://paperswithcode.com/sota/object-detection-on-lvis-v1-0-val?p=detrs-with-collaborative-hybrid-assignments)`

DETRs with Collaborative Hybrid Assignments Training

ICCV 2023 · Zhuofan Zong, Guanglu Song, Yu Liu ·

In this paper, we provide the observation that too few queries assigned as positive samples in DETR with one-to-one set matching leads to sparse supervision on the encoder's output which considerably hurt the discriminative feature learning of the encoder and vice visa for attention learning in the decoder. To alleviate this, we present a novel collaborative hybrid assignments training scheme, namely $\mathcal{C}$o-DETR, to learn more efficient and effective DETR-based detectors from versatile label assignment manners. This new training scheme can easily enhance the encoder's learning ability in end-to-end detectors by training the multiple parallel auxiliary heads supervised by one-to-many label assignments such as ATSS and Faster RCNN. In addition, we conduct extra customized positive queries by extracting the positive coordinates from these auxiliary heads to improve the training efficiency of positive samples in the decoder. In inference, these auxiliary heads are discarded and thus our method introduces no additional parameters and computational cost to the original detector while requiring no hand-crafted non-maximum suppression (NMS). We conduct extensive experiments to evaluate the effectiveness of the proposed approach on DETR variants, including DAB-DETR, Deformable-DETR, and DINO-Deformable-DETR. The state-of-the-art DINO-Deformable-DETR with Swin-L can be improved from 58.5% to 59.5% AP on COCO val. Surprisingly, incorporated with ViT-L backbone, we achieve 66.0% AP on COCO test-dev and 67.9% AP on LVIS val, outperforming previous methods by clear margins with much fewer model sizes. Codes are available at \url{https://github.com/Sense-X/Co-DETR}.

PDF Abstract ICCV 2023 PDF ICCV 2023 Abstract

Code

Add Remove Mark official

open-mmlab/mmdetection official

27,765

sense-x/co-detr official

784

code-implementation1/Code1

Tasks

Add Remove

Instance Segmentation

Object Detection

set matching

Datasets

ImageNet

MS COCO

LVIS

Objects365

Results from the Paper

Add Remove

Ranked #1 on Object Detection on LVIS v1.0 val (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Object Detection	COCO minival	Co-DETR	box AP	65.9	# 1	Compare
Object Detection	COCO minival	Co-DETR	Params (M)	348	# 2	Compare
Object Detection	COCO minival	Co-DETR (Swin-L)	box AP	64.7	# 4	Compare
Object Detection	COCO minival	Co-DETR (Swin-L)	Params (M)	218	# 1	Compare
Object Detection	COCO test-dev	Co-DETR	box mAP	66.0	# 1	Compare
Object Detection	COCO test-dev	Co-DETR	Params (M)	348	# 5	Compare
Object Detection	COCO test-dev	Co-DETR (Swin-L)	box mAP	64.8	# 5	Compare
Object Detection	COCO test-dev	Co-DETR (Swin-L)	Params (M)	218	# 6	Compare
Object Detection	LVIS v1.0 minival	Co-DETR (single-scale)	box AP	72.0	# 1	Compare
Instance Segmentation	LVIS v1.0 val	Co-DETR (single-scale)	mask AP	56.0	# 1	Compare
Object Detection	LVIS v1.0 val	Co-DETR (single-scale)	box AP	68.0	# 1	Compare

Methods

Add Remove

1x1 Convolution • Absolute Position Encodings • Adam • ATSS • BPE • Convolution • Dense Connections • Detr • Dropout • FCOS • Feedforward Network • FPN • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Non Maximum Suppression • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

DETRs with Collaborative Hybrid Assignments Training

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove