TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semantic Segmentation	ADE20K	ConvNeXt V2-H (FCMAE)	Validation mIoU	55	# 45
Semantic Segmentation	ADE20K	ConvNeXt V2-L (Supervised)	Validation mIoU	51.6	# 87
Semantic Segmentation	ADE20K	Swin V2-H	Validation mIoU	54.2	# 59
Semantic Segmentation	ADE20K	Swin-L	Validation mIoU	53.5	# 74
Semantic Segmentation	ADE20K	ConvNeXt V2-L	Validation mIoU	53.7	# 68
Semantic Segmentation	ADE20K	ConvNeXt V1-L	Validation mIoU	50.5	# 104
Semantic Segmentation	ADE20K	ConvNeXt V2-B	Validation mIoU	52.1	# 83
Semantic Segmentation	ADE20K	Swin-B	Validation mIoU	52.8	# 80
Semantic Segmentation	ADE20K	ConvNeXt V1-B	Validation mIoU	49.9	# 116

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/convnext-v2-co-designing-and-scaling-convnets/semantic-segmentation-on-ade20k)](https://paperswithcode.com/sota/semantic-segmentation-on-ade20k?p=convnext-v2-co-designing-and-scaling-convnets)`

ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders

CVPR 2023 · Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen, Zhuang Liu, In So Kweon, Saining Xie ·

Driven by improved architectures and better representation learning frameworks, the field of visual recognition has enjoyed rapid modernization and performance boost in the early 2020s. For example, modern ConvNets, represented by ConvNeXt, have demonstrated strong performance in various scenarios. While these models were originally designed for supervised learning with ImageNet labels, they can also potentially benefit from self-supervised learning techniques such as masked autoencoders (MAE). However, we found that simply combining these two approaches leads to subpar performance. In this paper, we propose a fully convolutional masked autoencoder framework and a new Global Response Normalization (GRN) layer that can be added to the ConvNeXt architecture to enhance inter-channel feature competition. This co-design of self-supervised learning techniques and architectural improvement results in a new model family called ConvNeXt V2, which significantly improves the performance of pure ConvNets on various recognition benchmarks, including ImageNet classification, COCO detection, and ADE20K segmentation. We also provide pre-trained ConvNeXt V2 models of various sizes, ranging from an efficient 3.7M-parameter Atto model with 76.7% top-1 accuracy on ImageNet, to a 650M Huge model that achieves a state-of-the-art 88.9% accuracy using only public training data.

PDF Abstract CVPR 2023 PDF CVPR 2023 Abstract

Code

Add Remove Mark official

facebookresearch/convnext-v2 official

1,334

rwightman/pytorch-image-models

29,671

Westlake-AI/openmixup

567

leondgarse/keras_cv_attention_models

554

Jacky-Android/convnext-v2-pytorch

See all 10 implementations

Tasks

Add Remove

Object Detection

Representation Learning

Self-Supervised Learning

Semantic Segmentation

Datasets

MS COCO

ADE20K ImageNet-1K

Results from the Paper

Edit

Ranked #45 on Semantic Segmentation on ADE20K

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semantic Segmentation	ADE20K	ConvNeXt V2-H (FCMAE)	Validation mIoU	55	# 45	Compare
Semantic Segmentation	ADE20K	ConvNeXt V2-L (Supervised)	Validation mIoU	51.6	# 87	Compare
Semantic Segmentation	ADE20K	Swin V2-H	Validation mIoU	54.2	# 59	Compare
Semantic Segmentation	ADE20K	Swin-L	Validation mIoU	53.5	# 74	Compare
Semantic Segmentation	ADE20K	ConvNeXt V2-L	Validation mIoU	53.7	# 68	Compare
Semantic Segmentation	ADE20K	ConvNeXt V1-L	Validation mIoU	50.5	# 104	Compare
Semantic Segmentation	ADE20K	ConvNeXt V2-B	Validation mIoU	52.1	# 83	Compare
Semantic Segmentation	ADE20K	Swin-B	Validation mIoU	52.8	# 80	Compare
Semantic Segmentation	ADE20K	ConvNeXt V1-B	Validation mIoU	49.9	# 116	Compare

Methods

Add Remove

AutoEncoder • ConvNeXt

Edit Social Preview

ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove