Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

The success of deep learning in vision can be attributed to: (a) models with high capacity; (b) increased computational power; and (c) the availability of large-scale labeled data. Since 2012, there have been significant advances in the representation capabilities of models and in the computational power of GPUs. But the size of the biggest dataset has, surprisingly, remained constant. What will happen if we increase the dataset size by 10x or 100x? This paper takes a step towards clearing the clouds of mystery surrounding the relationship between `enormous data' and visual deep learning. By exploiting the JFT-300M dataset, which has more than 375M noisy labels for 300M images, we investigate how the performance of current vision tasks would change if this data were used for representation learning. Our paper delivers some surprising (and some expected) findings. First, we find that performance on vision tasks increases logarithmically with the volume of training data. Second, we show that representation learning (or pre-training) still holds a lot of promise: one can improve performance on many vision tasks just by training a better base model. Finally, as expected, we present new state-of-the-art results for different vision tasks including image classification, object detection, semantic segmentation, and human pose estimation. Our sincere hope is that this inspires the vision community not to undervalue data and to develop collective efforts in building larger datasets.
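The logarithmic scaling claim can be made concrete with a small curve-fitting sketch. The data points below are purely illustrative placeholders (they are not the paper's reported numbers, except that 79.2% at 300M matches the ImageNet result listed further down); the sketch just shows what "performance grows logarithmically with data volume" means operationally: fit accuracy against log10 of the dataset size and read off the gain per decade of data.

```python
import numpy as np

# Illustrative (assumed) top-1 accuracies at increasing pre-training set
# sizes -- placeholders for the paper's trend, not its exact measurements.
sizes = np.array([10e6, 30e6, 100e6, 300e6])   # number of training images
acc = np.array([70.0, 73.1, 76.4, 79.2])       # top-1 accuracy in %

# Fit acc ~= a * log10(N) + b, the logarithmic trend the paper observes.
a, b = np.polyfit(np.log10(sizes), acc, 1)

# Extrapolate (cautiously) one decade further, to a hypothetical 3B images.
pred_3b = a * np.log10(3e9) + b
print(f"gain per decade of data: {a:.2f} points; 3B-image extrapolation: {pred_3b:.1f}%")
```

Under this fit, each 10x increase in data buys a roughly constant number of accuracy points, which is exactly why the abstract argues that 10x or 100x more data should still help rather than saturate.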

ICCV 2017

Datasets


Introduced in the Paper:

JFT-300M

Used in the Paper:

ImageNet, MS COCO, PASCAL VOC 2007
| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Object Detection | COCO test-dev | Faster R-CNN (ImageNet+300M) | box mAP | 37.4 | # 201 |
| | | | AP50 | 58.0 | # 138 |
| | | | AP75 | 40.1 | # 140 |
| | | | APS | 17.5 | # 133 |
| | | | APM | 41.1 | # 128 |
| | | | APL | 51.2 | # 122 |
| Pose Estimation | COCO test-dev | Faster R-CNN (ImageNet+300M) | AP | 64.4 | # 38 |
| | | | AP50 | 85.7 | # 38 |
| | | | AP75 | 70.7 | # 34 |
| | | | APL | 69.8 | # 37 |
| | | | APM | 61.8 | # 31 |
| Image Classification | ImageNet | ResNet-101 (JFT-300M Finetuning) | Top 1 Accuracy | 79.2% | # 710 |
| Semantic Segmentation | PASCAL VOC 2007 | DeepLabv3 (ImageNet+300M) | Mean IoU | 81.3 | # 2 |
| Semantic Segmentation | PASCAL VOC 2012 val | DeepLabv3 (ImageNet+300M) | mIoU | 76.5% | # 19 |