Deep Networks with Stochastic Depth

30 Mar 2016  ·  Gao Huang, Yu Sun, Zhuang Liu, Daniel Sedra, Kilian Weinberger

Very deep convolutional networks with hundreds of layers have led to significant reductions in error on competitive benchmarks. Although the unmatched expressiveness of the many layers can be highly desirable at test time, training very deep networks comes with its own set of challenges. The gradients can vanish, the forward flow often diminishes, and the training time can be painfully slow. To address these problems, we propose stochastic depth, a training procedure that enables the seemingly contradictory setup of training short networks and using deep networks at test time. We start with very deep networks but, during training, for each mini-batch we randomly drop a subset of layers and bypass them with the identity function. This simple approach complements the recent success of residual networks. It reduces training time substantially and improves the test error significantly on almost all data sets that we used for evaluation. With stochastic depth we can increase the depth of residual networks even beyond 1200 layers and still yield meaningful improvements in test error (4.91% on CIFAR-10).
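
The drop/bypass logic is easy to sketch. Below is a minimal PyTorch-style sketch, not the authors' implementation: each residual block survives a mini-batch with probability p_l; a dropped block is replaced by the identity, and at test time every block is kept with its residual output scaled by p_l so activations match their training expectation. The names `StochasticDepthBlock`, `residual_fn`, and `linear_decay_survival` are illustrative; the linear decay of survival probabilities, p_l = 1 - (l/L)(1 - p_L), follows the rule described in the paper.

```python
import torch
import torch.nn as nn


def linear_decay_survival(block_index, num_blocks, p_last=0.5):
    """Linear-decay rule from the paper: survival probability falls
    linearly from 1.0 at the input to p_last at the final block."""
    return 1.0 - (block_index / num_blocks) * (1.0 - p_last)


class StochasticDepthBlock(nn.Module):
    """Residual block that is randomly bypassed during training.

    `residual_fn` is a hypothetical stand-in for the block's
    convolutional layers; only the stochastic-depth logic is sketched.
    """

    def __init__(self, residual_fn, survival_prob):
        super().__init__()
        self.residual_fn = residual_fn
        self.survival_prob = survival_prob

    def forward(self, x):
        if self.training:
            # Training: keep the block with probability survival_prob;
            # otherwise bypass it entirely with the identity function.
            if torch.rand(1).item() < self.survival_prob:
                return x + self.residual_fn(x)
            return x
        # Test time: always apply the block, scaling the residual by
        # its survival probability to match the training expectation.
        return x + self.survival_prob * self.residual_fn(x)
```

Because dropped blocks skip both their forward and backward passes, the expected network depth per mini-batch is shorter than the full depth, which is where the training-time savings come from.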

Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|------|---------|-------|-------------|--------------|-------------|
| Image Classification | CIFAR-10 | Stochastic Depth | Percentage correct | 94.77 | #136 |
| Image Classification | CIFAR-100 | Stochastic Depth | Percentage correct | 75.42 | #147 |
| Image Classification | SVHN | Stochastic Depth | Percentage error | 1.75 | #21 |