RegNetY

Last updated on Feb 14, 2021

regnety_002

Parameters 3 Million
FLOPs 256 Million
File Size 12.19 MB
Training Data ImageNet
Training Resources 8x NVIDIA V100 GPUs
Training Time

Training Techniques SGD with Momentum, Weight Decay
Architecture 1x1 Convolution, Batch Normalization, Convolution, Dense Connections, Global Average Pooling, Grouped Convolution, ReLU, Squeeze-and-Excitation Block
ID regnety_002
Epochs 100
Crop Pct 0.875
Momentum 0.9
Batch Size 1024
Image Size 224
Weight Decay 0.00005
Interpolation bicubic
SHOW MORE
SHOW LESS
regnety_004

Parameters 4 Million
FLOPs 516 Million
File Size 16.73 MB
Training Data ImageNet
Training Resources 8x NVIDIA V100 GPUs
Training Time

Training Techniques SGD with Momentum, Weight Decay
Architecture 1x1 Convolution, Batch Normalization, Convolution, Dense Connections, Global Average Pooling, Grouped Convolution, ReLU, Squeeze-and-Excitation Block
ID regnety_004
Epochs 100
Crop Pct 0.875
Momentum 0.9
Batch Size 1024
Image Size 224
Weight Decay 0.00005
Interpolation bicubic
SHOW MORE
SHOW LESS
regnety_006

Parameters 6 Million
FLOPs 772 Million
File Size 23.26 MB
Training Data ImageNet
Training Resources 8x NVIDIA V100 GPUs
Training Time

Training Techniques SGD with Momentum, Weight Decay
Architecture 1x1 Convolution, Batch Normalization, Convolution, Dense Connections, Global Average Pooling, Grouped Convolution, ReLU, Squeeze-and-Excitation Block
ID regnety_006
Epochs 100
Crop Pct 0.875
Momentum 0.9
Batch Size 1024
Image Size 224
Weight Decay 0.00005
Interpolation bicubic
SHOW MORE
SHOW LESS
regnety_008

Parameters 6 Million
FLOPs 1 Billion
File Size 24.05 MB
Training Data ImageNet
Training Resources 8x NVIDIA V100 GPUs
Training Time

Training Techniques SGD with Momentum, Weight Decay
Architecture 1x1 Convolution, Batch Normalization, Convolution, Dense Connections, Global Average Pooling, Grouped Convolution, ReLU, Squeeze-and-Excitation Block
ID regnety_008
Epochs 100
Crop Pct 0.875
Momentum 0.9
Batch Size 1024
Image Size 224
Weight Decay 0.00005
Interpolation bicubic
SHOW MORE
SHOW LESS
regnety_016

Parameters 11 Million
FLOPs 2 Billion
File Size 43.03 MB
Training Data ImageNet
Training Resources 8x NVIDIA V100 GPUs
Training Time

Training Techniques SGD with Momentum, Weight Decay
Architecture 1x1 Convolution, Batch Normalization, Convolution, Dense Connections, Global Average Pooling, Grouped Convolution, ReLU, Squeeze-and-Excitation Block
ID regnety_016
Epochs 100
Crop Pct 0.875
Momentum 0.9
Batch Size 1024
Image Size 224
Weight Decay 0.00005
Interpolation bicubic
SHOW MORE
SHOW LESS
regnety_032

Parameters 19 Million
FLOPs 4 Billion
File Size 74.47 MB
Training Data ImageNet
Training Resources 8x NVIDIA V100 GPUs
Training Time

Training Techniques SGD with Momentum, Weight Decay
Architecture 1x1 Convolution, Batch Normalization, Convolution, Dense Connections, Global Average Pooling, Grouped Convolution, ReLU, Squeeze-and-Excitation Block
ID regnety_032
Epochs 100
Crop Pct 0.875
Momentum 0.9
Batch Size 512
Image Size 224
Weight Decay 0.00005
Interpolation bicubic
SHOW MORE
SHOW LESS
regnety_040

Parameters 21 Million
FLOPs 5 Billion
File Size 79.07 MB
Training Data ImageNet
Training Resources 8x NVIDIA V100 GPUs
Training Time

Training Techniques SGD with Momentum, Weight Decay
Architecture 1x1 Convolution, Batch Normalization, Convolution, Dense Connections, Global Average Pooling, Grouped Convolution, ReLU, Squeeze-and-Excitation Block
ID regnety_040
Epochs 100
Crop Pct 0.875
Momentum 0.9
Batch Size 512
Image Size 224
Weight Decay 0.00005
Interpolation bicubic
SHOW MORE
SHOW LESS
regnety_064

Parameters 31 Million
FLOPs 8 Billion
File Size 117.06 MB
Training Data ImageNet
Training Resources 8x NVIDIA V100 GPUs
Training Time

Training Techniques SGD with Momentum, Weight Decay
Architecture 1x1 Convolution, Batch Normalization, Convolution, Dense Connections, Global Average Pooling, Grouped Convolution, ReLU, Squeeze-and-Excitation Block
ID regnety_064
Epochs 100
Crop Pct 0.875
Momentum 0.9
Batch Size 512
Image Size 224
Weight Decay 0.00005
Interpolation bicubic
SHOW MORE
SHOW LESS
regnety_080

Parameters 39 Million
FLOPs 10 Billion
File Size 149.85 MB
Training Data ImageNet
Training Resources 8x NVIDIA V100 GPUs
Training Time

Training Techniques SGD with Momentum, Weight Decay
Architecture 1x1 Convolution, Batch Normalization, Convolution, Dense Connections, Global Average Pooling, Grouped Convolution, ReLU, Squeeze-and-Excitation Block
ID regnety_080
Epochs 100
Crop Pct 0.875
Momentum 0.9
Batch Size 512
Image Size 224
Weight Decay 0.00005
Interpolation bicubic
SHOW MORE
SHOW LESS
regnety_120

Parameters 52 Million
FLOPs 16 Billion
File Size 198.12 MB
Training Data ImageNet
Training Resources 8x NVIDIA V100 GPUs
Training Time

Training Techniques SGD with Momentum, Weight Decay
Architecture 1x1 Convolution, Batch Normalization, Convolution, Dense Connections, Global Average Pooling, Grouped Convolution, ReLU, Squeeze-and-Excitation Block
ID regnety_120
Epochs 100
Crop Pct 0.875
Momentum 0.9
Batch Size 512
Image Size 224
Weight Decay 0.00005
Interpolation bicubic
SHOW MORE
SHOW LESS
regnety_160

Parameters 84 Million
FLOPs 20 Billion
File Size 319.40 MB
Training Data ImageNet
Training Resources 8x NVIDIA V100 GPUs
Training Time

Training Techniques SGD with Momentum, Weight Decay
Architecture 1x1 Convolution, Batch Normalization, Convolution, Dense Connections, Global Average Pooling, Grouped Convolution, ReLU, Squeeze-and-Excitation Block
ID regnety_160
Epochs 100
Crop Pct 0.875
Momentum 0.9
Batch Size 512
Image Size 224
Weight Decay 0.00005
Interpolation bicubic
SHOW MORE
SHOW LESS
regnety_320

Parameters 145 Million
FLOPs 41 Billion
File Size 553.98 MB
Training Data ImageNet
Training Resources 8x NVIDIA V100 GPUs
Training Time

Training Techniques SGD with Momentum, Weight Decay
Architecture 1x1 Convolution, Batch Normalization, Convolution, Dense Connections, Global Average Pooling, Grouped Convolution, ReLU, Squeeze-and-Excitation Block
ID regnety_320
Epochs 100
Crop Pct 0.875
Momentum 0.9
Batch Size 256
Image Size 224
Weight Decay 0.00005
Interpolation bicubic
SHOW MORE
SHOW LESS
README.md

Summary

RegNetY is a convolutional network design space with simple, regular models with parameters: depth $d$, initial width $w_{0} > 0$, and slope $w_{a} > 0$, and generates a different block width $u_{j}$ for each block $j < d$. The key restriction for the RegNet types of model is that there is a linear parameterisation of block widths (the design space only contains models with this linear structure):

$$ u_{j} = w_{0} + w_{a}\cdot{j} $$

For RegNetX authors have additional restrictions: we set $b = 1$ (the bottleneck ratio), $12 \leq d \leq 28$, and $w_{m} \geq 2$ (the width multiplier).

For RegNetY authors make one change, which is to include Squeeze-and-Excitation blocks.

How do I load this model?

To load a pretrained model:

import timm
m = timm.create_model('regnety_008', pretrained=True)
m.eval()

Replace the model name with the variant you want to use, e.g. regnety_008. You can find the IDs in the model summaries at the top of this page.

How do I train this model?

You can follow the timm recipe scripts for training a new model afresh.

Citation

@misc{radosavovic2020designing,
      title={Designing Network Design Spaces}, 
      author={Ilija Radosavovic and Raj Prateek Kosaraju and Ross Girshick and Kaiming He and Piotr Dollár},
      year={2020},
      eprint={2003.13678},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Results

Image Classification on ImageNet

Image Classification on ImageNet
MODEL TOP 1 ACCURACY TOP 5 ACCURACY
regnety_032 82.01% 95.91%
regnety_320 80.8% 95.25%
regnety_120 80.38% 95.12%
regnety_160 80.28% 94.97%
regnety_080 79.87% 94.83%
regnety_064 79.73% 94.76%
regnety_040 79.23% 94.64%
regnety_016 77.87% 93.73%
regnety_008 76.32% 93.07%
regnety_006 75.27% 92.53%
regnety_004 74.02% 91.76%
regnety_002 70.28% 89.55%