GroupNorm + WS

# Weight Standardization

## Introduction

[ALGORITHM]

```
@article{weightstandardization,
  author    = {Siyuan Qiao and Huiyu Wang and Chenxi Liu and Wei Shen and Alan Yuille},
  title     = {Weight Standardization},
  journal   = {arXiv preprint arXiv:1903.10520},
  year      = {2019},
}
```
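
As a rough illustration of the technique named in the heading: Weight Standardization, as described in the cited paper, rescales each convolution filter so that its weights have zero mean and unit variance before the filter is applied. The sketch below shows this on plain Python lists. The function names, the flattened-filter layout, the `eps` value, and the use of the unbiased standard deviation are choices made for this example, not details taken from this page.

```python
import math


def standardize_filter(weights, eps=1e-5):
    """Standardize one output channel's flattened weights (hypothetical helper)."""
    n = len(weights)
    mean = sum(weights) / n
    # Unbiased (n - 1) variance is an assumption of this sketch.
    var = sum((w - mean) ** 2 for w in weights) / (n - 1)
    std = math.sqrt(var)
    # eps guards against division by zero for constant filters.
    return [(w - mean) / (std + eps) for w in weights]


def weight_standardize(kernel, eps=1e-5):
    """kernel: one flat list per output channel, covering in_channels * kh * kw."""
    return [standardize_filter(f, eps) for f in kernel]


# Two toy filters with four weights each.
kernel = [
    [0.2, -0.1, 0.4, 0.3],
    [1.0, 2.0, 3.0, 4.0],
]
standardized = weight_standardize(kernel)
```

In a real detector the same operation would be applied to the 4-D weight tensor of each convolution on every forward pass, with GroupNorm then normalizing the activations.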
        
## Results and Models
        
### Faster R-CNN

| Backbone  | Style   | Normalization | Lr schd | Mem (GB) | Inf time (fps) | box AP | mask AP | Config | Download |
|:---------:|:-------:|:-------------:|:-------:|:--------:|:--------------:|:------:|:-------:|:------:|:--------:|
| R-50-FPN  | pytorch | GN+WS         | 1x      | 5.9      | 11.7           | 39.7   | -       | [config](https://github.com/open-mmlab/mmdetection/tree/master/configs/gn%2Bws/faster_rcnn_r50_fpn_gn_ws-all_1x_coco.py) | [model](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/faster_rcnn_r50_fpn_gn_ws-all_1x_coco/faster_rcnn_r50_fpn_gn_ws-all_1x_coco_20200130-613d9fe2.pth) &#124; [log](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/faster_rcnn_r50_fpn_gn_ws-all_1x_coco/faster_rcnn_r50_fpn_gn_ws-all_1x_coco_20200130_210936.log.json) |
| R-101-FPN | pytorch | GN+WS         | 1x      | 8.9      | 9.0            | 41.7   | -       | [config](https://github.com/open-mmlab/mmdetection/tree/master/configs/gn%2Bws/faster_rcnn_r101_fpn_gn_ws-all_1x_coco.py) | [model](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/faster_rcnn_r101_fpn_gn_ws-all_1x_coco/faster_rcnn_r101_fpn_gn_ws-all_1x_coco_20200205-a93b0d75.pth) &#124; [log](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/faster_rcnn_r101_fpn_gn_ws-all_1x_coco/faster_rcnn_r101_fpn_gn_ws-all_1x_coco_20200205_232146.log.json) |
| X-50-32x4d-FPN | pytorch | GN+WS    | 1x      | 7.0      | 10.3           | 40.7   | -       | [config](https://github.com/open-mmlab/mmdetection/tree/master/configs/gn%2Bws/faster_rcnn_x50_32x4d_fpn_gn_ws-all_1x_coco.py) | [model](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/faster_rcnn_x50_32x4d_fpn_gn_ws-all_1x_coco/faster_rcnn_x50_32x4d_fpn_gn_ws-all_1x_coco_20200203-839c5d9d.pth) &#124; [log](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/faster_rcnn_x50_32x4d_fpn_gn_ws-all_1x_coco/faster_rcnn_x50_32x4d_fpn_gn_ws-all_1x_coco_20200203_220113.log.json) |
| X-101-32x4d-FPN | pytorch | GN+WS   | 1x      | 10.8     | 7.6            | 42.1   | -       | [config](https://github.com/open-mmlab/mmdetection/tree/master/configs/gn%2Bws/faster_rcnn_x101_32x4d_fpn_gn_ws-all_1x_coco.py) | [model](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/faster_rcnn_x101_32x4d_fpn_gn_ws-all_1x_coco/faster_rcnn_x101_32x4d_fpn_gn_ws-all_1x_coco_20200212-27da1bc2.pth) &#124; [log](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/faster_rcnn_x101_32x4d_fpn_gn_ws-all_1x_coco/faster_rcnn_x101_32x4d_fpn_gn_ws-all_1x_coco_20200212_195302.log.json) |
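
The configs linked in the table combine GroupNorm with weight-standardized convolutions. The fragment below is a sketch of how such a setting is commonly written in mmdetection's Python config style. The key names and values (`conv_cfg`, `norm_cfg`, `ConvWS`, `num_groups=32`) are recalled from mmdetection conventions and are assumptions to check against the linked files, not a copy of them.

```python
# Sketch of a GN+WS config fragment (assumed key names; verify against the linked configs).
conv_cfg = dict(type='ConvWS')  # assumed registry name for the weight-standardized conv
norm_cfg = dict(type='GN', num_groups=32, requires_grad=True)  # GroupNorm; 32 groups assumed

# Pass the same two settings to each component that builds conv layers.
model = dict(
    backbone=dict(conv_cfg=conv_cfg, norm_cfg=norm_cfg),
    neck=dict(conv_cfg=conv_cfg, norm_cfg=norm_cfg),
)
```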
        
### Mask R-CNN
        
| Backbone  | Style   | Normalization | Lr schd   | Mem (GB) | Inf time (fps) | box AP | mask AP | Config | Download |
|:---------:|:-------:|:-------------:|:---------:|:--------:|:--------------:|:------:|:-------:|:------:|:--------:|
| R-50-FPN  | pytorch | GN+WS         | 2x        | 7.3      | 10.5       | 40.6        | 36.6    | [config](https://github.com/open-mmlab/mmdetection/tree/master/configs/gn%2Bws/mask_rcnn_r50_fpn_gn_ws-all_2x_coco.py) | [model](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_r50_fpn_gn_ws-all_2x_coco/mask_rcnn_r50_fpn_gn_ws-all_2x_coco_20200226-16acb762.pth) &#124; [log](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_r50_fpn_gn_ws-all_2x_coco/mask_rcnn_r50_fpn_gn_ws-all_2x_coco_20200226_062128.log.json) |
| R-101-FPN | pytorch | GN+WS         | 2x        | 10.3     | 8.6        | 42.0        | 37.7    | [config](https://github.com/open-mmlab/mmdetection/tree/master/configs/gn%2Bws/mask_rcnn_r101_fpn_gn_ws-all_2x_coco.py) | [model](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_r101_fpn_gn_ws-all_2x_coco/mask_rcnn_r101_fpn_gn_ws-all_2x_coco_20200212-ea357cd9.pth) &#124; [log](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_r101_fpn_gn_ws-all_2x_coco/mask_rcnn_r101_fpn_gn_ws-all_2x_coco_20200212_213627.log.json) |
| X-50-32x4d-FPN | pytorch | GN+WS    | 2x        | 8.4      | 9.3       | 41.1        | 37.0    | [config](https://github.com/open-mmlab/mmdetection/tree/master/configs/gn%2Bws/mask_rcnn_x50_32x4d_fpn_gn_ws-all_2x_coco.py) | [model](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_x50_32x4d_fpn_gn_ws-all_2x_coco/mask_rcnn_x50_32x4d_fpn_gn_ws-all_2x_coco_20200216-649fdb6f.pth) &#124; [log](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_x50_32x4d_fpn_gn_ws-all_2x_coco/mask_rcnn_x50_32x4d_fpn_gn_ws-all_2x_coco_20200216_201500.log.json) |
| X-101-32x4d-FPN | pytorch | GN+WS   | 2x        | 12.2     | 7.1       | 42.1        | 37.9    | [config](https://github.com/open-mmlab/mmdetection/tree/master/configs/gn%2Bws/mask_rcnn_x101_32x4d_fpn_gn_ws-all_2x_coco.py) | [model](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_x101_32x4d_fpn_gn_ws-all_2x_coco/mask_rcnn_x101_32x4d_fpn_gn_ws-all_2x_coco_20200319-33fb95b5.pth) &#124; [log](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_x101_32x4d_fpn_gn_ws-all_2x_coco/mask_rcnn_x101_32x4d_fpn_gn_ws-all_2x_coco_20200319_104101.log.json) |
| R-50-FPN  | pytorch | GN+WS         | 20-23-24e | 7.3      | -        | 41.1        | 37.1    | [config](https://github.com/open-mmlab/mmdetection/tree/master/configs/gn%2Bws/mask_rcnn_r50_fpn_gn_ws-all_20_23_24e_coco.py) | [model](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_r50_fpn_gn_ws-all_20_23_24e_coco/mask_rcnn_r50_fpn_gn_ws-all_20_23_24e_coco_20200213-487d1283.pth) &#124; [log](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_r50_fpn_gn_ws-all_20_23_24e_coco/mask_rcnn_r50_fpn_gn_ws-all_20_23_24e_coco_20200213_035123.log.json) |
| R-101-FPN | pytorch | GN+WS         | 20-23-24e | 10.3     | -        | 43.1        | 38.6    | [config](https://github.com/open-mmlab/mmdetection/tree/master/configs/gn%2Bws/mask_rcnn_r101_fpn_gn_ws-all_20_23_24e_coco.py) | [model](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_r101_fpn_gn_ws-all_20_23_24e_coco/mask_rcnn_r101_fpn_gn_ws-all_20_23_24e_coco_20200213-57b5a50f.pth) &#124; [log](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_r101_fpn_gn_ws-all_20_23_24e_coco/mask_rcnn_r101_fpn_gn_ws-all_20_23_24e_coco_20200213_130142.log.json) |
| X-50-32x4d-FPN | pytorch | GN+WS    | 20-23-24e | 8.4      | -        | 42.1        | 38.0    | [config](https://github.com/open-mmlab/mmdetection/tree/master/configs/gn%2Bws/mask_rcnn_x50_32x4d_fpn_gn_ws-all_20_23_24e_coco.py) | [model](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_x50_32x4d_fpn_gn_ws-all_20_23_24e_coco/mask_rcnn_x50_32x4d_fpn_gn_ws-all_20_23_24e_coco_20200226-969bcb2c.pth) &#124; [log](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_x50_32x4d_fpn_gn_ws-all_20_23_24e_coco/mask_rcnn_x50_32x4d_fpn_gn_ws-all_20_23_24e_coco_20200226_093732.log.json) |
| X-101-32x4d-FPN | pytorch | GN+WS   | 20-23-24e | 12.2     | -        | 42.7        | 38.5    | [config](https://github.com/open-mmlab/mmdetection/tree/master/configs/gn%2Bws/mask_rcnn_x101_32x4d_fpn_gn_ws-all_20_23_24e_coco.py) | [model](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_x101_32x4d_fpn_gn_ws-all_20_23_24e_coco/mask_rcnn_x101_32x4d_fpn_gn_ws-all_20_23_24e_coco_20200316-e6cd35ef.pth) &#124; [log](http://download.openmmlab.com/mmdetection/v2.0/gn%2Bws/mask_rcnn_x101_32x4d_fpn_gn_ws-all_20_23_24e_coco/mask_rcnn_x101_32x4d_fpn_gn_ws-all_20_23_24e_coco_20200316_013741.log.json) |
        
Note:
        
- GN+WS requires about 5% more memory than GN and is only about 5% slower.
- The paper uses a 20-23-24e lr schedule (24 epochs, with the learning rate decayed at epochs 20 and 23) instead of 2x.
- The X-50-GN and X-101-GN pretrained models are also shared by the authors.
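
To make the difference between the 20-23-24e and 2x schedules concrete, here is a minimal sketch of a step-decay learning rate. The function name, the base rate of 0.02, the decay factor of 0.1, and the 2x milestones (16, 22) are assumptions based on common mmdetection defaults, and warmup is ignored.

```python
def lr_at_epoch(epoch, base_lr=0.02, steps=(20, 23), gamma=0.1):
    """Step-decay learning rate for a 1-indexed epoch (hypothetical helper)."""
    # Count the decay milestones that have already been completed.
    num_decays = sum(1 for s in steps if epoch > s)
    return base_lr * gamma ** num_decays


# 20-23-24e: decay after epochs 20 and 23, 24 epochs in total.
schedule_20_23_24e = [lr_at_epoch(e) for e in range(1, 25)]

# 2x (assumed milestones): decay after epochs 16 and 22, 24 epochs in total.
schedule_2x = [lr_at_epoch(e, steps=(16, 22)) for e in range(1, 25)]
```

Under these assumptions both schedules train for 24 epochs; 20-23-24e simply keeps the initial learning rate for four more epochs before the first decay.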


| Model | box AP |
|:------|:------:|
| Mask R-CNN GroupNorm + WS (R-101-FPN, 20-23-24e, pytorch) | 43.1 |
| Mask R-CNN GroupNorm + WS (X-101-32x4d-FPN, 20-23-24e, pytorch) | 42.7 |
| Mask R-CNN GroupNorm + WS (X-101-32x4d-FPN, 2x, pytorch) | 42.1 |
| Faster R-CNN GroupNorm + WS (X-101-32x4d-FPN, 1x, pytorch) | 42.1 |
| Mask R-CNN GroupNorm + WS (X-50-32x4d-FPN, 20-23-24e, pytorch) | 42.1 |
| Mask R-CNN GroupNorm + WS (R-101-FPN, 2x, pytorch) | 42.0 |
| Faster R-CNN GroupNorm + WS (R-101-FPN, 1x, pytorch) | 41.7 |
| Mask R-CNN GroupNorm + WS (R-50-FPN, 20-23-24e, pytorch) | 41.1 |
| Mask R-CNN GroupNorm + WS (X-50-32x4d-FPN, 2x, pytorch) | 41.1 |
| Faster R-CNN GroupNorm + WS (X-50-32x4d-FPN, 1x, pytorch) | 40.7 |
| Mask R-CNN GroupNorm + WS (R-50-FPN, 2x, pytorch) | 40.6 |
| Faster R-CNN GroupNorm + WS (R-50-FPN, 1x, pytorch) | 39.7 |

| Model | mask AP |
|:------|:-------:|
| Mask R-CNN GroupNorm + WS (R-101-FPN, 20-23-24e, pytorch) | 38.6 |
| Mask R-CNN GroupNorm + WS (X-101-32x4d-FPN, 20-23-24e, pytorch) | 38.5 |
| Mask R-CNN GroupNorm + WS (X-50-32x4d-FPN, 20-23-24e, pytorch) | 38.0 |
| Mask R-CNN GroupNorm + WS (X-101-32x4d-FPN, 2x, pytorch) | 37.9 |
| Mask R-CNN GroupNorm + WS (R-101-FPN, 2x, pytorch) | 37.7 |
| Mask R-CNN GroupNorm + WS (R-50-FPN, 20-23-24e, pytorch) | 37.1 |
| Mask R-CNN GroupNorm + WS (X-50-32x4d-FPN, 2x, pytorch) | 37.0 |
| Mask R-CNN GroupNorm + WS (R-50-FPN, 2x, pytorch) | 36.6 |
