Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution

ICCV 2019 Yunpeng ChenHaoqi FanBing XuZhicheng YanYannis KalantidisMarcus RohrbachShuicheng YanJiashi Feng

In natural images, information is conveyed at different frequencies where higher frequencies are usually encoded with fine details and lower frequencies are usually encoded with global structures. Similarly, the output feature maps of a convolution layer can also be seen as a mixture of information at different frequencies... (read more)

PDF Abstract

Results from the Paper


TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT LEADERBOARD
Image Classification ImageNet Oct-ResNet-152 (SE) Top 1 Accuracy 82.9% # 33
Top 5 Accuracy 96.3% # 23
Number of params 67M # 2