RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network

This work introduces RevSilo, the first reversible bidirectional multi-scale feature fusion module. Like other reversible methods, RevSilo eliminates the need to store hidden activations by recomputing them. However, existing reversible methods do not apply to multi-scale feature fusion and are, therefore, not applicable to a large class of networks. Bidirectional multi-scale feature fusion promotes local and global coherence and has become a de facto design principle for networks targeting spatially sensitive tasks, e.g., HRNet (Sun et al., 2019a) and EfficientDet (Tan et al., 2020). These networks achieve state-of-the-art results across various computer vision tasks when paired with high-resolution inputs. However, training them requires substantial accelerator memory for saving large, multi-resolution activations. These memory requirements inherently cap the size of neural networks, limiting improvements that come from scale. Operating across resolution scales, RevSilo alleviates these issues. Stacking RevSilos, we create RevBiFPN, a fully reversible bidirectional feature pyramid network. RevBiFPN is competitive with networks such as EfficientNet while using up to 19.8x lesser training memory for image classification. When fine-tuned on MS COCO, RevBiFPN provides up to a 2.5% boost in AP over HRNet using fewer MACs and a 2.4x reduction in training-time memory.

PDF Abstract

Datasets


Results from the Paper


Ranked #313 on Image Classification on ImageNet (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Image Classification ImageNet RevBiFPN-S6 Top 1 Accuracy 84.2% # 313
Number of params 142.3M # 880
GFLOPs 38.1 # 409
Image Classification ImageNet RevBiFPN-S1 Top 1 Accuracy 75.9% # 857
Number of params 5.11M # 409
GFLOPs 0.62 # 72
Image Classification ImageNet RevBiFPN-S5 Top 1 Accuracy 83.7% # 365
Number of params 82M # 808
GFLOPs 21.8 # 369
Image Classification ImageNet RevBiFPN-S4 Top 1 Accuracy 83% # 437
Number of params 48.7M # 718
GFLOPs 10.6 # 302
Image Classification ImageNet RevBiFPN-S3 Top 1 Accuracy 81.1% # 607
Number of params 19.6M # 535
GFLOPs 3.33 # 177
Image Classification ImageNet RevBiFPN-S2 Top 1 Accuracy 79% # 727
Number of params 10.6M # 480
GFLOPs 1.37 # 127
Image Classification ImageNet RevBiFPN-S0 Top 1 Accuracy 72.8% # 918
Number of params 3.42M # 375
GFLOPs 0.31 # 30

Methods