HRFormer: High-Resolution Transformer for Dense Prediction

18 Oct 2021  ·  Yuhui Yuan, Rao Fu, Lang Huang, WeiHong Lin, Chao Zhang, Xilin Chen, Jingdong Wang ·

We present a High-Resolution Transformer (HRFormer) that learns high-resolution representations for dense prediction tasks, in contrast to the original Vision Transformer that produces low-resolution representations and has high memory and computational cost. We take advantage of the multi-resolution parallel design introduced in high-resolution convolutional networks (HRNet), along with local-window self-attention that performs self-attention over small non-overlapping image windows, for improving the memory and computation efficiency. In addition, we introduce a convolution into the FFN to exchange information across the disconnected image windows. We demonstrate the effectiveness of the High-Resolution Transformer on both human pose estimation and semantic segmentation tasks, e.g., HRFormer outperforms Swin transformer by $1.3$ AP on COCO pose estimation with $50\%$ fewer parameters and $30\%$ fewer FLOPs. Code is available at: https://github.com/HRNet/HRFormer.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Pose Estimation AIC HRFormer (HRFomer-B) AP 34.4 # 3
AP50 78.3 # 1
AP75 24.8 # 1
AR 38.7 # 1
AR50 80.9 # 1
Pose Estimation COCO test-dev HRFormer-B AP 76.2 # 16
AP50 92.7 # 10
AP75 83.8 # 13
APL 82.3 # 10
APM 72.5 # 14
AR 81.2 # 14
Multi-Person Pose Estimation CrowdPose HRFormer-B mAP @0.5:0.95 72.4 # 5
AP Easy 80.0 # 5
AP Medium 73.5 # 5
AP Hard 62.4 # 5
Image Classification ImageNet HRFormer-T Top 1 Accuracy 78.5% # 760
Number of params 8.0M # 463
GFLOPs 1.8 # 138
Image Classification ImageNet HRFormer-B Top 1 Accuracy 82.8% # 453
Number of params 50.3M # 728
GFLOPs 13.7 # 330
Multi-Person Pose Estimation OCHuman HRFormer-B Validation AP 62.1 # 3
AP50 81.4 # 4
AP75 67.1 # 3

Results from Other Papers


Task Dataset Model Metric Name Metric Value Rank Source Paper Compare
Pose Estimation AIC HRFormer (HRFomer-S) AP 31.6 # 7
AP75 20.9 # 4
AR 35.8 # 4
AR50 78.0 # 4

Methods