Improving Image Restoration by Revisiting Global Information Aggregation

8 Dec 2021  ·  Xiaojie Chu, Liangyu Chen, Chengpeng Chen, Xin Lu ·

Global operations, such as global average pooling, are widely used in top-performance image restorers. They aggregate global information from input features along entire spatial dimensions but behave differently during training and inference in image restoration tasks: they are based on different regions, namely the cropped patches (from images) and the full-resolution images. This paper revisits global information aggregation and finds that the image-based features during inference have a different distribution than the patch-based features during training. This train-test inconsistency negatively impacts the performance of models, which is severely overlooked by previous works. To reduce the inconsistency and improve test-time performance, we propose a simple method called Test-time Local Converter (TLC). Our TLC converts global operations to local ones only during inference so that they aggregate features within local spatial regions rather than the entire large images. The proposed method can be applied to various global modules (e.g., normalization, channel and spatial attention) with negligible costs. Without the need for any fine-tuning, TLC improves state-of-the-art results on several image restoration tasks, including single-image motion deblurring, video deblurring, defocus deblurring, and image denoising. In particular, with TLC, our Restormer-Local improves the state-of-the-art result in single image deblurring from 32.92 dB to 33.57 dB on GoPro dataset. The code is available at https://github.com/megvii-research/tlc.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Image Defocus Deblurring DPD Restormer-TLC Combined PSNR 26.24 # 3
Combined SSIM 0.825 # 1
LPIPS 0.168 # 2
Image Defocus Deblurring DPD (Dual-view) Restormer - TLC PSNR 27.02 # 2
SSIM 0.847 # 2
Deblurring GoPro Restormer-Local PSNR 33.57 # 9
SSIM 0.966 # 8
Deblurring GoPro MPRNet-local PSNR 33.31 # 13
SSIM 0.964 # 10
Deblurring GoPro RNN-MBP-Local PSNR 33.8 # 7
SSIM 0.966 # 8
Image Deblurring GoPro HINet-TLC PSNR 33.08 # 17
SSIM 0.962 # 17
Image Deblurring GoPro MPRNet-TLC PSNR 33.31 # 11
SSIM 0.964 # 11
Params (M) 20.1 # 10
Image Deblurring GoPro Restormer-TLC PSNR 33.57 # 10
SSIM 0.966 # 10
Params (M) 26.13 # 12
Deblurring GoPro HINet-local PSNR 33.08 # 16
SSIM 0.962 # 17
Deblurring HIDE (trained on GOPRO) Restormer-TLC PSNR (sRGB) 31.49 # 4
SSIM (sRGB) 0.945 # 5
Params (M) 26.13 # 8
Deblurring HIDE (trained on GOPRO) MPRNet-TLC PSNR (sRGB) 31.19 # 8
SSIM (sRGB) 0.942 # 8
Params (M) 20.1 # 5
Deblurring MSU BASED MPR local SSIM 0.94542 # 5
PSNR 31.65037 # 2
VMAF 67.01788 # 3
LPIPS 0.08323 # 7
ERQAv2.0 0.74521 # 4
Subjective 0.4407 # 8
Grayscale Image Denoising urban100 sigma15 Restormer-Local PSNR 33.85 # 1
Grayscale Image Denoising Urban100 sigma25 Restormer-Local PSNR 31.55 # 1
Color Image Denoising Urban100 sigma30 Restormer-Local PSNR 33.06 # 1
Color Image Denoising Urban100 sigma50 Restormer-Local PSNR 30.17 # 1
Grayscale Image Denoising Urban100 sigma50 Restormer-Local PSNR 28.41 # 1

Methods