Rethinking Coarse-to-Fine Approach in Single Image Deblurring

Coarse-to-fine strategies have been extensively used for the architecture design of single image deblurring networks. Conventional methods typically stack sub-networks with multi-scale input images and gradually improve sharpness of images from the bottom sub-network to the top sub-network, yielding inevitably high computational costs. Toward a fast and accurate deblurring network design, we revisit the coarse-to-fine strategy and present a multi-input multi-output U-net (MIMO-UNet). The MIMO-UNet has three distinct features. First, the single encoder of the MIMO-UNet takes multi-scale input images to ease the difficulty of training. Second, the single decoder of the MIMO-UNet outputs multiple deblurred images with different scales to mimic multi-cascaded U-nets using a single U-shaped network. Last, asymmetric feature fusion is introduced to merge multi-scale features in an efficient manner. Extensive experiments on the GoPro and RealBlur datasets demonstrate that the proposed network outperforms the state-of-the-art methods in terms of both accuracy and computational complexity. Source code is available for research purposes at https://github.com/chosj95/MIMO-UNet.

PDF Abstract ICCV 2021 PDF ICCV 2021 Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Image Deblurring GoPro MIMO-UNet++ PSNR 32.68 # 22
SSIM 0.959 # 20
Params (M) 16.1 # 5
Deblurring GoPro MIMO-UNet++ PSNR 32.68 # 24
SSIM 0.959 # 21
Deblurring RealBlur-J MIMO-UNet++ SSIM (sRGB) 0.921 # 8
PSNR (sRGB) 32.05 # 6
Params(M) 16.1 # 4
Deblurring RSBlur MIMO-UNet+ Average PSNR 33.37 # 5
Deblurring RSBlur MIMO-UNet Average PSNR 32.73 # 6

Methods