TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Super-Resolution	Set5 - 2x upscaling	EDT-B	PSNR	38.63	# 5
Image Super-Resolution	Set5 - 2x upscaling	EDT-B	SSIM	0.9632	# 5
Image Super-Resolution	Set5 - 3x upscaling	EDT-B	PSNR	35.13	# 5
Image Super-Resolution	Set5 - 3x upscaling	EDT-B	SSIM	0.9328	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/on-efficient-transformer-and-image-pre/image-super-resolution-on-set5-2x-upscaling)](https://paperswithcode.com/sota/image-super-resolution-on-set5-2x-upscaling?p=on-efficient-transformer-and-image-pre)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/on-efficient-transformer-and-image-pre/image-super-resolution-on-set5-3x-upscaling)](https://paperswithcode.com/sota/image-super-resolution-on-set5-3x-upscaling?p=on-efficient-transformer-and-image-pre)`

On Efficient Transformer-Based Image Pre-training for Low-Level Vision

19 Dec 2021 · Wenbo Li, Xin Lu, Shengju Qian, Jiangbo Lu, Xiangyu Zhang, Jiaya Jia ·

Pre-training has marked numerous state of the arts in high-level computer vision, while few attempts have ever been made to investigate how pre-training acts in image processing systems. In this paper, we tailor transformer-based pre-training regimes that boost various low-level tasks. To comprehensively diagnose the influence of pre-training, we design a whole set of principled evaluation tools that uncover its effects on internal representations. The observations demonstrate that pre-training plays strikingly different roles in low-level tasks. For example, pre-training introduces more local information to higher layers in super-resolution (SR), yielding significant performance gains, while pre-training hardly affects internal feature representations in denoising, resulting in limited gains. Further, we explore different methods of pre-training, revealing that multi-related-task pre-training is more effective and data-efficient than other alternatives. Finally, we extend our study to varying data scales and model sizes, as well as comparisons between transformers and CNNs-based architectures. Based on the study, we successfully develop state-of-the-art models for multiple low-level tasks. Code is released at https://github.com/fenglinglwb/EDT.

PDF Abstract

Code

Add Remove Mark official

fenglinglwb/edt official

119

Tasks

Add Remove

Denoising

Image Super-Resolution

Super-Resolution

Datasets

ImageNet

Urban100

Set5

Manga109

Results from the Paper

Add Remove

Ranked #5 on Image Super-Resolution on Set5 - 2x upscaling (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Image Super-Resolution	Set5 - 2x upscaling	EDT-B	PSNR	38.63	# 5	Compare
Image Super-Resolution	Set5 - 2x upscaling	EDT-B	SSIM	0.9632	# 5	Compare
Image Super-Resolution	Set5 - 3x upscaling	EDT-B	PSNR	35.13	# 5	Compare
Image Super-Resolution	Set5 - 3x upscaling	EDT-B	SSIM	0.9328	# 4	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

On Efficient Transformer-Based Image Pre-training for Low-Level Vision

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove