TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Quality Assessment	MSU FR VQA Database	AHIQ	SRCC	0.937	# 1
Video Quality Assessment	MSU FR VQA Database	AHIQ	SRCC	0.937	# 4
Video Quality Assessment	MSU FR VQA Database	AHIQ	KLCC	0.8015	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/attentions-help-cnns-see-better-attention/image-quality-assessment-on-msu-fr-vqa)](https://paperswithcode.com/sota/image-quality-assessment-on-msu-fr-vqa?p=attentions-help-cnns-see-better-attention)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/attentions-help-cnns-see-better-attention/video-quality-assessment-on-msu-video-quality-1)](https://paperswithcode.com/sota/video-quality-assessment-on-msu-video-quality-1?p=attentions-help-cnns-see-better-attention)`

Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network

22 Apr 2022 · Shanshan Lao, Yuan Gong, Shuwei Shi, Sidi Yang, Tianhe Wu, Jiahao Wang, Weihao Xia, Yujiu Yang ·

Image quality assessment (IQA) algorithm aims to quantify the human perception of image quality. Unfortunately, there is a performance drop when assessing the distortion images generated by generative adversarial network (GAN) with seemingly realistic texture. In this work, we conjecture that this maladaptation lies in the backbone of IQA models, where patch-level prediction methods use independent image patches as input to calculate their scores separately, but lack spatial relationship modeling among image patches. Therefore, we propose an Attention-based Hybrid Image Quality Assessment Network (AHIQ) to deal with the challenge and get better performance on the GAN-based IQA task. Firstly, we adopt a two-branch architecture, including a vision transformer (ViT) branch and a convolutional neural network (CNN) branch for feature extraction. The hybrid architecture combines interaction information among image patches captured by ViT and local texture details from CNN. To make the features from shallow CNN more focused on the visually salient region, a deformable convolution is applied with the help of semantic information from the ViT branch. Finally, we use a patch-wise score prediction module to obtain the final score. The experiments show that our model outperforms the state-of-the-art methods on four standard IQA datasets and AHIQ ranked first on the Full Reference (FR) track of the NTIRE 2022 Perceptual Image Quality Assessment Challenge.

PDF Abstract

Code

Add Remove Mark official

iigroup/ahiq official

iigroup/maniqa

255

Tasks

Add Remove

Generative Adversarial Network

Image Quality Assessment

Video Quality Assessment

Datasets

CSIQ

PIPAL

MSU FR VQA Database

Results from the Paper

Edit

Ranked #1 on Image Quality Assessment on MSU FR VQA Database

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Image Quality Assessment	MSU FR VQA Database	AHIQ	SRCC	0.937	# 1	Compare
Video Quality Assessment	MSU FR VQA Database	AHIQ	SRCC	0.937	# 4	Compare
Video Quality Assessment	MSU FR VQA Database	AHIQ	KLCC	0.8015	# 4	Compare

Methods

Add Remove

Convolution • Deformable Convolution • Dense Connections • Layer Normalization • Linear Layer • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • Softmax • Vision Transformer

Edit Social Preview

Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove