Search Results for author: Xiongkuo Min

Found 98 papers, 51 papers with code

R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?

no code implementations7 Oct 2024 Chunyi Li, Jianbo Zhang, ZiCheng Zhang, HaoNing Wu, Yuan Tian, Wei Sun, Guo Lu, Xiaohong Liu, Xiongkuo Min, Weisi Lin, Guangtao Zhai

However, various corruptions in the real world mean that images will not be as ideal as in simulations, presenting significant challenges for the practical application of LMMs.

Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs

no code implementations30 Sep 2024 ZiCheng Zhang, Ziheng Jia, HaoNing Wu, Chunyi Li, Zijian Chen, Yingjie Zhou, Wei Sun, Xiaohong Liu, Xiongkuo Min, Weisi Lin, Guangtao Zhai

With the rising interest in research on Large Multi-modal Models (LMMs) for video understanding, many studies have emphasized general video comprehension capabilities, neglecting the systematic exploration into video quality understanding.

Benchmarking Multiple-choice +2

Subjective and Objective Quality-of-Experience Evaluation Study for Live Video Streaming

no code implementations26 Sep 2024 Zehao Zhu, Wei Sun, Jun Jia, Wei Wu, Sibin Deng, Kai Li, Ying Chen, Xiongkuo Min, Jia Wang, Guangtao Zhai

For the subjective QoE study, we introduce the first live video streaming QoE dataset, TaoLive QoE, which consists of $42$ source videos collected from real live broadcasts and $1, 155$ corresponding distorted ones degraded due to a variety of streaming distortions, including conventional streaming distortions such as compression, stalling, as well as live streaming-specific distortions like frame skipping, variable frame rate, etc.

Optical Flow Estimation

Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending

no code implementations17 Sep 2024 Yongyang Pan, Xiaohong Liu, Siqi Luo, Yi Xin, Xiao Guo, Xiaoming Liu, Xiongkuo Min, Guangtao Zhai

Rapid advancements in multimodal large language models have enabled the creation of hyper-realistic images from textual descriptions.

Explore the Hallucination on Low-level Perception for MLLMs

no code implementations15 Sep 2024 Yinan Sun, ZiCheng Zhang, HaoNing Wu, Xiaohong Liu, Weisi Lin, Guangtao Zhai, Xiongkuo Min

However, these models also exhibit hallucinations, which limit their reliability as AI systems, especially in tasks involving low-level visual perception and understanding.

Hallucination Question Answering +1

3DGCQA: A Quality Assessment Database for 3D AI-Generated Contents

no code implementations11 Sep 2024 Yingjie Zhou, ZiCheng Zhang, Farong Wen, Jun Jia, Yanwei Jiang, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai

To provide a valuable resource for future research and development in 3D content generation and quality assessment, the dataset has been open-sourced in https://github. com/zyj-2000/3DGCQA.

3D Generation Text to 3D

Exploring Rich Subjective Quality Information for Image Quality Assessment in the Wild

no code implementations9 Sep 2024 Xiongkuo Min, Yixuan Gao, Yuqin Cao, Guangtao Zhai, Wenjun Zhang, Huifang Sun, Chang Wen Chen

RichIQA is characterized by two key novel designs: (1) a three-stage image quality prediction network which exploits the powerful feature representation capability of the Convolutional vision Transformer (CvT) and mimics the short-term and long-term memory mechanisms of human brain; (2) a multi-label training strategy in which rich subjective quality information like MOS, SOS and DOS are concurrently used to train the quality prediction network.

Image Quality Assessment

Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency

1 code implementation1 Sep 2024 Wei Sun, Weixia Zhang, Yuqin Cao, Linhan Cao, Jun Jia, Zijian Chen, ZiCheng Zhang, Xiongkuo Min, Guangtao Zhai

To address this problem, we design a multi-branch deep neural network (DNN) to assess the quality of UHD images from three perspectives: global aesthetic characteristics, local technical distortions, and salient content perception.

4k Image Quality Assessment

LMM-VQA: Advancing Video Quality Assessment with Large Multimodal Models

no code implementations26 Aug 2024 Qihang Ge, Wei Sun, Yu Zhang, Yunhao Li, Zhongpeng Ji, Fengyu Sun, Shangling Jui, Xiongkuo Min, Guangtao Zhai

Then, we design a spatiotemporal vision encoder to extract spatial and temporal features to represent the quality characteristics of videos, which are subsequently mapped into the language space by the spatiotemporal projector for modality alignment.

Large Language Model Video Quality Assessment +1

How Does Audio Influence Visual Attention in Omnidirectional Videos? Database and Model

no code implementations10 Aug 2024 Yuxin Zhu, Huiyu Duan, Kaiwei Zhang, Yucheng Zhu, Xilei Zhu, Long Teng, Xiongkuo Min, Guangtao Zhai

To advance the research on audio-visual saliency prediction for ODVs, we further establish a new benchmark based on the AVS-ODV database by testing numerous state-of-the-art saliency models, including visual-only models and audio-visual models.

Saliency Prediction

SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression

no code implementations8 Aug 2024 Linhan Cao, Wei Sun, Xiongkuo Min, Jun Jia, ZiCheng Zhang, Zijian Chen, Yucheng Zhu, Lizhou Liu, Qiubo Chen, Jing Chen, Guangtao Zhai

Just noticeable distortion (JND), representing the threshold of distortion in an image that is minimally perceptible to the human visual system (HVS), is crucial for image compression algorithms to achieve a trade-off between transmission bit rate and image quality.

Image Compression

ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images

no code implementations31 Jul 2024 Xilei Zhu, Liu Yang, Huiyu Duan, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet

However, the corresponding image quality assessment (IQA) research for egocentric spatial images is still lacking.

Image Quality Assessment

Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model

no code implementations31 Jul 2024 Zhichao Zhang, Xinyue Li, Wei Sun, Jun Jia, Xiongkuo Min, ZiCheng Zhang, Chunyi Li, Zijian Chen, Puyi Wang, Zhongpeng Ji, Fengyu Sun, Shangling Jui, Guangtao Zhai

For the objective perspective, we establish a benchmark for evaluating existing quality assessment metrics on the LGVQ dataset, which reveals that current metrics perform poorly on the LGVQ dataset.

Benchmarking Large Language Model +4

UniProcessor: A Text-induced Unified Low-level Image Processor

1 code implementation30 Jul 2024 Huiyu Duan, Xiongkuo Min, Sijing Wu, Wei Shen, Guangtao Zhai

In this paper, we propose a text-induced unified image processor for low-level vision tasks, termed UniProcessor, which can effectively process various degradation types and levels, and support multimodal control.

Image Enhancement Image Restoration +1

UNQA: Unified No-Reference Quality Assessment for Audio, Image, Video, and Audio-Visual Content

no code implementations29 Jul 2024 Yuqin Cao, Xiongkuo Min, Yixuan Gao, Wei Sun, Weisi Lin, Guangtao Zhai

In this paper, we propose the Unified No-reference Quality Assessment model (UNQA) for audio, image, video, and A/V content, which tries to train a single QA model across different media modalities.

HazeCLIP: Towards Language Guided Real-World Image Dehazing

1 code implementation18 Jul 2024 Ruiyi Wang, Wenhao Li, Xiaohong Liu, Chunyi Li, ZiCheng Zhang, Xiongkuo Min, Guangtao Zhai

Existing methods have achieved remarkable performance in single image dehazing, particularly on synthetic datasets.

Image Dehazing Single Image Dehazing

GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval

1 code implementation17 Jul 2024 Han Zhou, Wei Dong, Xiaohong Liu, Shuaicheng Liu, Xiongkuo Min, Guangtao Zhai, Jun Chen

Most existing Low-light Image Enhancement (LLIE) methods either directly map Low-Light (LL) to Normal-Light (NL) images or use semantic or illumination maps as guides.

Decoder Low-Light Image Enhancement +5

Quality-guided Skin Tone Enhancement for Portrait Photography

no code implementations22 Jun 2024 Shiqi Gao, Huiyu Duan, Xinyue Li, Kang Fu, Yicong Peng, Qihang Xu, Yuanyuan Chang, Jia Wang, Xiongkuo Min, Guangtao Zhai

In this paper, we propose a quality-guided image enhancement paradigm that enables image enhancement models to learn the distribution of images with various quality ratings.

Image Enhancement

CMC-Bench: Towards a New Paradigm of Visual Signal Compression

1 code implementation13 Jun 2024 Chunyi Li, Xiele Wu, HaoNing Wu, Donghui Feng, ZiCheng Zhang, Guo Lu, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, Weisi Lin

With the development of Large Multimodal Models (LMMs), a Cross Modality Compression (CMC) paradigm of Image-Text-Image has emerged.

Image Compression

GAIA: Rethinking Action Quality Assessment for AI-Generated Videos

1 code implementation10 Jun 2024 Zijian Chen, Wei Sun, Yuan Tian, Jun Jia, ZiCheng Zhang, Jiarui Wang, Ru Huang, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

Assessing action quality is both imperative and challenging due to its significant impact on the quality of AI-generated videos, further complicated by the inherently ambiguous nature of actions within AI-generated video (AIGV).

Action Quality Assessment

A-Bench: Are LMMs Masters at Evaluating AI-generated Images?

1 code implementation5 Jun 2024 ZiCheng Zhang, HaoNing Wu, Chunyi Li, Yingjie Zhou, Wei Sun, Xiongkuo Min, Zijian Chen, Xiaohong Liu, Weisi Lin, Guangtao Zhai

How to accurately and efficiently assess AI-generated images (AIGIs) remains a critical challenge for generative models.

Dual-Branch Network for Portrait Image Quality Assessment

1 code implementation14 May 2024 Wei Sun, Weixia Zhang, Yanwei Jiang, HaoNing Wu, ZiCheng Zhang, Jun Jia, Yingjie Zhou, Zhongpeng Ji, Xiongkuo Min, Weisi Lin, Guangtao Zhai

We employ the fidelity loss to train the model via a learning-to-rank manner to mitigate inconsistencies in quality scores in the portrait image quality assessment dataset PIQ.

Image Quality Assessment Learning-To-Rank +2

Enhancing Blind Video Quality Assessment with Rich Quality-aware Features

1 code implementation14 May 2024 Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai

Motivated by previous researches that leverage pre-trained features extracted from various computer vision models as the feature representation for BVQA, we further explore rich quality-aware features from pre-trained blind image quality assessment (BIQA) and BVQA models as auxiliary features to help the BVQA model to handle complex distortions and diverse content of social media videos.

Video Quality Assessment

Understanding and Evaluating Human Preferences for AI Generated Images with Instruction Tuning

1 code implementation12 May 2024 Jiarui Wang, Huiyu Duan, Guangtao Zhai, Xiongkuo Min

Artificial Intelligence Generated Content (AIGC) has grown rapidly in recent years, among which AI-based image generation has gained widespread attention due to its efficient and imaginative image creation ability.

Image Generation Image Quality Assessment

LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM

1 code implementation28 Apr 2024 ZiCheng Zhang, HaoNing Wu, Yingjie Zhou, Chunyi Li, Wei Sun, Chaofeng Chen, Xiongkuo Min, Xiaohong Liu, Weisi Lin, Guangtao Zhai

Although large multi-modality models (LMMs) have seen extensive exploration and application in various quality assessment studies, their integration into Point Cloud Quality Assessment (PCQA) remains unexplored.

Point Cloud Quality Assessment

Large Multi-modality Model Assisted AI-Generated Image Quality Assessment

1 code implementation27 Apr 2024 Puyi Wang, Wei Sun, ZiCheng Zhang, Jun Jia, Yanwei Jiang, Zhichao Zhang, Xiongkuo Min, Guangtao Zhai

Traditional deep neural network (DNN)-based image quality assessment (IQA) models leverage convolutional neural networks (CNN) or Transformer to learn the quality-aware feature representation, achieving commendable performance on natural scene images.

Image Quality Assessment

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

no code implementations25 Apr 2024 Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, HaoNing Wu, Yixuan Gao, Yuqin Cao, ZiCheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng, Jianquan Yang, Weigang Wang, Xi Fang, Xiaoxin Lv, Jun Yan, Tianwu Zhi, Yabin Zhang, Yaohui Li, Yang Li, Jingwen Xu, Jianzhao Liu, Yiting Liao, Junlin Li, Zihao Yu, Yiting Lu, Xin Li, Hossein Motamednia, S. Farhad Hosseini-Benvidi, Fengbin Guan, Ahmad Mahmoudi-Aznaveh, Azadeh Mansouri, Ganzorig Gankhuyag, Kihwan Yoon, Yifang Xu, Haotian Fan, Fangyuan Kong, Shiling Zhao, Weifeng Dong, Haibing Yin, Li Zhu, Zhiling Wang, Bingchen Huang, Avinab Saha, Sandeep Mishra, Shashank Gupta, Rajesh Sureddi, Oindrila Saha, Luigi Celona, Simone Bianco, Paolo Napoletano, Raimondo Schettini, Junfeng Yang, Jing Fu, Wei zhang, Wenzhi Cao, Limei Liu, Han Peng, Weijun Yuan, Zhan Li, Yihang Cheng, Yifan Deng, Haohui Li, Bowen Qu, Yao Li, Shuqing Luo, Shunzhou Wang, Wei Gao, Zihao Lu, Marcos V. Conde, Xinrui Wang, Zhibo Chen, Ruling Liao, Yan Ye, Qiulin Wang, Bing Li, Zhaokun Zhou, Miao Geng, Rui Chen, Xin Tao, Xiaoyu Liang, Shangkun Sun, Xingyuan Ma, Jiaze Li, Mengduo Yang, Haoran Xu, Jie zhou, Shiding Zhu, Bohan Yu, Pengfei Chen, Xinrui Xu, Jiabin Shen, Zhichao Duan, Erfan Asadi, Jiahe Liu, Qi Yan, Youran Qu, Xiaohui Zeng, Lele Wang, Renjie Liao

A total of 196 participants have registered in the video track.

Image Quality Assessment Image Restoration +2

THQA: A Perceptual Quality Assessment Database for Talking Heads

1 code implementation13 Apr 2024 Yingjie Zhou, ZiCheng Zhang, Wei Sun, Xiaohong Liu, Xiongkuo Min, Zhihua Wang, Xiao-Ping Zhang, Guangtao Zhai

In the realm of media technology, digital humans have gained prominence due to rapid advancements in computer technology.

Video Quality Assessment

How is Visual Attention Influenced by Text Guidance? Database and Model

no code implementations11 Apr 2024 Yinan Sun, Xiongkuo Min, Huiyu Duan, Guangtao Zhai

Finally, considering the effect of text descriptions on visual attention, while most existing saliency models ignore this impact, we further propose a text-guided saliency (TGSal) prediction model, which extracts and integrates both image features and text features to predict the image saliency under various text-description conditions.

Saliency Prediction

AIGIQA-20K: A Large Database for AI-Generated Image Quality Assessment

no code implementations4 Apr 2024 Chunyi Li, Tengchuan Kou, Yixuan Gao, Yuqin Cao, Wei Sun, ZiCheng Zhang, Yingjie Zhou, Zhichao Zhang, Weixia Zhang, HaoNing Wu, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai

With the rapid advancements in AI-Generated Content (AIGC), AI-Generated Images (AIGIs) have been widely applied in entertainment, education, and social media.

Image Quality Assessment

Perceptual Video Quality Assessment: A Survey

no code implementations5 Feb 2024 Xiongkuo Min, Huiyu Duan, Wei Sun, Yucheng Zhu, Guangtao Zhai

Perceptual video quality assessment plays a vital role in the field of video processing due to the existence of quality degradations introduced in various stages of video signal acquisition, compression, transmission and display.

Survey Video Quality Assessment

Q-Refine: A Perceptual Quality Refiner for AI-Generated Image

1 code implementation2 Jan 2024 Chunyi Li, HaoNing Wu, ZiCheng Zhang, Hongkun Hao, Kaiwei Zhang, Lei Bai, Xiaohong Liu, Xiongkuo Min, Weisi Lin, Guangtao Zhai

With the rapid evolution of the Text-to-Image (T2I) model in recent years, their unsatisfactory generation result has become a challenge.

Image Quality Assessment

Perceptual Quality Assessment for Video Frame Interpolation

no code implementations25 Dec 2023 Jinliang Han, Xiongkuo Min, Yixuan Gao, Jun Jia, Lei Sun, Zuowei Cao, Yonglin Luo, Guangtao Zhai

To evaluate the quality of VFI frames without reference videos, a no-reference perceptual quality assessment method is proposed in this paper.

Full-Reference Image Quality Assessment Triplet +1

Exploring the Naturalness of AI-Generated Images

1 code implementation9 Dec 2023 Zijian Chen, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhongpeng Ji, Fengyu Sun, Shangling Jui, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

In this paper, we take the first step to benchmark and assess the visual naturalness of AI-generated images.

FS-BAND: A Frequency-Sensitive Banding Detector

no code implementations30 Nov 2023 Zijian Chen, Wei Sun, ZiCheng Zhang, Ru Huang, Fangfang Lu, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

Banding artifact, as known as staircase-like contour, is a common quality annoyance that happens in compression, transmission, etc.

Image Quality Assessment

BAND-2k: Banding Artifact Noticeable Database for Banding Detection and Quality Assessment

1 code implementation29 Nov 2023 Zijian Chen, Wei Sun, Jun Jia, Fangfang Lu, ZiCheng Zhang, Jing Liu, Ru Huang, Xiongkuo Min, Guangtao Zhai

The quality score of a banding image is generated by pooling the banding detection maps masked by the spatial frequency filters.

2k Image Quality Assessment +1

Audio-visual Saliency for Omnidirectional Videos

no code implementations9 Nov 2023 Yuxin Zhu, Xilei Zhu, Huiyu Duan, Jie Li, Kaiwei Zhang, Yucheng Zhu, Li Chen, Xiongkuo Min, Guangtao Zhai

Visual saliency prediction for omnidirectional videos (ODVs) has shown great significance and necessity for omnidirectional videos to help ODV coding, ODV transmission, ODV rendering, etc..

Saliency Prediction

A No-Reference Quality Assessment Method for Digital Human Head

no code implementations25 Oct 2023 Yingjie Zhou, ZiCheng Zhang, Wei Sun, Xiongkuo Min, Xianghe Ma, Guangtao Zhai

In this paper, we develop a novel no-reference (NR) method based on Transformer to deal with DHQA in a multi-task manner.

Geometry-Aware Video Quality Assessment for Dynamic Digital Human

no code implementations24 Oct 2023 ZiCheng Zhang, Yingjie Zhou, Wei Sun, Xiongkuo Min, Guangtao Zhai

Usually, DDHs are displayed as 2D rendered animation videos and it is natural to adapt video quality assessment (VQA) methods to DDH quality assessment (DDH-QA) tasks.

Attribute Video Quality Assessment +1

Joint Gaze-Location and Gaze-Object Detection

no code implementations26 Aug 2023 Danyang Tu, Wei Shen, Wei Sun, Xiongkuo Min, Guangtao Zhai

In contrast, we reframe the gaze following detection task as detecting human head locations and their gaze followings simultaneously, aiming at jointly detect human gaze location and gaze object in a unified and single-stage pipeline.

Object object-detection +1

StableVQA: A Deep No-Reference Quality Assessment Model for Video Stability

1 code implementation9 Aug 2023 Tengchuan Kou, Xiaohong Liu, Wei Sun, Jun Jia, Xiongkuo Min, Guangtao Zhai, Ning Liu

Indeed, most existing quality assessment models evaluate video quality as a whole without specifically taking the subjective experience of video stability into consideration.

Video Quality Assessment Video Stabilization +1

Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models

1 code implementation26 Jul 2023 Wei Sun, Wen Wen, Xiongkuo Min, Long Lan, Guangtao Zhai, Kede Ma

By minimalistic, we restrict our family of BVQA models to build only upon basic blocks: a video preprocessor (for aggressive spatiotemporal downsampling), a spatial quality analyzer, an optional temporal quality analyzer, and a quality regressor, all with the simplest possible instantiations.

Video Quality Assessment Visual Question Answering (VQA)

Perceptual Quality Assessment of Omnidirectional Audio-visual Signals

1 code implementation20 Jul 2023 Xilei Zhu, Huiyu Duan, Yuqin Cao, Yuxin Zhu, Yucheng Zhu, Jing Liu, Li Chen, Xiongkuo Min, Guangtao Zhai

Omnidirectional videos (ODVs) play an increasingly important role in the application fields of medical, education, advertising, tourism, etc.

Blind Image Quality Assessment: A Fuzzy Neural Network for Opinion Score Distribution Prediction

1 code implementation IEEE Transactions on Circuits and Systems for Video Technology 2023 Yixuan Gao, Xiongkuo Min, Yucheng Zhu, Xiao-Ping Zhang, Guangtao Zhai

On the other hand, we also prove the feasibility of the proposed method in predicting the MOS of image quality on several popular IQA databases, including CSIQ, TID2013, LIVE MD, and LIVE Challenge.

Subjective and Objective Audio-Visual Quality Assessment for User Generated Content

1 code implementation IEEE Transactions on Image Processing 2023 Yuqin Cao, Xiongkuo Min, Wei Sun, Guangtao Zhai

Then, to facilitate the development of AVQA fields, we construct a benchmark of AVQA models on the proposed SJTU-UAV database and other two AVQA databases, of which the benchmark models consist of AVQA models designed for synthetically distorted A/V sequences and AVQA models built through combining the popular VQA methods and audio features via support vector regressor (SVR).

Video Quality Assessment Visual Question Answering (VQA)

Advancing Zero-Shot Digital Human Quality Assessment through Text-Prompted Evaluation

1 code implementation6 Jul 2023 ZiCheng Zhang, Wei Sun, Yingjie Zhou, HaoNing Wu, Chunyi Li, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, Weisi Lin

To address this gap, we propose SJTU-H3D, a subjective quality assessment database specifically designed for full-body digital humans.

AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI Generated Images: from the Perspectives of Quality, Authenticity and Correspondence

1 code implementation1 Jul 2023 Jiarui Wang, Huiyu Duan, Jing Liu, Shi Chen, Xiongkuo Min, Guangtao Zhai

In this paper, in order to get a better understanding of the human visual preferences for AIGIs, a large-scale IQA database for AIGC is established, which is named as AIGCIQA2023.

Image Quality Assessment Text-to-Image Generation

GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality Assessment

1 code implementation9 Jun 2023 ZiCheng Zhang, Wei Sun, Houning Wu, Yingjie Zhou, Chunyi Li, Xiongkuo Min, Guangtao Zhai, Weisi Lin

Model-based 3DQA methods extract features directly from the 3D models, which are characterized by their high degree of complexity.

Point Cloud Quality Assessment

AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment

1 code implementation7 Jun 2023 Chunyi Li, ZiCheng Zhang, HaoNing Wu, Wei Sun, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, Weisi Lin

With the rapid advancements of the text-to-image generative model, AI-generated images (AGIs) have been widely applied to entertainment, education, social media, etc.

Image Quality Assessment

Masked Autoencoders as Image Processors

1 code implementation30 Mar 2023 Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Long Teng, Jia Wang, Guangtao Zhai

Recently, masked autoencoders (MAE) for feature pre-training have further unleashed the potential of Transformers, leading to state-of-the-art performances on various high-level vision tasks.

Deblurring Image Defocus Deblurring +2

MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos

1 code implementation CVPR 2023 ZiCheng Zhang, Wei Wu, Wei Sun, Dangyang Tu, Wei Lu, Xiongkuo Min, Ying Chen, Guangtao Zhai

User-generated content (UGC) live videos are often bothered by various distortions during capture procedures and thus exhibit diverse visual qualities.

Video Quality Assessment Visual Question Answering (VQA)

A Perceptual Quality Assessment Exploration for AIGC Images

1 code implementation22 Mar 2023 ZiCheng Zhang, Chunyi Li, Wei Sun, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai

\underline{AI} \underline{G}enerated \underline{C}ontent (\textbf{AIGC}) has gained widespread attention with the increasing efficiency of deep learning in content creation.

Image Quality Assessment

VDPVE: VQA Dataset for Perceptual Video Enhancement

1 code implementation16 Mar 2023 Yixuan Gao, Yuqin Cao, Tengchuan Kou, Wei Sun, Yunlong Dong, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai

Few researchers have specifically proposed a video quality assessment method for video enhancement, and there is also no comprehensive video quality assessment dataset available in public.

Deblurring valid +3

Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images

1 code implementation14 Mar 2023 ZiCheng Zhang, Wei Sun, Yingjie Zhou, Jun Jia, Zhichao Zhang, Jing Liu, Xiongkuo Min, Guangtao Zhai

Computer graphics images (CGIs) are artificially generated by means of computer programs and are widely perceived under various scenarios, such as games, streaming media, etc.

Audio-Visual Quality Assessment for User Generated Content: Database and Method

no code implementations4 Mar 2023 Yuqin Cao, Xiongkuo Min, Wei Sun, XiaoPing Zhang, Guangtao Zhai

Specifically, we construct the first UGC AVQA database named the SJTU-UAV database, which includes 520 in-the-wild UGC audio and video (A/V) sequences, and conduct a user study to obtain the mean opinion scores of the A/V sequences.

Video Quality Assessment Visual Question Answering (VQA)

EEP-3DQA: Efficient and Effective Projection-based 3D Model Quality Assessment

no code implementations17 Feb 2023 ZiCheng Zhang, Wei Sun, Yingjie Zhou, Wei Lu, Yucheng Zhu, Xiongkuo Min, Guangtao Zhai

Currently, great numbers of efforts have been put into improving the effectiveness of 3D model quality assessment (3DQA) methods.

DDH-QA: A Dynamic Digital Humans Quality Assessment Database

1 code implementation24 Dec 2022 ZiCheng Zhang, Yingjie Zhou, Wei Sun, Wei Lu, Xiongkuo Min, Yu Wang, Guangtao Zhai

In recent years, large amounts of effort have been put into pushing forward the real-world application of dynamic digital human (DDH).

Video Quality Assessment

Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop

1 code implementation3 Oct 2022 Weixia Zhang, Dingquan Li, Xiongkuo Min, Guangtao Zhai, Guodong Guo, Xiaokang Yang, Kede Ma

No-reference image quality assessment (NR-IQA) aims to quantify how humans perceive visual distortions of digital images without access to their undistorted references.

Perceptual Quality Assessment for Digital Human Heads

1 code implementation20 Sep 2022 ZiCheng Zhang, Yingjie Zhou, Wei Sun, Xiongkuo Min, Yuzhe Wu, Guangtao Zhai

Digital humans are attracting more and more research interest during the last decade, the generation, representation, rendering, and animation of which have been put into large amounts of effort.

MM-PCQA: Multi-Modal Learning for No-reference Point Cloud Quality Assessment

1 code implementation1 Sep 2022 ZiCheng Zhang, Wei Sun, Xiongkuo Min, Quan Zhou, Jun He, Qiyuan Wang, Guangtao Zhai

In specific, we split the point clouds into sub-models to represent local geometry distortions such as point shift and down-sampling.

Point Cloud Quality Assessment

Evaluating Point Cloud from Moving Camera Videos: A No-Reference Metric

1 code implementation30 Aug 2022 ZiCheng Zhang, Wei Sun, Yucheng Zhu, Xiongkuo Min, Wei Wu, Ying Chen, Guangtao Zhai

To tackle the challenge of point cloud quality assessment (PCQA), many PCQA methods have been proposed to evaluate the visual quality levels of point clouds by assessing the rendered static 2D projections.

Image Quality Assessment Point Cloud Quality Assessment +2

Perceptual Quality Assessment of Omnidirectional Images

no code implementations6 Jul 2022 Huiyu Duan, Guangtao Zhai, Xiongkuo Min, Yucheng Zhu, Yi Fang, Xiaokang Yang

The original and distorted omnidirectional images, subjective quality ratings, and the head and eye movement data together constitute the OIQA database.

Image Quality Assessment

Subjective Quality Assessment for Images Generated by Computer Graphics

no code implementations10 Jun 2022 Tao Wang, ZiCheng Zhang, Wei Sun, Xiongkuo Min, Wei Lu, Guangtao Zhai

However, limited work has been put forward to tackle the problem of computer graphics generated images' quality assessment (CG-IQA).

NR-IQA

A No-reference Quality Assessment Metric for Point Cloud Based on Captured Video Sequences

no code implementations9 Jun 2022 Yu Fan, ZiCheng Zhang, Wei Sun, Xiongkuo Min, Wei Lu, Tao Wang, Ning Liu, Guangtao Zhai

Point cloud is one of the most widely used digital formats of 3D models, the visual quality of which is quite sensitive to distortions such as downsampling, noise, and compression.

Point Cloud Quality Assessment

Blind Surveillance Image Quality Assessment via Deep Neural Network Combined with the Visual Saliency

no code implementations9 Jun 2022 Wei Lu, Wei Sun, Wenhan Zhu, Xiongkuo Min, ZiCheng Zhang, Tao Wang, Guangtao Zhai

In this paper, we first conduct an example experiment (i. e. the face detection task) to demonstrate that the quality of the SIs has a crucial impact on the performance of the IVSS, and then propose a saliency-based deep neural network for the blind quality assessment of the SIs, which helps IVSS to filter the low-quality SIs and improve the detection and recognition performance.

Face Detection Image Quality Assessment

Deep Neural Network for Blind Visual Quality Assessment of 4K Content

no code implementations9 Jun 2022 Wei Lu, Wei Sun, Xiongkuo Min, Wenhan Zhu, Quan Zhou, Jun He, Qiyuan Wang, ZiCheng Zhang, Tao Wang, Guangtao Zhai

In this paper, we propose a deep learning-based BIQA model for 4K content, which on one hand can recognize true and pseudo 4K content and on the other hand can evaluate their perceptual visual quality.

4k Blind Image Quality Assessment +1

A No-Reference Deep Learning Quality Assessment Method for Super-resolution Images Based on Frequency Maps

no code implementations9 Jun 2022 ZiCheng Zhang, Wei Sun, Xiongkuo Min, Wenhan Zhu, Tao Wang, Wei Lu, Guangtao Zhai

Therefore, in this paper, we propose a no-reference deep-learning image quality assessment method based on frequency maps because the artifacts caused by SISR algorithms are quite sensitive to frequency information.

Image Quality Assessment Image Super-Resolution

Perceptual Quality Assessment for Fine-Grained Compressed Images

no code implementations8 Jun 2022 ZiCheng Zhang, Wei Sun, Wei Wu, Ying Chen, Xiongkuo Min, Guangtao Zhai

Nowadays, the mainstream full-reference (FR) metrics are effective to predict the quality of compressed images at coarse-grained levels (the bit rates differences of compressed images are obvious), however, they may perform poorly for fine-grained compressed images whose bit rates differences are quite subtle.

Full-Reference Image Quality Assessment Image Compression

Video-based Human-Object Interaction Detection from Tubelet Tokens

no code implementations4 Jun 2022 Danyang Tu, Wei Sun, Xiongkuo Min, Guangtao Zhai, Wei Shen

We present a novel vision Transformer, named TUTOR, which is able to learn tubelet tokens, served as highly-abstracted spatiotemporal representations, for video-based human-object interaction (V-HOI) detection.

Human-Object Interaction Detection

Deep Decomposition and Bilinear Pooling Network for Blind Night-Time Image Quality Evaluation

no code implementations12 May 2022 Qiuping Jiang, Jiawu Xu, Yudong Mao, Wei Zhou, Xiongkuo Min, Guangtao Zhai

The DDB-Net contains three modules, i. e., an image decomposition module, a feature encoding module, and a bilinear pooling module.

Blind Image Quality Assessment

A Deep Learning based No-reference Quality Assessment Model for UGC Videos

1 code implementation29 Apr 2022 Wei Sun, Xiongkuo Min, Wei Lu, Guangtao Zhai

The proposed model utilizes very sparse frames to extract spatial features and dense frames (i. e. the video chunk) with a very low spatial resolution to extract motion features, which thereby has low computational complexity.

Image Quality Assessment Video Quality Assessment

Saliency in Augmented Reality

1 code implementation18 Apr 2022 Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Jing Li, Guangtao Zhai

Therefore, in this paper, we mainly analyze the interaction effect between background (BG) scenes and AR contents, and study the saliency prediction problem in AR.

Saliency Prediction

Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows

no code implementations20 Mar 2022 Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen

Iwin Transformer is a hierarchical Transformer which progressively performs token representation learning and token agglomeration within irregular windows.

Human-Object Interaction Detection Object +4

Parameterized Image Quality Score Distribution Prediction

no code implementations2 Mar 2022 Yixuan Gao, Xiongkuo Min, Wenhan Zhu, Xiao-Ping Zhang, Guangtao Zhai

Experimental results verifythe feasibility of using alpha stable model to describe the IQSD, and prove the effectiveness of objective alpha stable model basedIQSD prediction method.

valid

Learning Invisible Markers for Hidden Codes in Offline-to-Online Photography

no code implementations CVPR 2022 Jun Jia, Zhongpai Gao, Dandan Zhu, Xiongkuo Min, Guangtao Zhai, Xiaokang Yang

In addition, the automatic localization of hidden codes significantly reduces the time of manually correcting geometric distortions for photos, which is a revolutionary innovation for information hiding in mobile applications.

Self-Conditioned Probabilistic Learning of Video Rescaling

1 code implementation ICCV 2021 Yuan Tian, Guo Lu, Xiongkuo Min, Zhaohui Che, Guangtao Zhai, Guodong Guo, Zhiyong Gao

After optimization, the downscaled video by our framework preserves more meaningful information, which is beneficial for both the upscaling step and the downstream tasks, e. g., video action recognition task.

Video Compression Video Super-Resolution

No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models

2 code implementations5 Jul 2021 ZiCheng Zhang, Wei Sun, Xiongkuo Min, Tao Wang, Wei Lu, Guangtao Zhai

Therefore, many related studies such as point cloud quality assessment (PCQA) and mesh quality assessment (MQA) have been carried out to measure the visual quality degradations of 3D models.

Point Cloud Quality Assessment

Deep Learning based Full-reference and No-reference Quality Assessment Models for Compressed UGC Videos

1 code implementation2 Jun 2021 Wei Sun, Tao Wang, Xiongkuo Min, Fuwang Yi, Guangtao Zhai

The proposed VQA framework consists of three modules, the feature extraction module, the quality regression module, and the quality pooling module.

regression Video Quality Assessment

Blurry Video Frame Interpolation

1 code implementation CVPR 2020 Wang Shen, Wenbo Bao, Guangtao Zhai, Li Chen, Xiongkuo Min, Zhiyong Gao

Existing works reduce motion blur and up-convert frame rate through two separate ways, including frame deblurring and frame interpolation.

Deblurring Video Enhancement +1

Toward Better Understanding of Saliency Prediction in Augmented 360 Degree Videos

no code implementations12 Dec 2019 Yucheng Zhu, Xiongkuo Min, Dandan Zhu, Ke Gu, Jiantao Zhou, Guangtao Zhai, Xiaokang Yang, Wenjun Zhang

The saliency annotations of head and eye movements for both original and augmented videos are collected and together constitute the ARVR dataset.

Object Recognition Optical Flow Estimation +1

How is Gaze Influenced by Image Transformations? Dataset and Model

1 code implementation16 May 2019 Zhaohui Che, Ali Borji, Guangtao Zhai, Xiongkuo Min, Guodong Guo, Patrick Le Callet

Data size is the bottleneck for developing deep saliency models, because collecting eye-movement data is very time consuming and expensive.

Data Augmentation Generative Adversarial Network +1

Invariance Analysis of Saliency Models versus Human Gaze During Scene Free Viewing

1 code implementation10 Oct 2018 Zhaohui Che, Ali Borji, Guangtao Zhai, Xiongkuo Min

Most of current studies on human gaze and saliency modeling have used high-quality stimuli.

Data Augmentation

Terahertz Security Image Quality Assessment by No-reference Model Observers

no code implementations12 Jul 2017 Menghan Hu, Xiongkuo Min, Guangtao Zhai, Wenhan Zhu, Yucheng Zhu, Zhaodi Wang, Xiaokang Yang, Guang Tian

Subsequently, the existing no-reference IQA algorithms, which were 5 opinion-aware approaches viz., NFERM, GMLF, DIIVINE, BRISQUE and BLIINDS2, and 8 opinion-unaware approaches viz., QAC, SISBLIM, NIQE, FISBLIM, CPBD, S3 and Fish_bb, were executed for the evaluation of the THz security image quality.

Image Quality Assessment

Cannot find the paper you are looking for? You can Submit a new open access paper.