Search Results for author: Huiyu Duan

Found 12 papers, 5 papers with code

How is Visual Attention Influenced by Text Guidance? Database and Model

no code implementations11 Apr 2024 Yinan Sun, Xiongkuo Min, Huiyu Duan, Guangtao Zhai

Finally, considering the effect of text descriptions on visual attention, while most existing saliency models ignore this impact, we further propose a text-guided saliency (TGSal) prediction model, which extracts and integrates both image features and text features to predict the image saliency under various text-description conditions.

Saliency Prediction

Perceptual Video Quality Assessment: A Survey

no code implementations5 Feb 2024 Xiongkuo Min, Huiyu Duan, Wei Sun, Yucheng Zhu, Guangtao Zhai

Perceptual video quality assessment plays a vital role in the field of video processing due to the existence of quality degradations introduced in various stages of video signal acquisition, compression, transmission and display.

Video Quality Assessment

Audio-visual Saliency for Omnidirectional Videos

no code implementations9 Nov 2023 Yuxin Zhu, Xilei Zhu, Huiyu Duan, Jie Li, Kaiwei Zhang, Yucheng Zhu, Li Chen, Xiongkuo Min, Guangtao Zhai

Visual saliency prediction for omnidirectional videos (ODVs) has shown great significance and necessity for omnidirectional videos to help ODV coding, ODV transmission, ODV rendering, etc..

Saliency Prediction

Perceptual Quality Assessment of Omnidirectional Audio-visual Signals

1 code implementation20 Jul 2023 Xilei Zhu, Huiyu Duan, Yuqin Cao, Yuxin Zhu, Yucheng Zhu, Jing Liu, Li Chen, Xiongkuo Min, Guangtao Zhai

Omnidirectional videos (ODVs) play an increasingly important role in the application fields of medical, education, advertising, tourism, etc.

AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI Generated Images: from the Perspectives of Quality, Authenticity and Correspondence

1 code implementation1 Jul 2023 Jiarui Wang, Huiyu Duan, Jing Liu, Shi Chen, Xiongkuo Min, Guangtao Zhai

In this paper, in order to get a better understanding of the human visual preferences for AIGIs, a large-scale IQA database for AIGC is established, which is named as AIGCIQA2023.

Image Quality Assessment Text-to-Image Generation

Masked Autoencoders as Image Processors

1 code implementation30 Mar 2023 Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Long Teng, Jia Wang, Guangtao Zhai

Recently, masked autoencoders (MAE) for feature pre-training have further unleashed the potential of Transformers, leading to state-of-the-art performances on various high-level vision tasks.

Deblurring Image Defocus Deblurring +2

Perceptual Quality Assessment of Omnidirectional Images

no code implementations6 Jul 2022 Huiyu Duan, Guangtao Zhai, Xiongkuo Min, Yucheng Zhu, Yi Fang, Xiaokang Yang

The original and distorted omnidirectional images, subjective quality ratings, and the head and eye movement data together constitute the OIQA database.

Image Quality Assessment

Saliency in Augmented Reality

1 code implementation18 Apr 2022 Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Jing Li, Guangtao Zhai

Therefore, in this paper, we mainly analyze the interaction effect between background (BG) scenes and AR contents, and study the saliency prediction problem in AR.

Saliency Prediction

Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows

no code implementations20 Mar 2022 Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen

Iwin Transformer is a hierarchical Transformer which progressively performs token representation learning and token agglomeration within irregular windows.

Human-Object Interaction Detection Object +4

Cannot find the paper you are looking for? You can Submit a new open access paper.