Search Results for author: Huiyu Duan

Found 27 papers, 10 papers with code

Multi-Dimensional Quality Assessment for Text-to-3D Assets: Dataset and Model

no code implementations • 24 Feb 2025 • Kang Fu, Huiyu Duan, ZiCheng Zhang, Xiaohong Liu, Xiongkuo Min, Jia Wang, Guangtao Zhai

This database encompasses 969 validated 3D assets generated from 170 prompts via 6 popular text-to-3D asset generation models, and corresponding subjective quality ratings for these assets from the perspectives of quality, authenticity, and text-asset correspondence, respectively.

Text to 3D

HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment

no code implementations • 2 Jan 2025 • Zitong Xu, Huiyu Duan, Guangji Ma, Liu Yang, Jiarui Wang, Qingbo Wu, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet

To address the issue and facilitate the advancement of IHAs, we introduce the first Image Quality Assessment Database for image Harmony evaluation (HarmonyIQAD), which consists of 1,350 harmonized images generated by 9 different IHAs, and the corresponding human visual preference scores.

Image Harmonization Image Quality Assessment +1

ESVQA: Perceptual Quality Assessment of Egocentric Spatial Videos

no code implementations • 29 Dec 2024 • Xilei Zhu, Huiyu Duan, Liu Yang, Yucheng Zhu, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet

With the rapid development of eXtended Reality (XR), egocentric spatial shooting and display technologies have further enhanced immersion and engagement for users.

Video Quality Assessment Visual Question Answering (VQA)

FineVQ: Fine-Grained User Generated Content Video Quality Assessment

no code implementations • 26 Dec 2024 • Huiyu Duan, Qiang Hu, Jiarui Wang, Liu Yang, Zitong Xu, Lu Liu, Xiongkuo Min, Chunlei Cai, Tianxiao Ye, Xiaoyun Zhang, Guangtao Zhai

The rapid growth of user-generated content (UGC) videos has produced an urgent need for effective video quality assessment (VQA) algorithms to monitor video quality and guide optimization and recommendation procedures.

Video Quality Assessment Visual Question Answering (VQA)

F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration

no code implementations • 17 Dec 2024 • Lu Liu, Huiyu Duan, Qiang Hu, Liu Yang, Chunlei Cai, Tianxiao Ye, Huayu Liu, Xiaoyun Zhang, Guangtao Zhai

The FaceQ database comprises 12,255 images generated by 29 models across three tasks: (1) face generation, (2) face customization, and (3) face restoration.

Benchmarking Face Generation +1

AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM

1 code implementation • 26 Nov 2024 • Jiarui Wang, Huiyu Duan, Guangtao Zhai, Juntong Wang, Xiongkuo Min

The rapid advancement of large multimodal models (LMMs) has led to the rapid expansion of artificial intelligence generated videos (AIGVs), which highlights the pressing need for effective video quality assessment (VQA) models designed specifically for AIGVs.

Benchmarking Text-to-Video Generation +3

MMHead: Towards Fine-grained Multi-modal 3D Facial Animation

no code implementations • 10 Oct 2024 • Sijing Wu, Yunhao Li, Yichao Yan, Huiyu Duan, Ziwei Liu, Guangtao Zhai

To fill this gap, we first construct a large-scale multi-modal 3D facial animation dataset, MMHead, which consists of 49 hours of 3D facial motion sequences, speech audios, and rich hierarchical text annotations.

Motion Generation text annotation +1

How Does Audio Influence Visual Attention in Omnidirectional Videos? Database and Model

no code implementations • 10 Aug 2024 • Yuxin Zhu, Huiyu Duan, Kaiwei Zhang, Yucheng Zhu, Xilei Zhu, Long Teng, Xiongkuo Min, Guangtao Zhai

To advance the research on audio-visual saliency prediction for ODVs, we further establish a new benchmark based on the AVS-ODV database by testing numerous state-of-the-art saliency models, including visual-only models and audio-visual models.

Prediction Saliency Prediction

ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images

no code implementations • 31 Jul 2024 • Xilei Zhu, Liu Yang, Huiyu Duan, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet

In this paper, we establish the Egocentric Spatial Images Quality Assessment Database (ESIQAD), the first IQA database dedicated for egocentric spatial images as far as we know.

Image Quality Assessment

UniProcessor: A Text-induced Unified Low-level Image Processor

1 code implementation • 30 Jul 2024 • Huiyu Duan, Xiongkuo Min, Sijing Wu, Wei Shen, Guangtao Zhai

In this paper, we propose a text-induced unified image processor for low-level vision tasks, termed UniProcessor, which can effectively process various degradation types and levels, and support multimodal control.

Image Enhancement Image Restoration +1

Quality-guided Skin Tone Enhancement for Portrait Photography

no code implementations • 22 Jun 2024 • Shiqi Gao, Huiyu Duan, Xinyue Li, Kang Fu, Yicong Peng, Qihang Xu, Yuanyuan Chang, Jia Wang, Xiongkuo Min, Guangtao Zhai

In this paper, we propose a quality-guided image enhancement paradigm that enables image enhancement models to learn the distribution of images with various quality ratings.

Image Enhancement

MVBIND: Self-Supervised Music Recommendation For Videos Via Embedding Space Binding

no code implementations • 15 May 2024 • Jiajie Teng, Huiyu Duan, Yucheng Zhu, Sijing Wu, Guangtao Zhai

However, at present, the background music of short videos is generally chosen by the video producer, and there is a lack of automatic music recommendation methods for short videos.

Cross-Modal Retrieval Music Recommendation +1

Quality Assessment for AI Generated Images with Instruction Tuning

1 code implementation • 12 May 2024 • Jiarui Wang, Huiyu Duan, Guangtao Zhai, Xiongkuo Min

Artificial Intelligence Generated Content (AIGC) has grown rapidly in recent years, among which AI-based image generation has gained widespread attention due to its efficient and imaginative image creation ability.

Image Generation Image Quality Assessment

How is Visual Attention Influenced by Text Guidance? Database and Model

no code implementations • 11 Apr 2024 • Yinan Sun, Xiongkuo Min, Huiyu Duan, Guangtao Zhai

Finally, considering the effect of text descriptions on visual attention, while most existing saliency models ignore this impact, we further propose a text-guided saliency (TGSal) prediction model, which extracts and integrates both image features and text features to predict the image saliency under various text-description conditions.

Prediction Saliency Prediction

Perceptual Video Quality Assessment: A Survey

no code implementations • 5 Feb 2024 • Xiongkuo Min, Huiyu Duan, Wei Sun, Yucheng Zhu, Guangtao Zhai

Perceptual video quality assessment plays a vital role in the field of video processing due to the existence of quality degradations introduced in various stages of video signal acquisition, compression, transmission and display.

Survey Video Quality Assessment

Audio-visual Saliency for Omnidirectional Videos

no code implementations • 9 Nov 2023 • Yuxin Zhu, Xilei Zhu, Huiyu Duan, Jie Li, Kaiwei Zhang, Yucheng Zhu, Li Chen, Xiongkuo Min, Guangtao Zhai

Visual saliency prediction for omnidirectional videos (ODVs) has shown great significance and necessity for omnidirectional videos to help ODV coding, ODV transmission, ODV rendering, etc.

Prediction Saliency Prediction

Perceptual Quality Assessment of Omnidirectional Audio-visual Signals

1 code implementation • 20 Jul 2023 • Xilei Zhu, Huiyu Duan, Yuqin Cao, Yuxin Zhu, Yucheng Zhu, Jing Liu, Li Chen, Xiongkuo Min, Guangtao Zhai

Omnidirectional videos (ODVs) play an increasingly important role in the application fields of medical, education, advertising, tourism, etc.

AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI Generated Images: from the Perspectives of Quality, Authenticity and Correspondence

1 code implementation • 1 Jul 2023 • Jiarui Wang, Huiyu Duan, Jing Liu, Shi Chen, Xiongkuo Min, Guangtao Zhai

In this paper, in order to get a better understanding of the human visual preferences for AIGIs, a large-scale IQA database for AIGC is established, which is named as AIGCIQA2023.

Image Quality Assessment Text-to-Image Generation

Masked Autoencoders as Image Processors

1 code implementation • 30 Mar 2023 • Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Long Teng, Jia Wang, Guangtao Zhai

Recently, masked autoencoders (MAE) for feature pre-training have further unleashed the potential of Transformers, leading to state-of-the-art performances on various high-level vision tasks.

Deblurring Image Defocus Deblurring +2

Perceptual Quality Assessment of Omnidirectional Images

no code implementations • 6 Jul 2022 • Huiyu Duan, Guangtao Zhai, Xiongkuo Min, Yucheng Zhu, Yi Fang, Xiaokang Yang

The original and distorted omnidirectional images, subjective quality ratings, and the head and eye movement data together constitute the OIQA database.

Image Quality Assessment

Saliency in Augmented Reality

1 code implementation • 18 Apr 2022 • Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Jing Li, Guangtao Zhai

Therefore, in this paper, we mainly analyze the interaction effect between background (BG) scenes and AR contents, and study the saliency prediction problem in AR.

Prediction Saliency Prediction

Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows

no code implementations • 20 Mar 2022 • Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen

Iwin Transformer is a hierarchical Transformer which progressively performs token representation learning and token agglomeration within irregular windows.

Human-Object Interaction Detection Object +4
