1 code implementation • 4 Dec 2024 • Qi Zheng, Yibo Fan, Leilei Huang, Tianyu Zhu, Jiaming Liu, Zhijian Hao, Shuo Xing, Chia-Ju Chen, Xiongkuo Min, Alan C. Bovik, Zhengzhong Tu
Numerous deep learning-based VQA models have been developed, with progress in this direction driven by the creation of content-diverse, large-scale human-labeled databases that supply ground truth psychometric video quality data.
1 code implementation • 26 Nov 2024 • Ruoxi Zhu, Zhengzhong Tu, Jiaming Liu, Alan C. Bovik, Yibo Fan
Moreover, MWFormer allows for a novel way of tuning, during application, to either a single type of weather restoration or to hybrid weather restoration without any retraining, offering greater controllability than existing methods.
no code implementations • 17 Oct 2024 • Bowen Chen, Zaixi Shang, Jae Won Chung, David Lerner, Werner Robitza, Rakesh Rao Ramachandra Rao, Alexander Raake, Alan C. Bovik
To bridge this data gap, we introduce the LIVE-Viasat Real-World Satellite QoE Database.
no code implementations • 11 Oct 2024 • Abhijay Ghildyal, Yuanhan Chen, Saman Zadtootaghaj, Nabajeet Barman, Alan C. Bovik
The advent of AI has influenced many aspects of human life, from self-driving cars and intelligent chatbots to text-based image and video generation models capable of creating realistic images and videos based on user prompts (text-to-image, image-to-image, and image-to-video).
no code implementations • 13 Aug 2024 • Yu-Chih Chen, Avinab Saha, ALEXANDRE CHAPIRO, Christian Häne, Jean-Charles Bazin, Bo Qiu, Stefano Zanetti, Ioannis Katsavounidis, Alan C. Bovik
We study the visual quality judgments of human subjects on digital human avatars (sometimes referred to as "holograms" in the parlance of virtual reality [VR] and augmented reality [AR] systems) that have been subjected to distortions.
no code implementations • 4 Aug 2024 • Krishna Srikar Durbha, Alan C. Bovik
Here we develop a perceptually optimized method of constructing optimal per-shot bitrate and quality ladders, using an ensemble of low-level features and Visual Information Fidelity (VIF) features extracted from different scales and subbands.
no code implementations • 24 Jun 2024 • Sandeep Mishra, Oindrila Saha, Alan C. Bovik
Our method generates 3D animals that are not possible to create using previous text-to-3D generative methods.
no code implementations • 11 Jun 2024 • Sandeep Mishra, Oindrila Saha, Alan C. Bovik
A NeRF is then initialized using this 3D shape using depth-controlled SDS.
no code implementations • 20 Apr 2024 • Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Hassene Tmar, Alan C. Bovik
The deep learning revolution has strongly impacted low-level image processing tasks such as style/domain transfer, enhancement/restoration, and visual quality assessments.
no code implementations • 20 Apr 2024 • Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Hassene Tmar, Alan C. Bovik
High Dynamic Range (HDR) videos have enjoyed a surge in popularity in recent years due to their ability to represent a wider range of contrast and color than Standard Dynamic Range (SDR) videos.
no code implementations • 22 Mar 2024 • Abhinau K. Venkataramanan, Alan C. Bovik
We demonstrate the usefulness of the new subjective database by benchmarking objective models of visual quality on it.
1 code implementation • 5 Jan 2024 • Sandeep Mishra, Mukul Jha, Alan C. Bovik
We conducted a large-scale subjective study of the perceptual quality of User-Generated Mobile Video Content on a set of mobile-originated videos obtained from the Indian social media platform ShareChat.
no code implementations • 13 Dec 2023 • Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Alan C. Bovik
Recent years have seen steady growth in the popularity and availability of High Dynamic Range (HDR) content, particularly videos, streamed over the internet.
no code implementations • 12 Dec 2023 • Krishna Srikar Durbha, Hassene Tmar, Cosmin Stejerean, Ioannis Katsavounidis, Alan C. Bovik
Over the past few years, a variety of methods have been proposed to construct optimal bitrate ladders including using low-level features to predict cross-over bitrates, optimal resolutions for each bitrate, predicting visual quality, etc.
no code implementations • 27 Nov 2023 • Hakan Emre Gedik, Abhinau K. Venkataramanan, Alan C. Bovik
Deep learning techniques have revolutionized the fields of image restoration and image quality assessment in recent years.
no code implementations • 26 Nov 2023 • Abhinau K. Venkataramanan, Alan C. Bovik
Information-theoretic image quality assessment (IQA) models such as Visual Information Fidelity (VIF) and Spatio-temporal Reduced Reference Entropic Differences (ST-RRED) have enjoyed great success by seamlessly integrating natural scene statistics (NSS) with information theory.
1 code implementation • 18 Nov 2023 • Shreshth Saini, Avinab Saha, Alan C. Bovik
Our findings demonstrate that self-supervised pre-trained neural networks on SDR content can be further fine-tuned in a self-supervised setting using limited unlabeled HDR videos to achieve state-of-the-art performance on the only publicly available VQA database for HDR content, the LIVE-HDR VQA database.
no code implementations • 26 May 2023 • Avinab Saha, Yu-Chih Chen, Chase Davis, Bo Qiu, Xiaoming Wang, Rahul Gowda, Ioannis Katsavounidis, Alan C. Bovik
We present the outcomes of a recent large-scale subjective study of Mobile Cloud Gaming Video Quality Assessment (MCG-VQA) on a diverse set of gaming videos.
1 code implementation • 24 May 2023 • Yunxiang Li, Meixu Chen, Wenxuan Yang, Kai Wang, Jun Ma, Alan C. Bovik, You Zhang
Image translation has wide applications, such as style transfer and modality conversion, usually aiming to generate images having both high degrees of realism and faithfulness.
1 code implementation • 14 May 2023 • Maniratnam Mandal, Deepti Ghadiyaram, Danna Gurari, Alan C. Bovik
The photographs taken by visually impaired users often suffer from one or both of two kinds of quality issues: technical quality (distortions), and semantic quality, such as framing and aesthetic composition.
1 code implementation • 3 May 2023 • Yu-Chih Chen, Avinab Saha, Chase Davis, Bo Qiu, Xiaoming Wang, Rahul Gowda, Ioannis Katsavounidis, Alan C. Bovik
The mobile cloud gaming industry has been rapidly growing over the last decade.
no code implementations • 25 Apr 2023 • Joshua P. Ebenezer, Zaixi Shang, Yixu Chen, Yongjun Wu, Hai Wei, Sriram Sethuraman, Alan C. Bovik
We conducted a large-scale study of human perceptual quality judgments of High Dynamic Range (HDR) and Standard Dynamic Range (SDR) videos subjected to scaling and compression levels and viewed on three different display devices.
no code implementations • 25 Apr 2023 • Joshua P. Ebenezer, Zaixi Shang, Yongjun Wu, Hai Wei, Sriram Sethuraman, Alan C. Bovik
We introduce a novel feature set, which we call HDRMAX features, that when included into Video Quality Assessment (VQA) algorithms designed for Standard Dynamic Range (SDR) videos, sensitizes them to distortions of High Dynamic Range (HDR) videos that are inadequately accounted for by these algorithms.
no code implementations • 25 Apr 2023 • Joshua P. Ebenezer, Zaixi Shang, Yongjun Wu, Hai Wei, Sriram Sethuraman, Alan C. Bovik
We present a no-reference video quality model and algorithm that delivers standout performance for High Dynamic Range (HDR) videos, which we call HDR-ChipQA.
no code implementations • 6 Apr 2023 • Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Alan C. Bovik
The Visual Multimethod Assessment Fusion (VMAF) algorithm has recently emerged as a state-of-the-art approach to video quality prediction, that now pervades the streaming and social media industry.
2 code implementations • CVPR 2023 • Avinab Saha, Sandeep Mishra, Alan C. Bovik
To advance research in this field, we propose a Mixture of Experts approach to train two separate encoders to learn high-level content and low-level image quality features in an unsupervised setting.
Ranked #3 on No-Reference Image Quality Assessment on CSIQ
no code implementations • 20 Sep 2022 • Zaixi Shang, Joshua P. Ebenezer, Alan C. Bovik, Yongjun Wu, Hai Wei, Sriram Sethuraman
High Dynamic Range (HDR) videos can represent a much greater range of brightness and color than Standard Dynamic Range (SDR) videos and are rapidly becoming an industry standard.
1 code implementation • 29 Jun 2022 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
Perceptual video quality assessment (VQA) is an integral component of many streaming and video sharing platforms.
Ranked #1 on Video Quality Assessment on LIVE-ETRI (using extra training data)
no code implementations • 10 Jun 2022 • Somdyuti Paul, Andrey Norkin, Alan C. Bovik
Adaptive video streaming relies on the construction of efficient bitrate ladders to deliver the best possible visual quality to viewers under bandwidth constraints.
no code implementations • 21 May 2022 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
We consider the problem of capturing distortions arising from changes in frame rate as part of Video Quality Assessment (VQA).
no code implementations • 26 Apr 2022 • Li-Heng Chen, Christos G. Bampis, Zhi Li, Lukáš Krasula, Alan C. Bovik
By conducting extensive experimental tests on existing deep image compression models, we show results that our new resizing parameter estimation framework can provide Bj{\o}ntegaard-Delta rate (BD-rate) improvement of about 10% against leading perceptual quality engines.
no code implementations • 31 Mar 2022 • Xiangxu Yu, Zhengzhong Tu, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
In recent years, with the vigorous development of the video game industry, the proportion of gaming videos on major video websites like YouTube has dramatically increased.
no code implementations • 30 Mar 2022 • Meixu Chen, Richard Webb, Alan C. Bovik
In our learning based approach, we implement foveation by introducing a Foveation Generator Unit (FGU) that generates foveation masks which direct the allocation of bits, significantly increasing compression efficiency while making it possible to retain an impression of little to no additional visual loss given an appropriate viewing geometry.
no code implementations • 24 Mar 2022 • Xiangxu Yu, Zhenqiang Ying, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
A number of studies have been directed towards understanding the perceptual characteristics of professionally generated gaming videos arising in gaming video streaming, online gaming, and cloud gaming.
2 code implementations • 23 Feb 2022 • Abhinau K. Venkataramanan, Cosmin Stejerean, Alan C. Bovik
Fusion-based quality assessment has emerged as a powerful method for developing high-performance quality models from quality models that individually achieve lower performances.
1 code implementation • 5 Jan 2022 • Qi Zheng, Zhengzhong Tu, Pavan C. Madhusudana, Xiaoyang Zeng, Alan C. Bovik, Yibo Fan
Video quality assessment (VQA) remains an important and challenging problem that affects many applications at the widest scales.
2 code implementations • 25 Oct 2021 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
We consider the problem of obtaining image quality representations in a self-supervised manner.
Ranked #2 on Video Quality Assessment on LIVE-ETRI (using extra training data)
no code implementations • 5 Oct 2021 • Somdyuti Paul, Andrey Norkin, Alan C. Bovik
Block based motion estimation is integral to inter prediction processes performed in hybrid video codecs.
no code implementations • 27 Sep 2021 • Pavan C Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
In this work we address the problem of frame rate dependent Video Quality Assessment (VQA) when the videos to be compared have different frame rate and compression factor.
Ranked #2 on Video Quality Assessment on LIVE-YT-HFR
1 code implementation • 17 Sep 2021 • Joshua P. Ebenezer, Zaixi Shang, Yongjun Wu, Hai Wei, Sriram Sethuraman, Alan C. Bovik
We propose a new model for no-reference video quality assessment (VQA).
Ranked #1 on Video Quality Assessment on LIVE Livestream
no code implementations • 15 Jun 2021 • Zaixi Shang, Joshua P. Ebenezer, Alan C. Bovik, Yongjun Wu, Hai Wei, Sriram Sethuraman
Video live streaming is gaining prevalence among video streaming services, especially for the delivery of popular sporting events.
no code implementations • 20 May 2021 • Li-Heng Chen, Christos G. Bampis, Zhi Li, Chao Chen, Alan C. Bovik
The layers of convolutional neural networks (CNNs) can be used to alter the resolution of their inputs, but the scaling factors are limited to integer values.
no code implementations • 31 Mar 2021 • Dae Yeol Lee, Hyunsuk Ko, Jongho Kim, Alan C. Bovik
As a stringent test of the new model, we apply it to the difficult problem of predicting the quality of videos subjected not only to compression, but also to downsampling in space and/or time.
no code implementations • 30 Jan 2021 • Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
Video and image quality assessment has long been projected as a regression problem, which requires predicting a continuous quality score given an input stimulus.
no code implementations • 29 Jan 2021 • Dae Yeol Lee, Hyunsuk Ko, Jongho Kim, Alan C. Bovik
It is well-known that natural images possess statistical regularities that can be captured by bandpass decomposition and divisive normalization processes that approximate early neural processing in the human visual system.
1 code implementation • 26 Jan 2021 • Zhengzhong Tu, Xiangxu Yu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
However, these models are either incapable or inefficient for predicting the quality of complex and diverse UGC videos in practical applications.
Ranked #4 on Video Quality Assessment on LIVE Livestream
1 code implementation • 16 Jan 2021 • Abhinau K. Venkataramanan, Chengyang Wu, Alan C. Bovik, Ioannis Katsavounidis, Zafar Shahid
The Structural Similarity (SSIM) Index is a very widely used image/video quality model that continues to play an important role in the perceptual evaluation of compression algorithms, encoding recipes and numerous other image/video processing algorithms.
no code implementations • 29 Dec 2020 • Todd Goodall, Alan C. Bovik
Towards enhancing DVP education we have created a carefully constructed gallery of educational tools that is designed to complement a comprehensive corpus of online lectures by providing examples of DVP on real-world content, along with a user-friendly interface that organizes numerous key DVP topics ranging from analog video, to human visual processing, to modern video codecs, etc.
1 code implementation • 26 Oct 2020 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
We consider the problem of conducting frame rate dependent video quality assessment (VQA) on videos of diverse frame rates, including high frame rate (HFR) videos.
Ranked #1 on Video Quality Assessment on LIVE-YT-HFR
1 code implementation • 29 Sep 2020 • Meixu Chen, Todd Goodall, Anjul Patney, Alan C. Bovik
Our framework exploits the regularities inherent to video motion, which we capture by using displaced frame differences as video representations to train the neural network.
1 code implementation • 22 Sep 2020 • Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, Alan C. Bovik
Banding artifacts, which manifest as staircase-like color bands on pictures or video frames, is a common distortion caused by compression of low-textured smooth regions.
1 code implementation • 22 Jul 2020 • Pavan C. Madhusudana, Xiangxu Yu, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
We also conducted a holistic evaluation of existing state-of-the-art Full and No-Reference video quality algorithms, and statistically benchmarked their performance on the new database.
no code implementations • 3 Jul 2020 • Li-Heng Chen, Christos G. Bampis, Zhi Li, Andrey Norkin, Alan C. Bovik
Mean squared error (MSE) and $\ell_p$ norms have largely dominated the measurement of loss in neural networks due to their simplicity and analytical properties.
no code implementations • 19 Jun 2020 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
High frame rate videos are increasingly getting popular in recent years, driven by the strong requirements of the entertainment and streaming industries to provide high quality of experiences to consumers.
Ranked #3 on Video Quality Assessment on LIVE-YT-HFR
5 code implementations • 29 May 2020 • Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
Recent years have witnessed an explosion of user-generated content (UGC) videos shared and streamed over the Internet, thanks to the evolution of affordable and reliable consumer capture devices, and the tremendous popularity of social media platforms.
Ranked #13 on Video Quality Assessment on LIVE-FB LSVQ
no code implementations • 27 Feb 2020 • Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, Alan C. Bovik
Banding artifact, or false contouring, is a common video compression impairment that tends to appear on large flat regions in encoded videos.
no code implementations • 25 Feb 2020 • Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
Many objective video quality assessment (VQA) algorithms include a key step of temporal pooling of frame-level quality scores.
1 code implementation • 19 Oct 2019 • Li-Heng Chen, Christos G. Bampis, Zhi Li, Andrey Norkin, Alan C. Bovik
By building on top of an existing deep image compression model, we are able to demonstrate a bitrate reduction of as much as $31\%$ over MSE optimization, given a specified perceptual quality (VMAF) level.
1 code implementation • 15 Jun 2019 • Somdyuti Paul, Andrey Norkin, Alan C. Bovik
In VP9 video codec, the sizes of blocks are decided during encoding by recursively partitioning 64$\times$64 superblocks using rate-distortion optimization (RDO).
no code implementations • 26 Nov 2018 • Sungsoo Kim, Jin Soo Park, Christos G. Bampis, Jaeseong Lee, Mia K. Markey, Alexandros G. Dimakis, Alan C. Bovik
We propose a video compression framework using conditional Generative Adversarial Networks (GANs).
no code implementations • 5 Mar 2018 • Zeina Sinno, Alan C. Bovik
We demonstrate the value of the new resource, which we call the LIVE Video Quality Challenge Database (LIVE-VQC), by conducting a comparison of leading NR video quality predictors on it.
1 code implementation • 28 Aug 2017 • Hui Zeng, Lei Zhang, Alan C. Bovik
Recognizing this, we propose a new representation of perceptual image quality, called probabilistic quality representation (PQR), to describe the image subjective score distribution, whereby a more robust loss function can be employed to train a deep BIQA model.
1 code implementation • 2 Mar 2017 • Christos G. Bampis, Alan C. Bovik
Mobile streaming video data accounts for a large and increasing percentage of wireless network traffic.
Multimedia
1 code implementation • 15 Sep 2016 • Deepti Ghadiyaram, Alan C. Bovik
Current top-performing blind perceptual image quality prediction models are generally trained on legacy databases of human quality opinion scores on synthetically distorted images.
no code implementations • 9 Nov 2015 • Deepti Ghadiyaram, Alan C. Bovik
Towards overcoming these limitations, we designed and created a new database that we call the LIVE In the Wild Image Quality Challenge Database, which contains widely diverse authentic image distortions on a large number of images captured using a representative variety of modern mobile devices.
Blind Image Quality Assessment Small Data Image Classification
1 code implementation • IEEE Transacations on Image Processing 2014 • Michele A. Saad, Alan C. Bovik, Christophe Charrier
3) We show that the proposed NSS and motion coherency models are appropriate for quality assessment of videos, and we utilize them to design a blind VQA algorithm that correlates highly with human judgments of quality.
Ranked #4 on Video Quality Assessment on LIVE-ETRI
no code implementations • 14 Aug 2013 • Wufeng Xue, Lei Zhang, Xuanqin Mou, Alan C. Bovik
We present a new effective and efficient IQA model, called gradient magnitude similarity deviation (GMSD).
Ranked #6 on Image Quality Assessment on MSU FR VQA Database