ChipQA: No-Reference Video Quality Prediction via Space-Time Chips

17 Sep 2021  ·  Joshua P. Ebenezer, Zaixi Shang, Yongjun Wu, Hai Wei, Sriram Sethuraman, Alan C. Bovik

We propose a new model for no-reference video quality assessment (VQA). Our approach is built on highly localized space-time (ST) slices called Space-Time Chips (ST Chips): localized cuts of video data taken along directions that implicitly capture motion. We first process the video data with perceptually motivated bandpass and normalization models, and then select oriented ST Chips based on how closely they fit parametric models of natural video statistics. We show that the parameters describing these statistics can be used to reliably predict video quality without the need for a reference video. The proposed method implicitly models ST video naturalness and deviations from naturalness. We train and test our model on several large VQA databases and show that it achieves state-of-the-art performance at reduced cost, without requiring explicit motion computation.
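The abstract describes the pipeline only at a high level. The sketch below illustrates one plausible reading of the core idea: compute mean-subtracted contrast-normalized (MSCN) coefficients per frame, cut small oriented space-time slices through a local cube of coefficients, and keep the orientation whose coefficients best match a parametric (generalized Gaussian) model of natural statistics. All function names, window sizes, the candidate orientation set, and the selection criterion here are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of the ST-Chip idea, assuming a grayscale video volume of
# shape (T, H, W). Everything below is an illustrative approximation of the
# approach outlined in the abstract, not the ChipQA reference code.
import numpy as np
from scipy.ndimage import gaussian_filter
from scipy.special import gamma


def mscn(frame, sigma=7.0 / 6.0, c=1.0):
    """Mean-subtracted contrast-normalized (MSCN) coefficients of one frame."""
    mu = gaussian_filter(frame, sigma)
    var = gaussian_filter(frame * frame, sigma) - mu * mu
    return (frame - mu) / (np.sqrt(np.clip(var, 0.0, None)) + c)


def ggd_shape(x):
    """Estimate the generalized Gaussian shape parameter by moment matching."""
    x = x.ravel()
    rho = np.mean(np.abs(x)) ** 2 / (np.mean(x ** 2) + 1e-12)
    shapes = np.arange(0.2, 10.0, 0.001)
    r = gamma(2.0 / shapes) ** 2 / (gamma(1.0 / shapes) * gamma(3.0 / shapes))
    return shapes[np.argmin((r - rho) ** 2)]


def best_st_chip(mscn_stack, row, col, half=2):
    """From a local space-time cube of MSCN coefficients, pick the oriented
    slice whose statistics are closest to a Gaussian (shape parameter ~2),
    i.e. the cut that implicitly follows the local motion direction."""
    T = mscn_stack.shape[0]
    offsets = np.arange(-half, half + 1)
    best_chip, best_score = None, np.inf
    # Candidate spatial orientations for the cut through time (assumed set).
    for dr, dc in [(0, 1), (1, 0), (1, 1), (1, -1)]:
        chip = np.stack([
            mscn_stack[t, row + dr * offsets, col + dc * offsets]
            for t in range(T)
        ])
        score = abs(ggd_shape(chip) - 2.0)  # distance from Gaussianity
        if score < best_score:
            best_chip, best_score = chip, score
    return best_chip


# Usage on a synthetic video volume (5 frames of 64x64 noise):
video = np.random.rand(5, 64, 64)
stack = np.stack([mscn(f) for f in video])
chip = best_st_chip(stack, row=32, col=32)
print(chip.shape, ggd_shape(chip))
```

In a full quality model, the fitted parameters (e.g. shape and variance of the selected chips) would be aggregated into a feature vector and regressed against subjective scores; that step is omitted here.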

| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Video Quality Assessment | KoNViD-1k | ChipQA | PLCC | 0.7625 | #18 |
| Video Quality Assessment | LIVE-ETRI | ChipQA | SRCC | 0.6323 | #3 |
| Video Quality Assessment | LIVE Livestream | ChipQA | SRCC | 0.7575 | #1 |
| Video Quality Assessment | LIVE-VQC | ChipQA | PLCC | 0.7299 | #17 |
| Video Quality Assessment | YouTube-UGC | ChipQA | PLCC | 0.6911 | #14 |
