Spatiotemporal Statistics for Video Quality Assessment

被引:148
作者
Li, Xuelong [1 ]
Guo, Qun [1 ]
Lu, Xiaoqiang [1 ]
机构
[1] Chinese Acad Sci, Xian Inst Opt & Precis Mech, State Key Lab Transient Opt & Photon, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian 710119, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Video quality assessment; no-reference; 3D-DCT; natural video; spatiotemporal statistics; NATURAL SCENE STATISTICS; DCT DOMAIN; IMAGE; PREDICTION; VISIBILITY; MECHANISMS; SHAPE;
D O I
10.1109/TIP.2016.2568752
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is an important task to design models for universal no-reference video quality assessment (NR-VQA) in multiple video processing and computer vision applications. However, most existing NR-VQA metrics are designed for specific distortion types, which are not often aware in practical applications. A further deficiency is that the spatial and temporal information of videos is hardly considered simultaneously. In this paper, we propose a new NR-VQA metric based on the spatiotemporal natural video statistics in 3D discrete cosine transform (3D-DCT) domain. In the proposed method, a set of features are first extracted based on the statistical analysis of 3D-DCT coefficients to characterize the spatiotemporal statistics of videos in different views. These features are used to predict the perceived video quality via the efficient linear support vector regression model afterward. The contributions of this paper are: 1) we explore the spatiotemporal statistics of videos in the 3D-DCT domain that has the inherent spatiotemporal encoding advantage over other widely used 2D transformations; 2) we extract a small set of simple but effective statistical features for video visual quality prediction; and 3) the proposed method is universal for multiple types of distortions and robust to different databases. The proposed method is tested on four widely used video databases. Extensive experimental results demonstrate that the proposed method is competitive with the state-of-art NR-VQA metrics and the top-performing full-reference VQA and reduced-reference VQA metrics.
引用
收藏
页码:3329 / 3342
页数:14
相关论文
共 51 条
[1]   Fast algorithm for the 3-D DCT-II [J].
Boussakta, S ;
Alshibami, HO .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (04) :992-1001
[2]   Automatic Prediction of Perceptual Image and Video Quality [J].
Bovik, Alan Conrad .
PROCEEDINGS OF THE IEEE, 2013, 101 (09) :2008-2024
[3]   Motion analysis in 3D DCT domain and its application to video coding [J].
Bozinovic, N ;
Konrad, J .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2005, 20 (06) :510-528
[4]   No-reference image quality assessment based on DCT domain statistics [J].
Brandao, Tomas ;
Queluz, Maria Paula .
SIGNAL PROCESSING, 2008, 88 (04) :822-833
[5]   No-Reference Quality Assessment of H.264/AVC Encoded Video [J].
Brandao, Tomas ;
Queluz, Maria Paula .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2010, 20 (11) :1437-1447
[6]   SPATIOTEMPORAL CHARACTERISTICS OF VISUAL MECHANISMS - EXCITATORY-INHIBITORY MODEL [J].
BURBECK, CA ;
KELLY, DH .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA, 1980, 70 (09) :1121-1126
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]   SUBJECTIVE ASSESSMENT OF H.264/AVC VIDEO SEQUENCES TRANSMITTED OVER A NOISY CHANNEL [J].
De Simone, F. ;
Naccari, M. ;
Tagliasacchi, M. ;
Dufaux, F. ;
Tubaro, S. ;
Ebrahimi, T. .
QOMEX: 2009 INTERNATIONAL WORKSHOP ON QUALITY OF MULTIMEDIA EXPERIENCE, 2009, :204-+
[9]   CONCURRENT PROCESSING STREAMS IN MONKEY VISUAL-CORTEX [J].
DEYOE, EA ;
VANESSEN, DC .
TRENDS IN NEUROSCIENCES, 1988, 11 (05) :219-226
[10]   STATISTICS OF NATURAL TIME-VARYING IMAGES [J].
DONG, DW ;
ATICK, JJ .
NETWORK-COMPUTATION IN NEURAL SYSTEMS, 1995, 6 (03) :345-358