A Completely Blind Video Integrity Oracle

被引:290
作者
Mittal, Anish [1 ,2 ]
Saad, Michele A. [1 ,2 ]
Bovik, Alan C. [1 ,2 ]
机构
[1] Intel Corp, HERE, Santa Clara, CA 95051 USA
[2] Univ Texas Austin, Dept Elect & Comp Engn, Lab Image & Video Engn, Austin, TX 78712 USA
关键词
Intrinsic video statistics; quality assessment; temporal self similarity; spatial domain; IMAGE QUALITY ASSESSMENT; NO-REFERENCE VIDEO; AREA V2; RESPONSES; NEURONS; REPRESENTATION; PREDICTION; SPACE; V1;
D O I
10.1109/TIP.2015.2502725
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Considerable progress has been made toward developing still picture perceptual quality analyzers that do not require any reference picture and that are not trained on human opinion scores of distorted images. However, there do not yet exist any such completely blind video quality assessment (VQA) models. Here, we attempt to bridge this gap by developing a new VQA model called the video intrinsic integrity and distortion evaluation oracle (VIIDEO). The new model does not require the use of any additional information other than the video being quality evaluated. VIIDEO embodies models of intrinsic statistical regularities that are observed in natural vidoes, which are used to quantify disturbances introduced due to distortions. An algorithm derived from the VIIDEO model is thereby able to predict the quality of distorted videos without any external knowledge about the pristine source, anticipated distortions, or human judgments of video quality. Even with such a paucity of information, we are able to show that the VIIDEO algorithm performs much better than the legacy full reference quality measure MSE on the LIVE VQA database and delivers performance comparable with a leading human judgment trained blind VQA model. We believe that the VIIDEO algorithm is a significant step toward making real-time monitoring of completely blind video quality possible. The software release of VIIDEO can be obtained online (http://live.ece.utexas.edu/research/quality/VIIDEO_release.zip).
引用
收藏
页码:289 / 300
页数:12
相关论文
共 51 条
[11]   TEMPORAL DECORRELATION - A THEORY OF LAGGED AND NONLAGGED RESPONSES IN THE LATERAL GENICULATE-NUCLEUS [J].
DONG, DW ;
ATICK, JJ .
NETWORK-COMPUTATION IN NEURAL SYSTEMS, 1995, 6 (02) :159-178
[12]   A prototype no-reference video quality system [J].
Dosselmann, Richard ;
Yang, Xue Dong .
FOURTH CANADIAN CONFERENCE ON COMPUTER AND ROBOT VISION, PROCEEDINGS, 2007, :411-+
[13]   Visual Response Properties of V1 Neurons Projecting to V2 in Macaque [J].
El-Shamayleh, Yasmine ;
Kumbhani, Romesh D. ;
Dhruv, Neel T. ;
Movshon, J. Anthony .
JOURNAL OF NEUROSCIENCE, 2013, 33 (42) :16594-16605
[14]   Metamers of the ventral stream [J].
Freeman, Jeremy ;
Simoncelli, Eero P. .
NATURE NEUROSCIENCE, 2011, 14 (09) :1195-U130
[15]   NORMALIZATION OF CELL RESPONSES IN CAT STRIATE CORTEX [J].
HEEGER, DJ .
VISUAL NEUROSCIENCE, 1992, 9 (02) :181-197
[16]  
Hegdé J, 2000, J NEUROSCI, V20, part. no.
[17]   Representation of angles embedded within contour stimuli in area V2 of macaque monkeys [J].
Ito, M ;
Komatsu, H .
JOURNAL OF NEUROSCIENCE, 2004, 24 (13) :3313-3324
[18]  
Karatzoglou A., 2004, J. Stat. Softw., V11, P1, DOI [DOI 10.18637/JSS.V011.I09, 10.18637/jss.v011.i09]
[19]  
Kawayoke Y, 2008, IEEE IMAGE PROC, P385, DOI 10.1109/ICIP.2008.4711772
[20]   NO-REFERENCE VIDEO QUALITY EVALUATION FOR HIGH-DEFINITION VIDEO [J].
Keimel, Christian ;
Oelbaum, Tobias ;
Diepold, Klaus .
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, :1145-1148