A Completely Blind Video Integrity Oracle

被引:290
作者
Mittal, Anish [1 ,2 ]
Saad, Michele A. [1 ,2 ]
Bovik, Alan C. [1 ,2 ]
机构
[1] Intel Corp, HERE, Santa Clara, CA 95051 USA
[2] Univ Texas Austin, Dept Elect & Comp Engn, Lab Image & Video Engn, Austin, TX 78712 USA
关键词
Intrinsic video statistics; quality assessment; temporal self similarity; spatial domain; IMAGE QUALITY ASSESSMENT; NO-REFERENCE VIDEO; AREA V2; RESPONSES; NEURONS; REPRESENTATION; PREDICTION; SPACE; V1;
D O I
10.1109/TIP.2015.2502725
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Considerable progress has been made toward developing still picture perceptual quality analyzers that do not require any reference picture and that are not trained on human opinion scores of distorted images. However, there do not yet exist any such completely blind video quality assessment (VQA) models. Here, we attempt to bridge this gap by developing a new VQA model called the video intrinsic integrity and distortion evaluation oracle (VIIDEO). The new model does not require the use of any additional information other than the video being quality evaluated. VIIDEO embodies models of intrinsic statistical regularities that are observed in natural vidoes, which are used to quantify disturbances introduced due to distortions. An algorithm derived from the VIIDEO model is thereby able to predict the quality of distorted videos without any external knowledge about the pristine source, anticipated distortions, or human judgments of video quality. Even with such a paucity of information, we are able to show that the VIIDEO algorithm performs much better than the legacy full reference quality measure MSE on the LIVE VQA database and delivers performance comparable with a leading human judgment trained blind VQA model. We believe that the VIIDEO algorithm is a significant step toward making real-time monitoring of completely blind video quality possible. The software release of VIIDEO can be obtained online (http://live.ece.utexas.edu/research/quality/VIIDEO_release.zip).
引用
收藏
页码:289 / 300
页数:12
相关论文
共 51 条
[1]  
[Anonymous], P INT WORKSH VID PRO
[2]  
[Anonymous], IEEE ICIP 2005 SEPT
[3]   Neurons inmonkey visual area V2 encode combinations of orientations [J].
Anzai, Akiyuki ;
Peng, Xinmiao ;
Van Essen, David C. .
NATURE NEUROSCIENCE, 2007, 10 (10) :1313-1321
[4]  
Babu R. V., 2004, P INT WORKSH PACK VI
[5]   Structure and function of visual area MT [J].
Born, RT ;
Bradley, DC .
ANNUAL REVIEW OF NEUROSCIENCE, 2005, 28 :157-189
[6]   Automatic Prediction of Perceptual Image and Video Quality [J].
Bovik, Alan Conrad .
PROCEEDINGS OF THE IEEE, 2013, 101 (09) :2008-2024
[7]   No-reference quality metric for degraded and enhanced video [J].
Caviedes, JE ;
Oberti, F .
VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2003, PTS 1-3, 2003, 5150 :621-632
[8]   EXPERIMENTS IN SEGMENTING TEXTON PATTERNS USING LOCALIZED SPATIAL FILTERS [J].
CLARK, M ;
BOVIK, AC .
PATTERN RECOGNITION, 1989, 22 (06) :707-717
[9]   UNCERTAINTY RELATION FOR RESOLUTION IN SPACE, SPATIAL-FREQUENCY, AND ORIENTATION OPTIMIZED BY TWO-DIMENSIONAL VISUAL CORTICAL FILTERS [J].
DAUGMAN, JG .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1985, 2 (07) :1160-1169
[10]   SUBJECTIVE ASSESSMENT OF H.264/AVC VIDEO SEQUENCES TRANSMITTED OVER A NOISY CHANNEL [J].
De Simone, F. ;
Naccari, M. ;
Tagliasacchi, M. ;
Dufaux, F. ;
Tubaro, S. ;
Ebrahimi, T. .
QOMEX: 2009 INTERNATIONAL WORKSHOP ON QUALITY OF MULTIMEDIA EXPERIENCE, 2009, :204-+