Stereoscopic Video Quality Prediction Based on End-to-End Dual Stream Deep Neural Networks

被引:13
作者
Zhou, Wei [1 ]
Chen, Zhibo [1 ]
Li, Weiping [1 ]
机构
[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Anhui, Peoples R China
来源
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III | 2018年 / 11166卷
关键词
Convolutional neural network; Stereoscopic video; No-reference video quality assessment; Spatiotemporal pooling;
D O I
10.1007/978-3-030-00764-5_44
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a no-reference stereoscopic video quality assessment (NR-SVQA) method based on an end-to-end dual stream deep neural network (DNN), which incorporates left and right view sub-networks. The end-to-end dual stream network takes image patch pairs from left and right view pivotal frames as inputs and evaluates the perceptual quality of each image patch pair. By combining multiple convolution, max-pooling and fully-connected layers with regression in the framework, distortion related features are learned end-to-end and purely data driven. Then, a spatiotemporal pooling strategy is employed on these image patch pairs to estimate the entire stereoscopic video quality. The proposed network architecture, which we name End-to-end Dual stream deep Neural network (EDN), is trained and tested on the well-known stereoscopic video dataset divided by reference videos. Experimental results demonstrate that our proposed method outperforms state-of-the-art algorithms.
引用
收藏
页码:482 / 492
页数:11
相关论文
共 27 条
[1]  
[Anonymous], 2010, P ADV NEUR INF PROC
[2]  
[Anonymous], 2016, PATTERN RECOGN, DOI DOI 10.1016/j.patcog.2016.01.034
[3]  
[Anonymous], PROC CVPR IEEE
[4]  
[Anonymous], SUBJECTIVE VIDEO QUA
[5]  
Bosse S, 2016, IEEE IMAGE PROC, P3773, DOI 10.1109/ICIP.2016.7533065
[6]   Blind Stereoscopic Video Quality Assessment: From Depth Perception to Overall Experience [J].
Chen, Zhibo ;
Zhou, Wei ;
Li, Weiping .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (02) :721-734
[7]  
Han J., 2012, VISUAL COMMUN-US, P1
[8]   A fast learning algorithm for deep belief nets [J].
Hinton, Geoffrey E. ;
Osindero, Simon ;
Teh, Yee-Whye .
NEURAL COMPUTATION, 2006, 18 (07) :1527-1554
[9]   Blind Image Quality Assessment via Deep Learning [J].
Hou, Weilong ;
Gao, Xinbo ;
Tao, Dacheng ;
Li, Xuelong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (06) :1275-1286
[10]   No reference stereo video quality assessment based on motion feature in tensor decomposition domain [J].
Jiang, Gangyi ;
Liu, Shanshan ;
Yu, Mei ;
Shao, Feng ;
Peng, Zongju ;
Chen, Fen .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 50 :247-262