Leveraging visual attention and neural activity for stereoscopic 3D visual comfort assessment

被引:0
作者
Qiuping Jiang
Feng Shao
Gangyi Jiang
Mei Yu
Zongju Peng
机构
[1] Ningbo University,Faculty of Information Science and Engineering
来源
Multimedia Tools and Applications | 2017年 / 76卷
关键词
Quality of experience (QoE); Stereoscopic three-dimensional (S3D); Visual comfort assessment (VCA); Visual attention; Neural activity; Random forest (RF);
D O I
暂无
中图分类号
学科分类号
摘要
Visual comfort assessment (VCA) for stereoscopic three-dimensional (S3D) images is a challenging problem in the community of 3D quality of experience (3D-QoE). The goal of VCA is to automatically predict the degree of perceived visual discomfort in line with subjective judgment. The challenges of VCA typically lie in the following two aspects: 1) formulating effective visual comfort-aware features, and 2) finding an appropriate way to pool them into an overall visual comfort score. In this paper, a novel two-stage framework is proposed to address these problems. In the first stage, primary predictive feature (PPF) and advanced predictive feature (APF) are separately extracted and then integrated to reflect the perceived visual discomfort for 3D viewing. Specifically, we compute the S3D visual attention-weighted disparity statistics and neural activities of the middle temporal (MT) area in human brain to construct the PPF and APF, respectively. Followed by the first stage, the integrated visual comfort-aware features are fused with a single visual comfort score by using random forest (RF) regression, mapping from a high-dimensional feature space into a low-dimensional quality (visual comfort) space. Comparison results with five state-of-the-art relevant models on a standard benchmark database confirm the superior performance of our proposed method.
引用
收藏
页码:9405 / 9425
页数:20
相关论文
共 137 条
[1]  
Achanta R(2012)SLIC superpixels compared to state-of-the-art superpixel methods IEEE Trans Pattern Anal Mach Intell 34 2274-2282
[2]  
Shaji A(2013)State-of-the-art in visual attention modeling IEEE Trans Pattern Anal Mach Intell 35 185-207
[3]  
Smith K(2001)Random forests Mach Learn 45 5-32
[4]  
Lucchi A(2010)Visual fatigue modeling and analysis for stereoscopic video Opt Eng 51 017206-283
[5]  
Fua P(1997)Responses of primary visual cortical neurons to binocular disparity without depth perception Nature 389 280-680
[6]  
üsstrunk S(1998)Cortical area MT and the perception of stereoscopic depth Nature 394 677-1415
[7]  
Borji A(1999)Organization of disparity-selective neurons in macaque area MT J Neurosci 19 1398-1111
[8]  
Itti L(2003)Coding of horizontal disparity and velocity by MT neurons in the alert macaque J Neurophysiol 89 1094-2636
[9]  
Breiman L(2014)Saliency detection for stereoscopic images IEEE Trans Image Process 23 2625-2098
[10]  
Choi J(2014)3-D object retrieval with Hausdorff distance learning IEEE Trans Ind Electron 61 2088-4303