3D Visual Activity Assessment Based on Natural Scene Statistics

Cited by: 41
Authors
Lee, Kwanghyun [1]
Moorthy, Anush Krishna [2]
Lee, Sanghoon [1]
Bovik, Alan Conrad [2]
Affiliations
[1] Yonsei Univ, Ctr Informat Technol, Seoul 120749, South Korea
[2] Univ Texas Austin, Dept Elect & Comp Engn, Lab Image & Video Engn, Austin, TX 78712 USA
Funding
National Research Foundation of Singapore; U.S. National Science Foundation;
Keywords
3D visual activity (3DVA); visual natural scene statistic (visual NSS); human visual system (HVS); 3D coordinate transform; stereoscopic video; HORIZONTAL DISPARITY; IMAGE STATISTICS; QUALITY; VIDEO; NORMALIZATION; ENHANCEMENT; COMPRESSION; SENSITIVITY; ADAPTATION; DISCOMFORT;
DOI
10.1109/TIP.2013.2290592
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
One of the most challenging ongoing issues in the field of 3D visual research is how to perceptually quantify object and surface visualizations that are displayed within a virtual 3D space between a human eye and 3D display. To seek an effective method of quantification, it is necessary to measure various elements related to the perception of 3D objects at different depths. We propose a new framework for quantifying 3D visual information that we call 3D visual activity (3DVA), which utilizes natural scene statistics measured over 3D visual coordinates. We account for important aspects of 3D perception by carrying out a 3D coordinate transform reflecting the nonuniform sampling resolution of the eye and the process of stereoscopic fusion. The 3DVA utilizes the empirical distortions of wavelet coefficients to a parametric generalized Gaussian probability distribution model and a set of 3D perceptual weights. We conducted a series of simulations that demonstrate the effectiveness of the 3DVA for quantifying the statistical dynamics of visual 3D space with respect to disparity, motion, texture, and color. A successful example application is also provided, whereby 3DVA is applied to the problem of predicting visual fatigue experienced when viewing 3D displays.
Pages: 450-465
Page count: 16
Related papers
72 entries in total
[1]  
[Anonymous], 2008, IEEE STANDARDS ASS S
[2]  
[Anonymous], 2012, Recommendation BT.500-13
[3]  
[Anonymous], 2012, MIDDLEBURY STEREO
[4]  
[Anonymous], 2005, IM SAF RED INC UND B
[5]   PERIPHERAL SPATIAL VISION - LIMITS IMPOSED BY OPTICS, PHOTORECEPTORS, AND RECEPTOR POOLING [J].
BANKS, MS ;
SEKULER, AB ;
ANDERSON, SJ .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1991, 8 (11) :1775-1787
[6]   STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT [J].
BLAND, JM ;
ALTMAN, DG .
LANCET, 1986, 1 (8476) :307-310
[7]  
Boev A., 2009, D53 MOBILE3DTV
[8]  
Bose T., 2004, DIGITAL SIGNAL IMAGE
[9]   MULTICHANNEL TEXTURE ANALYSIS USING LOCALIZED SPATIAL FILTERS [J].
BOVIK, AC ;
CLARK, M ;
GEISLER, WS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (01) :55-73
[10]   Image compression via joint statistical characterization in the wavelet domain [J].
Buccigrossi, RW ;
Simoncelli, EP .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1999, 8 (12) :1688-1701