Computing 3D saliency from a 2D image

被引:0
作者
Ramenahalli, Sudarshan [1 ]
Niebur, Ernst [1 ]
机构
[1] Johns Hopkins Univ, Zanvyl Krieger Mind Brain Inst, Baltimore, MD 21218 USA
来源
2013 47TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS) | 2013年
关键词
VISUAL-ATTENTION; MODEL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A saliency map 111 is a model of visual selective attention using purely bottom-up features of an image like color, intensity and orientation. Another bottom-up feature of visual input is depth, the distance between eye (or sensor) and objects in the visual field. In this report we study the effect of depth on saliency. Different from previous work, we do not use measured depth (disparity) information but, instead, compute a 3D depth map from the 2D image using a depth learning algorithm. This computed depth is then added as an additional feature channel to the 2D saliency map, and all feature channels are linearly combined with equal weights to obtain a 3-dimensional saliency map. We compare the efficacy of saliency maps (2D and 3D) in predicting human eye fixations using three different performance measures. The 3D saliency map outperforms its 2D counterpart in predicting human eye fixations on all measures. Perhaps surprisingly, our 3D saliency map computed from a 2D image performs better than an existing 3D saliency model that uses explicit depth information.
引用
收藏
页数:5
相关论文
共 21 条
[1]  
[Anonymous], 2006, Advances in Neural Information Processing Systems
[2]   Attention switching in depth using random-dot autostereograms: Attention gradient asymmetries [J].
Arnott, SR ;
Shedden, JM .
PERCEPTION & PSYCHOPHYSICS, 2000, 62 (07) :1459-1473
[3]   Spatial cuing in a stereoscopic display: Evidence for a "depth-aware" attentional focus [J].
Atchley, P ;
Kramer, AF ;
Andersen, GJ ;
Theeuwes, J .
PSYCHONOMIC BULLETIN & REVIEW, 1997, 4 (04) :524-529
[4]  
Aziz M. Z., 2010, APPL IM PATT REC WOR, P1
[5]   Quantitative Analysis of Human-Model Agreement in Visual Saliency Modeling: A Comparative Study [J].
Borji, Ali ;
Sihite, Dicky N. ;
Itti, Laurent .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (01) :55-69
[6]   A bimodal laser-based attention system [J].
Frintrop, S ;
Rome, E ;
Nüchter, A ;
Surmann, H .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2005, 100 (1-2) :124-151
[7]  
Frintrop S., 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566), P2167
[8]  
Gautier Josselin, 2012, COGN COMPUT, V4, P1
[9]   Recovering surface layout from an image [J].
Hoiem, Derek ;
Efros, Alexei A. ;
Hebert, Martial .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 75 (01) :151-172
[10]   A model of saliency-based visual attention for rapid scene analysis [J].
Itti, L ;
Koch, C ;
Niebur, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (11) :1254-1259