Active Visual Segmentation

被引:56
作者
Mishra, Ajay K. [1 ]
Aloimonos, Yiannis [1 ]
Cheong, Loong-Fah [2 ]
Kassim, Ashraf A. [2 ]
机构
[1] Univ Maryland, Dept Comp Sci, Comp Vis Lab, College Pk, MD 20742 USA
[2] Natl Univ Singapore, Singapore 117576, Singapore
关键词
Fixation-based segmentation; object segmentation; polar space; cue integration; scale invariance; visual attention; EYE-MOVEMENTS; ATTENTION; FEATURES; SALIENCY; MODEL;
D O I
10.1109/TPAMI.2011.171
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Attention is an integral part of the human visual system and has been widely studied in the visual attention literature. The human eyes fixate at important locations in the scene, and every fixation point lies inside a particular region of arbitrary shape and size, which can either be an entire object or a part of it. Using that fixation point as an identification marker on the object, we propose a method to segment the object of interest by finding the "optimal" closed contour around the fixation point in the polar space, avoiding the perennial problem of scale in the Cartesian space. The proposed segmentation process is carried out in two separate steps: First, all visual cues are combined to generate the probabilistic boundary edge map of the scene; second, in this edge map, the "optimal" closed contour around a given fixation point is found. Having two separate steps also makes it possible to establish a simple feedback between the mid-level cue (regions) and the low-level visual cues (edges). In fact, we propose a segmentation refinement process based on such a feedback process. Finally, our experiments show the promise of the proposed method as an automatic segmentation framework for a general purpose visual system.
引用
收藏
页码:639 / 653
页数:15
相关论文
共 46 条
[11]   Mean shift: A robust approach toward feature space analysis [J].
Comaniciu, D ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) :603-619
[12]   A neural model of figure-ground organization [J].
Craft, Edward ;
Schuetze, Hartmut ;
Niebur, Ernst ;
von der Heydt, Ruediger .
JOURNAL OF NEUROPHYSIOLOGY, 2007, 97 (06) :4310-4326
[13]  
Dimitrov P, 2000, PROC CVPR IEEE, P417, DOI 10.1109/CVPR.2000.855849
[14]   Features versus Context: An Approach for Precise and Detailed Detection and Delineation of Faces and Facial Features [J].
Ding, Liya ;
Martinez, Aleix M. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (11) :2022-2038
[15]   Efficient graph-based image segmentation [J].
Felzenszwalb, PF ;
Huttenlocher, DP .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 59 (02) :167-181
[16]  
Ferrari V, 2006, LECT NOTES COMPUT SC, V3953, P14, DOI 10.1007/11744078_2
[17]  
Gur M, 1997, J NEUROSCI, V17, P2914
[18]   Eye movements and picture processing during recognition [J].
Henderson, JM ;
Williams, CC ;
Castelhano, MS ;
Falk, RJ .
PERCEPTION & PSYCHOPHYSICS, 2003, 65 (05) :725-734
[19]  
Henderson JM., 1998, Eye Movements during Scene Viewing: An Overview in Eye Guidance in Reading and Scene Perception
[20]   Change detection in the flicker paradigm: The role of fixation position within the scene [J].
Hollingworth, A ;
Schrock, G ;
Henderson, JM .
MEMORY & COGNITION, 2001, 29 (02) :296-304