Active Visual Segmentation

被引:56
作者
Mishra, Ajay K. [1 ]
Aloimonos, Yiannis [1 ]
Cheong, Loong-Fah [2 ]
Kassim, Ashraf A. [2 ]
机构
[1] Univ Maryland, Dept Comp Sci, Comp Vis Lab, College Pk, MD 20742 USA
[2] Natl Univ Singapore, Singapore 117576, Singapore
关键词
Fixation-based segmentation; object segmentation; polar space; cue integration; scale invariance; visual attention; EYE-MOVEMENTS; ATTENTION; FEATURES; SALIENCY; MODEL;
D O I
10.1109/TPAMI.2011.171
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Attention is an integral part of the human visual system and has been widely studied in the visual attention literature. The human eyes fixate at important locations in the scene, and every fixation point lies inside a particular region of arbitrary shape and size, which can either be an entire object or a part of it. Using that fixation point as an identification marker on the object, we propose a method to segment the object of interest by finding the "optimal" closed contour around the fixation point in the polar space, avoiding the perennial problem of scale in the Cartesian space. The proposed segmentation process is carried out in two separate steps: First, all visual cues are combined to generate the probabilistic boundary edge map of the scene; second, in this edge map, the "optimal" closed contour around a given fixation point is found. Having two separate steps also makes it possible to establish a simple feedback between the mid-level cue (regions) and the low-level visual cues (edges). In fact, we propose a segmentation refinement process based on such a feedback process. Finally, our experiments show the promise of the proposed method as an automatic segmentation framework for a general purpose visual system.
引用
收藏
页码:639 / 653
页数:15
相关论文
共 46 条
[1]  
[Anonymous], 2009, P IEEE C COMP VIS PA
[2]  
[Anonymous], P IEEE C COMP VIS PA
[3]  
Arbelaez P., 2008, P IEEE C COMP VIS PA, P454
[4]  
Bagon S, 2008, LECT NOTES COMPUT SC, V5305, P30, DOI 10.1007/978-3-540-88693-8_3
[5]  
Blake A, 2004, LECT NOTES COMPUT SC, V3021, P428
[6]   An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision [J].
Boykov, Y ;
Kolmogorov, V .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (09) :1124-1137
[7]  
Boykov YY, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL I, PROCEEDINGS, P105, DOI 10.1109/ICCV.2001.937505
[8]   High accuracy optical flow estimation based on a theory for warping [J].
Brox, T ;
Bruhn, A ;
Papenberg, N ;
Weickert, J .
COMPUTER VISION - ECCV 2004, PT 4, 2004, 2034 :25-36
[9]   Saliency, attention, and visual search: An information theoretic approach [J].
Bruce, Neil D. B. ;
Tsotsos, John K. .
JOURNAL OF VISION, 2009, 9 (03)
[10]  
Cerf M., 2008, P NEUR INF PROC SYST