Feature combination strategies for saliency-based visual attention systems

Cited by: 298
Authors
Itti, L [1]
Koch, C [1]
Affiliations
[1] CALTECH, Computat & Neural Syst Program, Pasadena, CA 91125 USA
Funding
National Science Foundation (USA)
Keywords
DOI
10.1117/1.1333677
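Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code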
0808 ; 0809 ;
Abstract
Bottom-up or saliency-based visual attention allows primates to detect nonspecific conspicuous targets in cluttered scenes. A classical metaphor, derived from electrophysiological and psychophysical studies, describes attention as a rapidly shiftable "spotlight". We use a model that reproduces the attentional scan paths of this spotlight. Simple multi-scale "feature maps" detect local spatial discontinuities in intensity, color, and orientation, and are combined into a single "master" or "saliency" map. The saliency map is scanned sequentially, in order of decreasing saliency, by the focus of attention. Here we study the problem of combining feature maps from different visual modalities (such as color and orientation) into a single saliency map. Four combination strategies are compared using three databases of natural color images: (1) simple normalized summation, (2) linear combination with learned weights, (3) global nonlinear normalization followed by summation, and (4) local nonlinear competition between salient locations followed by summation. Performance was measured as the number of false detections before the most salient target was found. Strategy (1) always yielded the poorest performance and (2) the best, with a threefold to eightfold improvement in time to find a salient target. However, (2) yielded specialized systems with poor generalization. Interestingly, strategy (4) and its simplified, computationally efficient approximation (3) yielded significantly better performance than (1), with up to fourfold improvement, while preserving generality. © 2001 SPIE and IS&T.
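The contrast between strategies (1) and (3) can be illustrated with a minimal sketch. The abstract does not spell out the normalization, so the form used here is an assumption: each map is rescaled to [0, 1] and then weighted by (M - m)^2, where M is its global maximum and m is the mean of its other local maxima. The function names (`rescale`, `local_maxima`, `combine_naive`, `combine_global_nonlinear`) and the 4-neighbour peak test are hypothetical choices made for illustration only.

```python
import numpy as np

def rescale(fmap, top=1.0):
    """Linearly rescale a feature map to the fixed range [0, top]."""
    fmap = np.asarray(fmap, dtype=float)
    lo, hi = fmap.min(), fmap.max()
    if hi == lo:                      # flat map carries no saliency
        return np.zeros_like(fmap)
    return (fmap - lo) / (hi - lo) * top

def local_maxima(fmap):
    """Values of pixels strictly greater than their 4 neighbours."""
    p = np.pad(fmap, 1, mode="constant", constant_values=-np.inf)
    c = p[1:-1, 1:-1]
    mask = ((c > p[:-2, 1:-1]) & (c > p[2:, 1:-1]) &
            (c > p[1:-1, :-2]) & (c > p[1:-1, 2:]))
    return c[mask]

def combine_naive(maps):
    """Strategy (1): rescale every map to a common range, then sum."""
    return sum(rescale(m) for m in maps)

def combine_global_nonlinear(maps):
    """Strategy (3), assumed form: weight each map by (M - m)^2, then sum."""
    total = 0.0
    for fmap in maps:
        r = rescale(fmap)
        peaks = np.sort(local_maxima(r))
        big = r.max()                                   # global maximum M
        # m: mean of the local maxima once the single largest is set aside
        rest = peaks[:-1].mean() if peaks.size > 1 else 0.0
        total = total + r * (big - rest) ** 2
    return total

# One map with a single strong peak, one map with many equal peaks.
target = np.zeros((9, 9))
target[4, 4] = 1.0
clutter = np.zeros((9, 9))
clutter[1::2, 1::2] = 1.0

naive = combine_naive([target, clutter])
nonlinear = combine_global_nonlinear([target, clutter])
```

In this toy case the naive sum leaves the target tied with every clutter peak, whereas the assumed nonlinear weighting suppresses the map with many comparable peaks, so the target is the unique maximum.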
Pages: 161-169
Page count: 9
Related papers
26 entries in total
[1]  
BALUJA S, 1997, ROBOT AUTON SYST, V22, P329
[2]   A model for inhibitory lateral interaction effects in perceived contrast [J].
Cannon, MW ;
Fullenkamp, SC .
VISION RESEARCH, 1996, 36 (08) :1115-1125
[3]  
GILBERT CD, 1989, J NEUROSCI, V9, P2432
[4]   Spatial integration and cortical dynamics [J].
Gilbert, CD ;
Das, A ;
Ito, M ;
Kapadia, M ;
Westheimer, G .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (02) :615-622
[5]   The representation of visual salience in monkey parietal cortex [J].
Gottlieb, JP ;
Kusunoki, M ;
Goldberg, ME .
NATURE, 1998, 391 (6666) :481-484
[6]   NORMALIZATION OF CELL RESPONSES IN CAT STRIATE CORTEX [J].
HEEGER, DJ .
VISUAL NEUROSCIENCE, 1992, 9 (02) :181-197
[7]  
Horiuchi TK, 1997, ADV NEUR IN, V9, P706
[8]   A model of saliency-based visual attention for rapid scene analysis [J].
Itti, L ;
Koch, C ;
Niebur, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (11) :1254-1259
[9]   A saliency-based search mechanism for overt and covert shifts of visual attention [J].
Itti, L ;
Koch, C .
VISION RESEARCH, 2000, 40 (10-12) :1489-1506
[10]  
ITTI L, UNPUB COOPERATIVE VI