Statistical templates for visual search

被引:9
作者
Ackermann, John F. [1 ]
Landy, Michael S. [1 ,2 ]
机构
[1] NYU, Dept Psychol, New York, NY 10003 USA
[2] NYU, Ctr Neural Sci, New York, NY 10003 USA
关键词
texture; visual search; image statistics; SIGNAL-DETECTION; FUNCTIONAL ARCHITECTURE; PSYCHOPHYSICS; MODEL; ATTENTION; REPRESENTATIONS; OBSERVERS; SALIENCY; NOISE;
D O I
10.1167/14.3.18
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
How do we find a target embedded in a scene? Within the framework of signal detection theory, this task is carried out by comparing each region of the scene with a ``template,'' i.e., an internal representation of the search target. Here we ask what form this representation takes when the search target is a complex image with uncertain orientation. We examine three possible representations. The first is the matched filter. Such a representation cannot account for the ease with which humans can find a complex search target that is rotated relative to the template. A second representation attempts to deal with this by estimating the relative orientation of target and match and rotating the intensity-based template. No intensity-based template, however, can account for the ability to easily locate targets that are defined categorically and not in terms of a specific arrangement of pixels. Thus, we define a third template that represents the target in terms of image statistics rather than pixel intensities. Subjects performed a two-alternative, forced choice search task in which they had to localize an image that matched a previously viewed target. Target images were texture patches. In one condition, match images were the same image as the target and distractors were a different image of the same textured material. In the second condition, the match image was of the same texture as the target (but different pixels) and the distractor was an image of a different texture. Match and distractor stimuli were randomly rotated relative to the target. We compared human performance to pixel-based, pixel-based with rotation, and statistic-based search models. The statistic-based search model was most successful at matching human performance. We conclude that humans use summary statistics to search for complex visual targets.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 51 条
[21]  
Eckstein M., 2009, HDB MED IMAGING, V1, P593
[22]   CONTRAST DETECTION AND NEAR-THRESHOLD DISCRIMINATION IN HUMAN-VISION [J].
FOLEY, JM ;
LEGGE, GE .
VISION RESEARCH, 1981, 21 (07) :1041-1053
[23]   Metamers of the ventral stream [J].
Freeman, Jeremy ;
Simoncelli, Eero P. .
NATURE NEUROSCIENCE, 2011, 14 (09) :1195-U130
[24]   THE DESIGN AND USE OF STEERABLE FILTERS [J].
FREEMAN, WT ;
ADELSON, EH .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1991, 13 (09) :891-906
[25]   Contributions of ideal observer theory to vision research [J].
Geisler, Wilson S. .
VISION RESEARCH, 2011, 51 (07) :771-781
[26]  
Graham N. V. S., 1989, Visual Pattern Analyzers
[27]  
Green DM., 1966, Signal Detection Theory and Psychophysics, DOI DOI 10.1086/405615
[28]   NORMALIZATION OF CELL RESPONSES IN CAT STRIATE CORTEX [J].
HEEGER, DJ .
VISUAL NEUROSCIENCE, 1992, 9 (02) :181-197
[29]   RECEPTIVE FIELDS AND FUNCTIONAL ARCHITECTURE OF MONKEY STRIATE CORTEX [J].
HUBEL, DH ;
WIESEL, TN .
JOURNAL OF PHYSIOLOGY-LONDON, 1968, 195 (01) :215-&
[30]  
Hyvarinen A., 2009, NATURAL IMAGE STAT P, V1st