Statistical templates for visual search

被引:9
作者
Ackermann, John F. [1 ]
Landy, Michael S. [1 ,2 ]
机构
[1] NYU, Dept Psychol, New York, NY 10003 USA
[2] NYU, Ctr Neural Sci, New York, NY 10003 USA
关键词
texture; visual search; image statistics; SIGNAL-DETECTION; FUNCTIONAL ARCHITECTURE; PSYCHOPHYSICS; MODEL; ATTENTION; REPRESENTATIONS; OBSERVERS; SALIENCY; NOISE;
D O I
10.1167/14.3.18
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
How do we find a target embedded in a scene? Within the framework of signal detection theory, this task is carried out by comparing each region of the scene with a ``template,'' i.e., an internal representation of the search target. Here we ask what form this representation takes when the search target is a complex image with uncertain orientation. We examine three possible representations. The first is the matched filter. Such a representation cannot account for the ease with which humans can find a complex search target that is rotated relative to the template. A second representation attempts to deal with this by estimating the relative orientation of target and match and rotating the intensity-based template. No intensity-based template, however, can account for the ability to easily locate targets that are defined categorically and not in terms of a specific arrangement of pixels. Thus, we define a third template that represents the target in terms of image statistics rather than pixel intensities. Subjects performed a two-alternative, forced choice search task in which they had to localize an image that matched a previously viewed target. Target images were texture patches. In one condition, match images were the same image as the target and distractors were a different image of the same textured material. In the second condition, the match image was of the same texture as the target (but different pixels) and the distractor was an image of a different texture. Match and distractor stimuli were randomly rotated relative to the target. We compared human performance to pixel-based, pixel-based with rotation, and statistic-based search models. The statistic-based search model was most successful at matching human performance. We conclude that humans use summary statistics to search for complex visual targets.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 51 条
[1]   Frequency tuning of perceptual templates changes with noise magnitude [J].
Abbey, Craig K. ;
Eckstein, Miguel P. .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2009, 26 (11) :B72-B83
[2]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[3]  
[Anonymous], 1966, Textures: a photographic album for artists and designers
[4]   SOME INFORMATIONAL ASPECTS OF VISUAL PERCEPTION [J].
ATTNEAVE, F .
PSYCHOLOGICAL REVIEW, 1954, 61 (03) :183-193
[5]   A summary-statistic representation in peripheral vision explains visual crowding [J].
Balas, Benjamin ;
Nakano, Lisa ;
Rosenholtz, Ruth .
JOURNAL OF VISION, 2009, 9 (12)
[6]   Texture synthesis and perception: Using computational models to study texture representations in the human visual system [J].
Balas, BJ .
VISION RESEARCH, 2006, 46 (03) :299-309
[7]  
Barlow H.B., 1961, SENS COMMUN, V1, DOI DOI 10.7551/MITPRESS/9780262518420.003.0013
[8]   MODEL OBSERVERS FOR ASSESSMENT OF IMAGE QUALITY [J].
BARRETT, HH ;
YAO, J ;
ROLLAND, JP ;
MYERS, KJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1993, 90 (21) :9758-9765
[9]   ON EXISTENCE OF NEURONES IN HUMAN VISUAL SYSTEM SELECTIVELY SENSITIVE TO ORIENTATION AND SIZE OF RETINAL IMAGES [J].
BLAKEMORE, C ;
CAMPBELL, FW .
JOURNAL OF PHYSIOLOGY-LONDON, 1969, 203 (01) :237-+
[10]   INTERACTION EFFECTS IN PARAFOVEAL LETTER RECOGNITION [J].
BOUMA, H .
NATURE, 1970, 226 (5241) :177-&