Visual search for arbitrary objects in real scenes

被引:142
作者
Wolfe, Jeremy M. [1 ,2 ,3 ]
Alvarez, George A. [4 ]
Rosenholtz, Ruth [5 ]
Kuzmova, Yoana I. [3 ]
Sherman, Ashley M. [3 ]
机构
[1] Harvard Univ, Sch Med, Dept Ophthalmol, Boston, MA USA
[2] Harvard Univ, Sch Med, Dept Radiol, Boston, MA 02115 USA
[3] Brigham & Womens Hosp, Visual Attent Lab, Cambridge, MA USA
[4] Harvard Univ, Dept Psychol, Cambridge, MA 02138 USA
[5] MIT, Dept Brain & Cognit Sci, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
关键词
Search; Scene perception; Visual search; EYE-MOVEMENT GUIDANCE; HIGH-LEVEL POP; WORLD SCENES; TARGET TEMPLATE; NATURAL SCENES; TIME-COURSE; MEMORY; CONTEXT; ATTENTION; PARALLEL;
D O I
10.3758/s13414-011-0153-3
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
How efficient is visual search in real scenes? In searches for targets among arrays of randomly placed distractors, efficiency is often indexed by the slope of the reaction time (RT) x Set Size function. However, it may be impossible to define set size for real scenes. As an approximation, we hand-labeled 100 indoor scenes and used the number of labeled regions as a surrogate for set size. In Experiment 1, observers searched for named objects (a chair, bowl, etc.). With set size defined as the number of labeled regions, search was very efficient (similar to 5 ms/item). When we controlled for a possible guessing strategy in Experiment 2, slopes increased somewhat (similar to 15 ms/item), but they were much shallower than search for a random object among other distinctive objects outside of a scene setting (Exp. 3: similar to 40 ms/item). In Experiments 4-6, observers searched repeatedly through the same scene for different objects. Increased familiarity with scenes had modest effects on RTs, while repetition of target items had large effects (> 500 ms). We propose that visual search in scenes is efficient because scene-specific forms of attentional guidance can eliminate most regions from the "functional set size" of items that could possibly be the target.
引用
收藏
页码:1650 / 1671
页数:22
相关论文
共 100 条
[11]   Using real-world scenes as contextual cues for search [J].
Brockmole, JR ;
Henderson, JM .
VISUAL COGNITION, 2006, 13 (01) :99-108
[12]   THE ECCENTRICITY EFFECT - TARGET ECCENTRICITY AFFECTS PERFORMANCE ON CONJUNCTION SEARCHES [J].
CARRASCO, M ;
EVERT, DL ;
CHANG, I ;
KATZ, SM .
PERCEPTION & PSYCHOPHYSICS, 1995, 57 (08) :1241-1261
[13]   Initial scene representations facilitate eye movement guidance in visual search [J].
Castelhano, Monica S. ;
Henderson, John M. .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2007, 33 (04) :753-763
[14]   The relative contribution of scene context and target features to visual search in scenes [J].
Castelhano, Monica S. ;
Heaven, Chelsea .
ATTENTION PERCEPTION & PSYCHOPHYSICS, 2010, 72 (05) :1283-1297
[15]   Scene consistency in object and background perception [J].
Davenport, JL ;
Potter, MC .
PSYCHOLOGICAL SCIENCE, 2004, 15 (08) :559-564
[16]   Untangling invariant object recognition [J].
DiCarlo, James J. ;
Cox, David D. .
TRENDS IN COGNITIVE SCIENCES, 2007, 11 (08) :333-341
[17]   Searching for a perceived stare in the crowd [J].
Doi, Hirokazu ;
Ueda, Kazuhiro .
PERCEPTION, 2007, 36 (05) :773-780
[18]  
DROLL J, 2008, J VISION, V8, pA320, DOI DOI 10.1167/8.6.320
[19]   VISUAL-SEARCH AND STIMULUS SIMILARITY [J].
DUNCAN, J ;
HUMPHREYS, GW .
PSYCHOLOGICAL REVIEW, 1989, 96 (03) :433-458
[20]   Attentional cues in real scenes, saccadic targeting, and Bayesian priors [J].
Eckstein, Miguel P. ;
Drescher, Barbara A. ;
Shimozaki, Steven S. .
PSYCHOLOGICAL SCIENCE, 2006, 17 (11) :973-980