Visual search for arbitrary objects in real scenes

Cited by: 142
Authors
Wolfe, Jeremy M. [1 ,2 ,3 ]
Alvarez, George A. [4 ]
Rosenholtz, Ruth [5 ]
Kuzmova, Yoana I. [3 ]
Sherman, Ashley M. [3 ]
Affiliations
[1] Harvard Univ, Sch Med, Dept Ophthalmol, Boston, MA USA
[2] Harvard Univ, Sch Med, Dept Radiol, Boston, MA 02115 USA
[3] Brigham & Womens Hosp, Visual Attent Lab, Cambridge, MA USA
[4] Harvard Univ, Dept Psychol, Cambridge, MA 02138 USA
[5] MIT, Dept Brain & Cognit Sci, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
Keywords
Search; Scene perception; Visual search; EYE-MOVEMENT GUIDANCE; HIGH-LEVEL POP; WORLD SCENES; TARGET TEMPLATE; NATURAL SCENES; TIME-COURSE; MEMORY; CONTEXT; ATTENTION; PARALLEL;
DOI
10.3758/s13414-011-0153-3
Chinese Library Classification number
B84 [Psychology];
Discipline classification code
04 ; 0402 ;
Abstract
How efficient is visual search in real scenes? In searches for targets among arrays of randomly placed distractors, efficiency is often indexed by the slope of the reaction time (RT) × Set Size function. However, it may be impossible to define set size for real scenes. As an approximation, we hand-labeled 100 indoor scenes and used the number of labeled regions as a surrogate for set size. In Experiment 1, observers searched for named objects (a chair, bowl, etc.). With set size defined as the number of labeled regions, search was very efficient (~5 ms/item). When we controlled for a possible guessing strategy in Experiment 2, slopes increased somewhat (~15 ms/item), but they were much shallower than search for a random object among other distinctive objects outside of a scene setting (Exp. 3: ~40 ms/item). In Experiments 4-6, observers searched repeatedly through the same scene for different objects. Increased familiarity with scenes had modest effects on RTs, while repetition of target items had large effects (> 500 ms). We propose that visual search in scenes is efficient because scene-specific forms of attentional guidance can eliminate most regions from the "functional set size" of items that could possibly be the target.
Pages: 1650-1671
Page count: 22
Related papers
100 in total
[1]   A summary-statistic representation in peripheral vision explains visual crowding [J].
Balas, Benjamin ;
Nakano, Lisa ;
Rosenholtz, Ruth .
JOURNAL OF VISION, 2009, 9 (12)
[2]   Visual objects in context [J].
Bar, M .
NATURE REVIEWS NEUROSCIENCE, 2004, 5 (08) :617-629
[3]   Visual search for colour targets that are or are not linearly separable from distractors [J].
Bauer, B ;
Jolicoeur, P ;
Cowan, WB .
VISION RESEARCH, 1996, 36 (10) :1439-1466
[4]   OBJECT SEARCH IN NONSCENE DISPLAYS [J].
BIEDERMAN, I ;
BLICKLE, TW ;
TEITELBAUM, RC ;
KLATSKY, GJ .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 1988, 14 (03) :456-467
[5]   SCENE PERCEPTION - DETECTING AND JUDGING OBJECTS UNDERGOING RELATIONAL VIOLATIONS [J].
BIEDERMAN, I ;
MEZZANOTTE, RJ ;
RABINOWITZ, JC .
COGNITIVE PSYCHOLOGY, 1982, 14 (02) :143-177
[6]   SEARCHING FOR OBJECTS IN REAL-WORLD SCENES [J].
BIEDERMAN, I ;
GLASS, AL ;
STACY, EW .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1973, 97 (01) :22-27
[7]   Visual long-term memory has a massive storage capacity for object details [J].
Brady, Timothy F. ;
Konkle, Talia ;
Alvarez, George A. ;
Oliva, Aude .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (38) :14325-14329
[8]   The depth of distractor processing in search with clutter [J].
Bravo, Mary J. ;
Farid, Hany .
PERCEPTION, 2007, 36 (06) :821-829
[9]   The specificity of the search template [J].
Bravo, Mary J. ;
Farid, Hany .
JOURNAL OF VISION, 2009, 9 (01)
[10]   Search for a category target in clutter [J].
Bravo, MJ ;
Farid, H .
PERCEPTION, 2004, 33 (06) :643-652