Vanishing point attracts gaze in free-viewing and visual search tasks

被引:8
|
作者
Borji, Ali [1 ]
Feng, Mengyang [2 ]
Lu, Huchuan [2 ]
机构
[1] Univ Cent Florida, Dept Comp Sci, Ctr Comp Vis Res, Orlando, FL 32816 USA
[2] Dalian Univ Technol, Dept Elect Engn, Dalian, Peoples R China
来源
JOURNAL OF VISION | 2016年 / 16卷 / 14期
关键词
visual attention; eye movements; bottom-up attention; top-down attention; saliency; free viewing; visual search; vanishing point; perspective; global context; gist; scene perception; FEATURE-BASED ATTENTION; EYE-MOVEMENTS; SCENE; MODEL; WORLD; REPRESENTATION; OBJECT; MEMORY; INTEGRATION; ALLOCATION;
D O I
10.1167/16.14.18
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
Several structural scene cues such as gist, layout, horizontal line, openness, and depth have been shown to guide scene perception (e.g., Oliva & Torralba, 2001); Ross & Oliva, 2009). Here, to investigate whether vanishing point (VP) plays a significant role in gaze guidance, we ran two experiments. In the first one, we recorded fixations of 10 observers (six male, four female; mean age 22; SD = 0.84) freely viewing 532 images, out of which 319 had a VP (shuffled presentation; each image for 4 s). We found that the average number of fixations at a local region (80 x 80 pixels) centered at the VP is significantly higher than the average fixations at random locations (t test; n = 319; p < 0.001). To address the confounding factor of saliency, we learned a combined model of bottom-up saliency and VP. The AUC (area under curve) score of our model (0.85; SD = 0.01) is significantly higher than the base saliency model (e.g., 0.8 using attention for information maximization (AIM) model by Bruce & Tsotsos, 2005, t test; p = 3.14e-16) and the VP-only model (0.64, t test; p, 0.001). In the second experiment, we asked 14 subjects (10 male, four female; mean age 23.07, SD = 1.26) to search for a target character (T or L) placed randomly on a 3 3 3 imaginary grid overlaid on top of an image. Subjects reported their answers by pressing one of the two keys. Stimuli consisted of 270 color images (180 with a single VP, 90 without). The target happened with equal probability inside each cell (15 times L, 15 times T). We found that subjects were significantly faster (and more accurate) when the target appeared inside the cell containing the VP compared to cells without the VP (median across 14 subjects 1.34 s vs. 1.96 s; Wilcoxon rank-sum test; p = 0.0014). These findings support the hypothesis that vanishing point, similar to face, text (Cerf, Frady, & Koch, 2009), and gaze direction (Borji, Parks, & Itti, 2014) guides attention in free-viewing and visual search tasks.
引用
收藏
页数:22
相关论文
共 32 条
  • [31] Synchronization between frontal eye field and area V4 during free-gaze visual search
    Ting Yan
    Hui-Hui Zhou
    Zoological Research, 2019, 40 (05) : 394 - 403
  • [32] Gaze behaviors during free viewing revealed differences in visual salience processing across four major psychiatric disorders: a mega-analysis study of 1012 individuals
    Miura, Kenichiro
    Yoshida, Masatoshi
    Morita, Kentaro
    Fujimoto, Michiko
    Yasuda, Yuka
    Yamamori, Hidenaga
    Takahashi, Junichi
    Miyata, Seiko
    Okazaki, Kosuke
    Matsumoto, Junya
    Toyomaki, Atsuto
    Makinodan, Manabu
    Hashimoto, Naoki
    Onitsuka, Toshiaki
    Kasai, Kiyoto
    Ozaki, Norio
    Hashimoto, Ryota
    MOLECULAR PSYCHIATRY, 2024, : 1594 - 1600