Predicting visual fixations on video based on low-level visual features

被引:232
作者
Le Meur, Olivier
Le Callet, Patrick
Barba, Dominique
机构
[1] Thomson R&D, F-35511 Cessan Sevigne, France
[2] Univ Nantes, Ecole Polytech, CNRS, IRCCyN,UMR 6597, F-44306 Nantes, France
关键词
salience; visual attention; eye movements; bottom-up; top-down; ATTENTION; MODEL; INTEGRATION; SELECTION; SEARCH; COLOR;
D O I
10.1016/j.visres.2007.06.015
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
To what extent can a computational model of the bottom-up visual attention predict what an observer is looking at? What is the contribution of the low-level visual features in the attention deployment? To answer these questions, a new spatio-temporal computational model is proposed. This model incorporates several visual features; therefore, a fusion algorithm is required to combine the different saliency maps (achromatic, chromatic and temporal). To quantitatively assess the model performances, eye movements were recorded while naive observers viewed natural dynamic scenes. Four completing metrics have been used. In addition, predictions from the proposed model are compared to the predictions from a state of the art model [Itti's model (Itti, L., Koch, C., & Niebur, E. (1998). A model of saliency-based visual attention for rapid scene analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(11), 1254-1259)] and from three non-biologically plausible models (uniform, flicker and centered models). Regardless of the metric used, the proposed model shows significant improvement over the selected benchmarking models (except the centered model). Conclusions are drawn regarding both the influence of low-level visual features over time and the central bias in an eye tracking experiment. (C) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2483 / 2498
页数:16
相关论文
共 39 条
  • [1] [Anonymous], 2005, ICIP
  • [2] [Anonymous], 1999, Network: Computation in Neural Systems
  • [3] High-level aspects of oculomotor control during viewing of natural-task images
    Canosa, RL
    Pelz, JB
    Mennie, NR
    Peak, J
    [J]. HUMAN VISION AND ELECTRONIC IMAGING VIII, 2003, 5007 : 240 - 251
  • [4] Carmi R., 2006, Proceedings. ETRA 2006. Symposium on Eye Tracking Research and Applications, P11, DOI 10.1145/1117309.1117313
  • [5] Cover Thomas., 1983, Elements of Information Theory
  • [6] A hierarchical neural system with attentional top-down enhancement of the spatial resolution for object recognition
    Deco, G
    Schürmann, B
    [J]. VISION RESEARCH, 2000, 40 (20) : 2845 - 2859
  • [7] Fecteau J.H., 2006, Trends in Cognitive Sciences, V10, P617
  • [8] FENCSIK DE, 2005, VIS COGN, V14, P92
  • [9] Saccade target selection during visual search
    Findlay, JM
    [J]. VISION RESEARCH, 1997, 37 (05) : 617 - 631
  • [10] Hoffman J. E., 1998, Attention, P119