Predicting visual fixations on video based on low-level visual features

被引：235

作者：

Le Meur, Olivier

Le Callet, Patrick

Barba, Dominique

机构：

[1] Thomson R&D, F-35511 Cessan Sevigne, France

[2] Univ Nantes, Ecole Polytech, CNRS, IRCCyN,UMR 6597, F-44306 Nantes, France

来源：

VISION RESEARCH | 2007年 / 47卷 / 19期

关键词：

salience; visual attention; eye movements; bottom-up; top-down; ATTENTION; MODEL; INTEGRATION; SELECTION; SEARCH; COLOR;

D O I：

10.1016/j.visres.2007.06.015

中图分类号：

Q189 [神经科学];

学科分类号：

071006 ;

摘要：

To what extent can a computational model of the bottom-up visual attention predict what an observer is looking at? What is the contribution of the low-level visual features in the attention deployment? To answer these questions, a new spatio-temporal computational model is proposed. This model incorporates several visual features; therefore, a fusion algorithm is required to combine the different saliency maps (achromatic, chromatic and temporal). To quantitatively assess the model performances, eye movements were recorded while naive observers viewed natural dynamic scenes. Four completing metrics have been used. In addition, predictions from the proposed model are compared to the predictions from a state of the art model [Itti's model (Itti, L., Koch, C., & Niebur, E. (1998). A model of saliency-based visual attention for rapid scene analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(11), 1254-1259)] and from three non-biologically plausible models (uniform, flicker and centered models). Regardless of the metric used, the proposed model shows significant improvement over the selected benchmarking models (except the centered model). Conclusions are drawn regarding both the influence of low-level visual features over time and the central bias in an eye tracking experiment. (C) 2007 Elsevier Ltd. All rights reserved.

引用

页码：2483 / 2498

页数：16

共 39 条

[1]

[Anonymous], 2005, ICIP

[2]

[Anonymous], 1999, Network: Computation in Neural Systems

[3] High-level aspects of oculomotor control during viewing of natural-task images [J].

Canosa, RL ;

Pelz, JB ;

Mennie, NR ;

Peak, J .

HUMAN VISION AND ELECTRONIC IMAGING VIII, 2003, 5007 :240-251

[4]

Carmi R., 2006, Proceedings. ETRA 2006. Symposium on Eye Tracking Research and Applications, P11, DOI 10.1145/1117309.1117313

[5]

Cover Thomas., 1983, Elements of Information Theory

[6] A hierarchical neural system with attentional top-down enhancement of the spatial resolution for object recognition [J].

Deco, G ;

Schürmann, B .

VISION RESEARCH, 2000, 40 (20) :2845-2859

[7]

Fecteau J.H., 2006, Trends in Cognitive Sciences, V10, P617

[8]

FENCSIK DE, 2005, VIS COGN, V14, P92

[9] Saccade target selection during visual search [J].

Findlay, JM .

VISION RESEARCH, 1997, 37 (05) :617-631

[10]

Hoffman J. E., 1998, Attention, P119

← 1 2 3 4 →