Benefiting from users' gaze: selection of image regions from eye tracking information for provided tags

被引:3
作者
Walber, Tina [1 ]
Scherp, Ansgar [1 ]
Staab, Steffen [1 ]
机构
[1] Univ Koblenz Landau, D-56070 Koblenz, Germany
关键词
Region identification; Region labeling; Gaze analysis; Eye tracking; Tagging;
D O I
10.1007/s11042-013-1390-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Providing image annotations is a tedious task. This becomes even more cumbersome when objects shall be annotated in the images. Such region-based annotations can be used in various ways like similarity search or as training set in automatic object detection. We investigate the principle idea of finding objects in images by looking at gaze paths from users, viewing images with an interest in a specific object. We have analyzed 799 gaze paths from 30 subjects viewing image-tag-pairs with the task to decide whether a tag could be found in the image or not. We have compared 13 different fixation measures analyzing the gaze paths. The best performing fixation measure is able to correctly assign a tag to a region for 63 % of the image-tag-pairs and significantly outperforms three baselines. We look into details of the image region characteristics such as the position and size for incorrect and correct assignments. The influence of aggregating multiple gaze paths from several subjects with respect to improving the precision of identifying the correct regions is also investigated. In addition, we look into the possibilities of discriminating different regions in the same image. Here, we are able to correctly identify two regions in the same image from different primings with an accuracy of 38 %.
引用
收藏
页码:363 / 390
页数:28
相关论文
共 35 条
[1]  
[Anonymous], 2009, PROC 17 ACM INT C MU
[2]  
Bruneau D, 2002, P CHI, V2
[3]   A survey of free-form object representation and recognition techniques [J].
Campbell, RJ ;
Flynn, PJ .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2001, 81 (02) :166-210
[4]  
Castagnos Sylvain, 2010, P 4 ACM C REC SYST R, P29, DOI DOI 10.1145/1864708.1864717
[5]  
Duygulu P., 2006, COMPUTER VISIONECCV, V2002, P349
[6]  
Grabner H, 2011, PROC CVPR IEEE, P1529, DOI 10.1109/CVPR.2011.5995327
[7]  
Hajimirza S., 2010, IMAGE ANAL MULTIMEDI
[8]   A model of saliency-based visual attention for rapid scene analysis [J].
Itti, L ;
Koch, C ;
Niebur, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (11) :1254-1259
[9]  
Jaimes A, 2001, SPIE, DOI [10.1117/12.429507, DOI 10.1117/12.429507]
[10]  
Judd T, 2009, IEEE I CONF COMP VIS, P2106, DOI 10.1109/ICCV.2009.5459462