Deep learning models for webcam eye tracking in online experiments

Cited by: 7
Authors
Saxena, Shreshth [1 ,2 ]
Fink, Lauren K. [1 ,2 ,3 ]
Lange, Elke B. [1 ]
Affiliations
[1] Max Planck Inst Empir Aesthet, Mus Depart, Frankfurt, Germany
[2] McMaster Univ, Dept Psychol Neurosci & Behav, Hamilton, ON, Canada
[3] Max Planck NYU Ctr Language Mus & Emot, Frankfurt, Germany
Keywords
Online; Low resolution; Eye tracking; Deep learning; Computer vision; Eye gaze; Fixation; Free viewing; Smooth pursuit; Blinks;
DOI
10.3758/s13428-023-02190-6
Chinese Library Classification number
B841 [Research methods in psychology]
Discipline classification code
040201
Abstract
Eye tracking is prevalent in scientific and commercial applications. Recent computer vision and deep learning methods enable eye tracking with off-the-shelf webcams and reduce dependence on expensive, restrictive hardware. However, such deep learning methods have not yet been applied and evaluated for remote, online psychological experiments. In this study, we tackle critical challenges faced in remote eye tracking setups and systematically evaluate appearance-based deep learning methods of gaze tracking and blink detection. From their own homes and laptops, 65 participants performed a battery of eye tracking tasks including (i) fixation, (ii) zone classification, (iii) free viewing, (iv) smooth pursuit, and (v) blink detection. Webcam recordings of the participants performing these tasks were processed offline through appearance-based models of gaze and blink detection. The task battery required different eye movements that characterized gaze and blink prediction accuracy over a comprehensive list of measures. We find the best gaze accuracy to be 2.4° and precision of 0.47°, which outperforms previous online eye tracking studies and reduces the gap between laboratory-based and online eye tracking performance. We release the experiment template, recorded data, and analysis code with the motivation to escalate affordable, accessible, and scalable eye tracking that has the potential to accelerate research in the fields of psychological science, cognitive neuroscience, user experience design, and human-computer interfaces.
Pages: 3487-3503
Number of pages: 17