Automatic Gaze Analysis: A Survey of Deep Learning Based Approaches

被引:17
作者
Ghosh, Shreya [1 ]
Dhall, Abhinav [2 ,3 ]
Hayat, Munawar [2 ]
Knibbe, Jarrod [4 ]
Ji, Qiang [5 ]
机构
[1] Curtin Univ, Bentley, WA 6102, Australia
[2] Monash Univ, Clayton 3800, Australia
[3] Indian Inst Technol Ropar, Bara Phool 140001, Punjab, India
[4] Univ Melbourne, Parkville, Vic 3052, Australia
[5] Rensselaer Polytech Inst, Troy, NY 12180 USA
关键词
Automated gaze estimation; gaze analysis; gaze tracking; human computer interaction; unsupervised and self-supervised gaze analysis; EYE-GAZE; HEAD POSE; TRACKING TECHNIQUES; VISUAL-ATTENTION; NEURAL-NETWORK; PREDICTION; MOVEMENTS; COGNITION; PATTERNS; DATASET;
D O I
10.1109/TPAMI.2023.3321337
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Eye gaze analysis is an important research problem in the field of Computer Vision and Human-Computer Interaction. Even with notable progress in the last 10 years, automatic gaze analysis still remains challenging due to the uniqueness of eye appearance, eye-head interplay, occlusion, image quality, and illumination conditions. There are several open questions, including what are the important cues to interpret gaze direction in an unconstrained environment without prior knowledge and how to encode them in real-time. We review the progress across a range of gaze analysis tasks and applications to elucidate these fundamental questions, identify effective methods in gaze analysis, and provide possible future directions. We analyze recent gaze estimation and segmentation methods, especially in the unsupervised and weakly supervised domain, based on their advantages and reported evaluation metrics. Our analysis shows that the development of a robust and generic gaze analysis method still needs to address real-world challenges such as unconstrained setup and learning with less supervision. We conclude by discussing future research directions for designing a real-world gaze analysis system that can propagate to other domains including Computer Vision, Augmented Reality (AR), Virtual Reality (VR), and Human Computer Interaction (HCI).
引用
收藏
页码:61 / 84
页数:24
相关论文
共 228 条
  • [1] Examining Design Choices of Questionnaires in VR User Studies
    Alexandrovsky, Dmitry
    Putze, Susanne
    Bonfert, Michael
    Hoeffner, Sebastian
    Michelmann, Pitt
    Wenig, Dirk
    Malaka, Rainer
    Smeddinck, Jan David
    [J]. PROCEEDINGS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'20), 2020,
  • [2] Alghowinem S, 2013, IEEE IMAGE PROC, P4220, DOI 10.1109/ICIP.2013.6738869
  • [3] [Anonymous], 2013, P 2013 C EYE TRACK S
  • [4] [Anonymous], 2013, OKAO vision
  • [5] Avital O., 2015, US Patent App, Patent No. [14/681,083, 14681083]
  • [6] A survey of augmented reality
    Azuma, RT
    [J]. PRESENCE-VIRTUAL AND AUGMENTED REALITY, 1997, 6 (04): : 355 - 385
  • [7] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [8] Baltrusaitis T, 2016, IEEE WINT CONF APPL
  • [9] OpenFace 2.0: Facial Behavior Analysis Toolkit
    Baltrusaitis, Tadas
    Zadeh, Amir
    Lim, Yao Chong
    Morency, Louis-Philippe
    [J]. PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 59 - 66
  • [10] ESCNet: Gaze Target Detection with the Understanding of 3D Scenes
    Bao, Jun
    Liu, Buyu
    Yu, Jun
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14106 - 14115