Automatic Gaze Analysis: A Survey of Deep Learning Based Approaches

被引：17

作者：

Ghosh, Shreya ^{[1
]}

Dhall, Abhinav ^{[2
,3
]}

Hayat, Munawar ^{[2
]}

Knibbe, Jarrod ^{[4
]}

Ji, Qiang ^{[5
]}

机构：

[1] Curtin Univ, Bentley, WA 6102, Australia

[2] Monash Univ, Clayton 3800, Australia

[3] Indian Inst Technol Ropar, Bara Phool 140001, Punjab, India

[4] Univ Melbourne, Parkville, Vic 3052, Australia

[5] Rensselaer Polytech Inst, Troy, NY 12180 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2024年 / 46卷 / 01期

关键词：

Automated gaze estimation; gaze analysis; gaze tracking; human computer interaction; unsupervised and self-supervised gaze analysis; EYE-GAZE; HEAD POSE; TRACKING TECHNIQUES; VISUAL-ATTENTION; NEURAL-NETWORK; PREDICTION; MOVEMENTS; COGNITION; PATTERNS; DATASET;

D O I：

10.1109/TPAMI.2023.3321337

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Eye gaze analysis is an important research problem in the field of Computer Vision and Human-Computer Interaction. Even with notable progress in the last 10 years, automatic gaze analysis still remains challenging due to the uniqueness of eye appearance, eye-head interplay, occlusion, image quality, and illumination conditions. There are several open questions, including what are the important cues to interpret gaze direction in an unconstrained environment without prior knowledge and how to encode them in real-time. We review the progress across a range of gaze analysis tasks and applications to elucidate these fundamental questions, identify effective methods in gaze analysis, and provide possible future directions. We analyze recent gaze estimation and segmentation methods, especially in the unsupervised and weakly supervised domain, based on their advantages and reported evaluation metrics. Our analysis shows that the development of a robust and generic gaze analysis method still needs to address real-world challenges such as unconstrained setup and learning with less supervision. We conclude by discussing future research directions for designing a real-world gaze analysis system that can propagate to other domains including Computer Vision, Augmented Reality (AR), Virtual Reality (VR), and Human Computer Interaction (HCI).

引用

页码：61 / 84

页数：24

共 228 条

[1] Examining Design Choices of Questionnaires in VR User Studies
Alexandrovsky, Dmitry
Putze, Susanne
Bonfert, Michael
Hoeffner, Sebastian
Michelmann, Pitt
Wenig, Dirk
Malaka, Rainer
Smeddinck, Jan David
[J]. PROCEEDINGS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'20), 2020,
[2] Alghowinem S, 2013, IEEE IMAGE PROC, P4220, DOI 10.1109/ICIP.2013.6738869
[3] [Anonymous], 2013, P 2013 C EYE TRACK S
[4] [Anonymous], 2013, OKAO vision
[5] Avital O., 2015, US Patent App, Patent No. [14/681,083, 14681083]
[6] A survey of augmented reality
Azuma, RT
[J]. PRESENCE-VIRTUAL AND AUGMENTED REALITY, 1997, 6 (04): : 355 - 385
[7] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[8] Baltrusaitis T, 2016, IEEE WINT CONF APPL
[9] OpenFace 2.0: Facial Behavior Analysis Toolkit
Baltrusaitis, Tadas
Zadeh, Amir
Lim, Yao Chong
Morency, Louis-Philippe
[J]. PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 59 - 66
[10] ESCNet: Gaze Target Detection with the Understanding of 3D Scenes
Bao, Jun
Liu, Buyu
Yu, Jun
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14106 - 14115

← 1 2 3 4 5 6 7 8 9 10 →