Head pose estimation with uncertainty and an application to dyadic interaction detection

被引:2
作者
Tomenotti, Federico Figari [1 ]
Noceti, Nicoletta [1 ]
Odone, Francesca [1 ]
机构
[1] Univ Genoa, MaLGa DIBRIS, Via Dodecaneso 35, I-16146 Genoa, Italy
关键词
Head pose estimation; Multi-task regression; Neural networks; Heteroscedastic uncertainty; Dyadic interaction detection; PEOPLE LOOKING; GAZE; COMMUNICATION; MODEL;
D O I
10.1016/j.cviu.2024.103999
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining the visual focus of attention of people in a scene is a fundamental cue to understand social interactions from videos. Gaze direction is ideal for determining eye contact, a basic cue of non-verbal communication, but it is not always easy to recognize. Head direction is a well-known proxy of gaze direction, more robust to the variability of the scene, thus offering a valuable alternative. In this work, we consider HHP-net, a method for estimating the head direction from single frames based on a heteroscedastic neural network to estimate people's head pose from a minimal set of head key points. We formulate the problem as a multi -task regression, to predict the pose as a triplet of Euler angles from the output of a 2D pose estimator. HHP-net also provides a measure of the aleatoric heteroscedastic uncertainties associated with the angles, through an ad -hoc loss function we introduce. In a thorough experimental analysis, we show that our model is efficient and effective compared with the state of the art, with only similar to 2 degrees of degradation in the worst case counterbalanced by a space occupation similar to 12 times smaller. We also show the beneficial effects of uncertainty on interpretability. Finally, we discuss the robustness of our method to input variability, showing that it can be seen as a plug-in to different pose estimators. As a proof -of -concept, we address social interaction analysis, with an algorithm to detect dyadic interactions in images.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] MIXTURE OF RELATED REGRESSIONS FOR HEAD POSE ESTIMATION
    Pan, Lili
    Liu, Risheng
    Xie, Mie
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 3647 - 3651
  • [42] Head Pose Estimation Patterns as Deepfake Detectors
    Becattini, Federico
    Bisogni, Carmen
    Loia, Vincenzo
    Pero, Chiara
    Hao, Fei
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (11)
  • [43] Head pose estimation based on tensor factorization
    Yang, Wenlu
    Zhang, Liqing
    Zhu, Wenjun
    NEURAL INFORMATION PROCESSING, PART I, 2008, 4984 : 831 - 840
  • [44] Head Pose Estimation Using Deep Architectures
    Felea, Iulian-Ionut
    Florea, Laura
    Florea, Corneliu
    Vertan, Constantin
    2018 12TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2018, : 505 - 508
  • [45] Head pose estimation method based on pose manifold and tensor decomposition
    Wei, Wei
    Zhang, Yanning
    Tian, Chunna
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2010, 21 (05) : 907 - 913
  • [46] Domain Adaptation for Head Pose Estimation Using Relative Pose Consistency
    Kuhnke, Felix
    Ostermann, Joern
    IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2023, 5 (03): : 348 - 359
  • [47] User's Gaze Tracking System and Its Application using Head Pose Estimation
    Kim, Hyunduk
    Sohn, Myoung-Kyu
    Kim, Dong-Ju
    Ryu, Nuri
    2014 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, MODELLING AND SIMULATION, 2014, : 166 - 171
  • [48] An Approach for Fast Human Head Pose Estimation
    Yari, Yessenia
    Scharcanski, Jacob
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2011, 2011, 8063
  • [49] . Robust Stereoscopic Head Pose Estimation in Human-Computer Interaction and a Unified Evaluation Framework
    Layher, Georg
    Liebau, Hendrik
    Niese, Robert
    Al-Hamadi, Ayoub
    Michaelis, Bernd
    Neumann, Heiko
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2011, PT I, 2011, 6978 : 227 - 236
  • [50] Deep learning and machine learning techniques for head pose estimation: a survey
    Algabri, Redhwan
    Abdu, Ahmed
    Lee, Sungon
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (10)