Head pose estimation with uncertainty and an application to dyadic interaction detection

被引:2
作者
Tomenotti, Federico Figari [1 ]
Noceti, Nicoletta [1 ]
Odone, Francesca [1 ]
机构
[1] Univ Genoa, MaLGa DIBRIS, Via Dodecaneso 35, I-16146 Genoa, Italy
关键词
Head pose estimation; Multi-task regression; Neural networks; Heteroscedastic uncertainty; Dyadic interaction detection; PEOPLE LOOKING; GAZE; COMMUNICATION; MODEL;
D O I
10.1016/j.cviu.2024.103999
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining the visual focus of attention of people in a scene is a fundamental cue to understand social interactions from videos. Gaze direction is ideal for determining eye contact, a basic cue of non-verbal communication, but it is not always easy to recognize. Head direction is a well-known proxy of gaze direction, more robust to the variability of the scene, thus offering a valuable alternative. In this work, we consider HHP-net, a method for estimating the head direction from single frames based on a heteroscedastic neural network to estimate people's head pose from a minimal set of head key points. We formulate the problem as a multi -task regression, to predict the pose as a triplet of Euler angles from the output of a 2D pose estimator. HHP-net also provides a measure of the aleatoric heteroscedastic uncertainties associated with the angles, through an ad -hoc loss function we introduce. In a thorough experimental analysis, we show that our model is efficient and effective compared with the state of the art, with only similar to 2 degrees of degradation in the worst case counterbalanced by a space occupation similar to 12 times smaller. We also show the beneficial effects of uncertainty on interpretability. Finally, we discuss the robustness of our method to input variability, showing that it can be seen as a plug-in to different pose estimators. As a proof -of -concept, we address social interaction analysis, with an algorithm to detect dyadic interactions in images.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Towards unsupervised learning of joint facial landmark detection and head pose estimation
    Zou, Zhiming
    Jia, Dian
    Tang, Wei
    PATTERN RECOGNITION, 2025, 162
  • [22] TRFH: towards real-time face detection and head pose estimation
    Chen, Shicun
    Zhang, Yong
    Yin, Baocai
    Wang, Boyue
    PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (04) : 1745 - 1755
  • [23] Robust head pose estimation based on key frames for human-machine interaction
    Madrigal, Francisco
    Lerasle, Frederic
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2020, 2020 (01)
  • [24] Robust head pose estimation based on key frames for human-machine interaction
    Francisco Madrigal
    Frederic Lerasle
    EURASIP Journal on Image and Video Processing, 2020
  • [25] Head pose estimation: An extensive survey on recent techniques and applications
    Abate, Andrea F.
    Bisogni, Carmen
    Castiglione, Aniello
    Nappi, Michele
    PATTERN RECOGNITION, 2022, 127
  • [26] Relational uncertainty and dyadic synchrony within the interaction of couples
    Knobloch-Fedders, Lynne M.
    Quirk, Kelley
    Knobloch, Leanne K.
    JOURNAL OF SOCIAL AND PERSONAL RELATIONSHIPS, 2024, 41 (04) : 867 - 891
  • [27] A NEW REPRESENTATION METHOD OF HEAD IMAGES FOR HEAD POSE ESTIMATION
    Liu, Xiangyang
    Lu, Hongtao
    Luo, Heng
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3585 - 3588
  • [28] Head pose estimation method based on pose manifold and tensor decomposition
    Wei Wei1
    2.School of Electronic Engineering
    Journal of Systems Engineering and Electronics, 2010, 21 (05) : 907 - 913
  • [29] Comparing Head and AR Glasses Pose Estimation
    Firintepe, Ahmet
    Dhaouadi, Oussema
    Pagani, Alain
    Stricker, Didier
    2021 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY ADJUNCT PROCEEDINGS (ISMAR-ADJUNCT 2021), 2021, : 109 - 114
  • [30] Isomorphic Loss Function for Head Pose Estimation
    Felea, Iulian
    Florea, Corneliu
    Vertan, Constantin
    Florea, Laura
    25. INTERNATIONAL CONFERENCE IN CENTRAL EUROPE ON COMPUTER GRAPHICS, VISUALIZATION AND COMPUTER VISION (WSCG 2017), 2017, 2701 : 89 - 94