Head pose estimation with uncertainty and an application to dyadic interaction detection

被引:2
作者
Tomenotti, Federico Figari [1 ]
Noceti, Nicoletta [1 ]
Odone, Francesca [1 ]
机构
[1] Univ Genoa, MaLGa DIBRIS, Via Dodecaneso 35, I-16146 Genoa, Italy
关键词
Head pose estimation; Multi-task regression; Neural networks; Heteroscedastic uncertainty; Dyadic interaction detection; PEOPLE LOOKING; GAZE; COMMUNICATION; MODEL;
D O I
10.1016/j.cviu.2024.103999
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining the visual focus of attention of people in a scene is a fundamental cue to understand social interactions from videos. Gaze direction is ideal for determining eye contact, a basic cue of non-verbal communication, but it is not always easy to recognize. Head direction is a well-known proxy of gaze direction, more robust to the variability of the scene, thus offering a valuable alternative. In this work, we consider HHP-net, a method for estimating the head direction from single frames based on a heteroscedastic neural network to estimate people's head pose from a minimal set of head key points. We formulate the problem as a multi -task regression, to predict the pose as a triplet of Euler angles from the output of a 2D pose estimator. HHP-net also provides a measure of the aleatoric heteroscedastic uncertainties associated with the angles, through an ad -hoc loss function we introduce. In a thorough experimental analysis, we show that our model is efficient and effective compared with the state of the art, with only similar to 2 degrees of degradation in the worst case counterbalanced by a space occupation similar to 12 times smaller. We also show the beneficial effects of uncertainty on interpretability. Finally, we discuss the robustness of our method to input variability, showing that it can be seen as a plug-in to different pose estimators. As a proof -of -concept, we address social interaction analysis, with an algorithm to detect dyadic interactions in images.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Cross-Cascading Regression for Simultaneous Head Pose Estimation and Facial Landmark Detection
    Zhang, Wei
    Zhang, Hongwen
    Li, Qi
    Liu, Fei
    Sun, Zhenan
    Li, Xin
    Wan, Xinxin
    BIOMETRIC RECOGNITION, CCBR 2018, 2018, 10996 : 148 - 156
  • [32] Quantitative Evaluation of Face Detection and Tracking Algorithms for Head Pose Estimation in Mobile Platforms
    Welivita, Anuradha
    Nimalsiri, Nanduni
    Wickramasinghe, Ruchiranga
    Pathirana, Upekka
    Gamage, Chandana
    2017 3RD INTERNATIONAL MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON), 2017, : 310 - 315
  • [33] Multi-level structured hybrid forest for joint head detection and pose estimation
    Liu, Yuanyuan
    Xie, Zhong
    Yuan, Xiaohui
    Chen, Jingying
    Song, Wu
    NEUROCOMPUTING, 2017, 266 : 206 - 215
  • [34] Head Pose Estimation Based on Head Tracking and the Kalman Filter
    Yu, Wang
    Gang, Liu
    2011 INTERNATIONAL CONFERENCE ON PHYSICS SCIENCE AND TECHNOLOGY (ICPST), 2011, 22 : 420 - 427
  • [35] Exploiting Fuzzy Approximator to Head Pose Estimation
    Baradaran-Khalkhali, Maryam
    Shekofteh, S. Kazem
    Toosizadeh, Saeed
    Akbarzadeh-T, Mohammad-R.
    SPA 2010: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2010, : 68 - +
  • [36] Deep Learning for Head Pose Estimation: A Survey
    Asperti A.
    Filippini D.
    SN Computer Science, 4 (4)
  • [37] Learning toward practical head pose estimation
    Sang, Gaoli
    He, Feixiang
    Zhu, Rong
    Xuan, Shibin
    OPTICAL ENGINEERING, 2017, 56 (08)
  • [38] Online Learning State Evaluation Method Based on Face Detection and Head Pose Estimation
    Li, Bin
    Liu, Peng
    SENSORS, 2024, 24 (05)
  • [39] Driver Fatigue Detection Based on Residual Channel Attention Network and Head Pose Estimation
    Ye, Mu
    Zhang, Weiwei
    Cao, Pengcheng
    Liu, Kangan
    APPLIED SCIENCES-BASEL, 2021, 11 (19):
  • [40] Comparative study of coarse head pose estimation
    Brown, LM
    Tian, YL
    IEEE WORKSHOP ON MOTION AND VIDEO COMPUTING (MOTION 2002), PROCEEDINGS, 2002, : 125 - 130