Head pose estimation with uncertainty and an application to dyadic interaction detection

被引:2
作者
Tomenotti, Federico Figari [1 ]
Noceti, Nicoletta [1 ]
Odone, Francesca [1 ]
机构
[1] Univ Genoa, MaLGa DIBRIS, Via Dodecaneso 35, I-16146 Genoa, Italy
关键词
Head pose estimation; Multi-task regression; Neural networks; Heteroscedastic uncertainty; Dyadic interaction detection; PEOPLE LOOKING; GAZE; COMMUNICATION; MODEL;
D O I
10.1016/j.cviu.2024.103999
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining the visual focus of attention of people in a scene is a fundamental cue to understand social interactions from videos. Gaze direction is ideal for determining eye contact, a basic cue of non-verbal communication, but it is not always easy to recognize. Head direction is a well-known proxy of gaze direction, more robust to the variability of the scene, thus offering a valuable alternative. In this work, we consider HHP-net, a method for estimating the head direction from single frames based on a heteroscedastic neural network to estimate people's head pose from a minimal set of head key points. We formulate the problem as a multi -task regression, to predict the pose as a triplet of Euler angles from the output of a 2D pose estimator. HHP-net also provides a measure of the aleatoric heteroscedastic uncertainties associated with the angles, through an ad -hoc loss function we introduce. In a thorough experimental analysis, we show that our model is efficient and effective compared with the state of the art, with only similar to 2 degrees of degradation in the worst case counterbalanced by a space occupation similar to 12 times smaller. We also show the beneficial effects of uncertainty on interpretability. Finally, we discuss the robustness of our method to input variability, showing that it can be seen as a plug-in to different pose estimators. As a proof -of -concept, we address social interaction analysis, with an algorithm to detect dyadic interactions in images.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] PHOW Based Feature Detection For Head Pose Estimation
    Jian, Wang
    Hua, Van
    Jing, Li
    Ping, Xia
    2015 IEEE 16TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2015, : 437 - 440
  • [2] HeadDiff: Exploring Rotation Uncertainty With Diffusion Models for Head Pose Estimation
    Wang, Yaoxing
    Liu, Hao
    Feng, Yaowei
    Li, Zhendong
    Wu, Xiangjuan
    Zhu, Congcong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1868 - 1882
  • [3] Fast Head Pose Estimation for Human-Computer Interaction
    Garcia-Montero, Mario
    Redondo-Cabrera, Carolina
    Lopez-Sastre, Roberto
    Tuytelaars, Tinne
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 101 - 110
  • [4] Gaze Detection Based on Head Pose Estimation in Smart TV
    Dat Tien Nguyen
    Shin, Kwang Yong
    Lee, Won Oh
    Kim, Yeong Gon
    Kim, Ki Wan
    Hong, Hyung Gil
    Park, Kang Ryoung
    Oh, CheonIn
    Lee, HanKyu
    Jeong, Youngho
    2013 INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2013): FUTURE CREATIVE CONVERGENCE TECHNOLOGIES FOR NEW ICT ECOSYSTEMS, 2013, : 283 - 288
  • [5] Integrating perceptual level of detail with head-pose estimation and its uncertainty
    Javier E. Martinez
    Ali Erol
    George Bebis
    Richard Boyle
    Xander Twombly
    Machine Vision and Applications, 2009, 21
  • [6] Integrating perceptual level of detail with head-pose estimation and its uncertainty
    Martinez, Javier E.
    Erol, Ali
    Bebis, George
    Boyle, Richard
    Twombly, Xander
    MACHINE VISION AND APPLICATIONS, 2009, 21 (01) : 69 - 83
  • [7] Camera Pose Estimation using Human Head Pose Estimation
    Fischer, Robert
    Hoedlmoser, Michael
    Gelautz, Margrit
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 877 - 886
  • [8] An improved head pose estimation method for the robotic wheelchair interaction control
    Xu, Guozheng
    Xu, Lei
    Lv, Cheng
    Zhu, Bo
    2017 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE ROBIO 2017), 2017, : 1589 - 1593
  • [9] Head Pose Estimation in Computer Vision: A Survey
    Murphy-Chutorian, Erik
    Trivedi, Mohan Manubhai
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (04) : 607 - 626
  • [10] Collaborative learning network for head pose estimation
    Xia, Haiying
    Liu, Gan
    Xu, Luhui
    Gan, Yanling
    IMAGE AND VISION COMPUTING, 2022, 127