Head pose estimation with uncertainty and an application to dyadic interaction detection

Cited by: 2

Authors:

Tomenotti, Federico Figari [1]

Noceti, Nicoletta [1]

Odone, Francesca [1]

Affiliations:

[1] Univ Genoa, MaLGa DIBRIS, Via Dodecaneso 35, I-16146 Genoa, Italy

Source:

COMPUTER VISION AND IMAGE UNDERSTANDING | 2024 / Vol. 243

Keywords:

Head pose estimation; Multi-task regression; Neural networks; Heteroscedastic uncertainty; Dyadic interaction detection; PEOPLE LOOKING; GAZE; COMMUNICATION; MODEL;

DOI:

10.1016/j.cviu.2024.103999

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Determining the visual focus of attention of people in a scene is a fundamental cue for understanding social interactions from videos. Gaze direction is ideal for determining eye contact, a basic cue of non-verbal communication, but it is not always easy to recognize. Head direction is a well-known proxy for gaze direction that is more robust to the variability of the scene, and thus offers a valuable alternative. In this work, we consider HHP-net, a heteroscedastic neural network that estimates people's head pose from single frames, using a minimal set of head key points. We formulate the problem as a multi-task regression that predicts the pose as a triplet of Euler angles from the output of a 2D pose estimator. HHP-net also provides a measure of the aleatoric heteroscedastic uncertainties associated with the angles, through an ad hoc loss function we introduce. In a thorough experimental analysis, we show that our model is efficient and effective compared with the state of the art, with only about 2 degrees of degradation in the worst case, counterbalanced by a space occupation about 12 times smaller. We also show the beneficial effects of uncertainty on interpretability. Finally, we discuss the robustness of our method to input variability, showing that it can be used as a plug-in to different pose estimators. As a proof of concept, we address social interaction analysis with an algorithm to detect dyadic interactions in images.
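
The abstract does not spell out the ad hoc loss, so the following is only a minimal sketch of the general idea of heteroscedastic regression, assuming the widely used Gaussian negative log-likelihood form in which the network predicts, for each Euler angle, both a value and a log-variance. The function name `heteroscedastic_loss` and the (yaw, pitch, roll) ordering are illustrative assumptions and should not be read as the authors' formulation.

```python
import math

def heteroscedastic_loss(y_true, y_pred, log_var):
    """Gaussian negative log-likelihood (up to an additive constant).

    ASSUMPTION: this is the generic heteroscedastic regression loss,
    not the specific loss introduced for HHP-net.

    y_true, y_pred: sequences of three angles in degrees, e.g. (yaw, pitch, roll).
    log_var: predicted log-variance for each angle (the uncertainty output).
    """
    total = 0.0
    for target, pred, s in zip(y_true, y_pred, log_var):
        # The squared error is down-weighted when the predicted variance is
        # large; the 0.5 * s term penalizes claiming large variance everywhere.
        total += 0.5 * math.exp(-s) * (target - pred) ** 2 + 0.5 * s
    return total

# Hypothetical values: a 10-degree error on the first angle only.
truth = [30.0, 0.0, -5.0]
guess = [20.0, 0.0, -5.0]

confident = heteroscedastic_loss(truth, guess, [0.0, 0.0, 0.0])
uncertain = heteroscedastic_loss(truth, guess, [math.log(100.0), 0.0, 0.0])
print(confident, uncertain)
```

In this sketch, a wrong prediction costs less when the model also reports high uncertainty for that angle, which is what allows the uncertainty output to flag unreliable estimates.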


Pages: 14
