Real-time 6DoF full-range markerless head pose estimation

被引:5
|
作者
Algabri, Redhwan [1 ]
Shin, Hyunsoo [2 ]
Lee, Sungon [3 ]
机构
[1] Hanyang Univ, Res Inst Engn & Technol, Ansan 15588, South Korea
[2] Hanyang Univ, Dept Elect & Elect Engn, Ansan 15588, South Korea
[3] Hanyang Univ, Dept Robot, Ansan 15588, South Korea
基金
新加坡国家研究基金会;
关键词
Head pose estimation; Full-range angles; 6DoF poses; Landmark-free; Deep learning;
D O I
10.1016/j.eswa.2023.122293
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Head pose estimation is a fundamental function for several applications in human-computer interactions. Accurate six degrees of freedom head pose estimation (6DoF-HPE) with full-range angles make up most of these applications, which require sequential images of the human head as input. Most existing head pose estimation methods focus on a three degrees of freedom (3DoF) frontal head, which restricts their applications in real-world scenarios. This study presents a framework designed to estimate a head pose without landmark localization. The novelty of our framework is to estimate the 6DoF head poses under full-range angles in real-time. The proposed framework leverages deep neural networks to detect human heads and predict their angles using single shot multibox detector (SSD) and RepVGG-b1g4 backbone, respectively. This work uses red, green, blue, and depth (RGB-D) data to estimate the rotational and translational components relative to the camera pose. The proposed framework employs a continuous representation to predict the angles and a multi-loss approach to update the loss functions for the training strategy. The regression function combines the geodesic loss with the mean squared error. The ground-truth labels were extracted from the public dataset Carnegie Mellon university (CMU) Panoptic for full head angles. This study provides a comprehensive comparison with state-of-the-art methods using public benchmark datasets. Experiments demonstrate that the proposed method achieves or outperforms state-of-the-art methods. The code and datasets are available at: (https://github.com/Redhwan-A/6DoFHPE).
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Toward Robust and Unconstrained Full Range of Rotation Head Pose Estimation
    Hempel, Thorsten
    Abdelrahman, Ahmed A.
    Al-Hamadi, Ayoub
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2377 - 2387
  • [22] Real-Time Head Pose Estimation Based on Kalman Filter and Random Regression Forest
    Li C.
    Zhong F.
    Ma X.
    Qin X.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2017, 29 (12): : 2309 - 2316
  • [23] Real-Time Head Pose Estimation Using Multi-variate RVM on Faces in the Wild
    Selim, Mohamed
    Pagani, Alain
    Stricker, Didier
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2015, PT II, 2015, 9257 : 254 - 265
  • [24] Real-time head pose estimation using multi-task deep neural network
    Ahn, Byungtae
    Choi, Dong-Geol
    Park, Jaesik
    Kweon, In So
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 103 : 1 - 12
  • [25] intrApose: Monocular Driver 6 DOF Head Pose Estimation Leveraging Camera Intrinsics
    Roth, Markus
    Gavrila, Dariu M.
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (08): : 4057 - 4068
  • [26] A Real-Time Head Pose Estimation Using Adaptive POSIT Based on Modified Supervised Descent Method
    Zhao, Zhong-Qiu
    Cheng, Kewen
    Peng, Qinmu
    Wu, Xindong
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2016, PT I, 2016, 9771 : 74 - 85
  • [27] Real-time fall detection algorithm based on pose estimation
    Yu N.-G.
    Bai D.-G.
    Kongzhi yu Juece/Control and Decision, 2020, 35 (11): : 2761 - 2766
  • [28] SynPo-Net-Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training
    Su, Yongzhi
    Rambach, Jason
    Pagani, Alain
    Stricker, Didier
    SENSORS, 2021, 21 (01) : 1 - 16
  • [29] Detecting Object Surface Keypoints from a Single RGB Image via Deep Learning Network for 6DoF Pose Estimation
    Aing, Lee
    Lie, Wen-Nung
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 1673 - 1678
  • [30] A real-time vehicle safety system by concurrent object detection and head pose estimation via stereo vision
    Rodriguez-Quinonez, Julio C.
    Sanchez-Castro, Jonathan J.
    Real-Moreno, Oscar
    Galaviz, Guillermo
    Flores-Fuentes, Wendy
    Sergiyenko, Oleg
    Castro-Toscano, Moises J.
    Hernandez-Balbuena, Daniel
    HELIYON, 2024, 10 (16)