Real-time 6DoF full-range markerless head pose estimation

被引:5
|
作者
Algabri, Redhwan [1 ]
Shin, Hyunsoo [2 ]
Lee, Sungon [3 ]
机构
[1] Hanyang Univ, Res Inst Engn & Technol, Ansan 15588, South Korea
[2] Hanyang Univ, Dept Elect & Elect Engn, Ansan 15588, South Korea
[3] Hanyang Univ, Dept Robot, Ansan 15588, South Korea
基金
新加坡国家研究基金会;
关键词
Head pose estimation; Full-range angles; 6DoF poses; Landmark-free; Deep learning;
D O I
10.1016/j.eswa.2023.122293
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Head pose estimation is a fundamental function for several applications in human-computer interactions. Accurate six degrees of freedom head pose estimation (6DoF-HPE) with full-range angles make up most of these applications, which require sequential images of the human head as input. Most existing head pose estimation methods focus on a three degrees of freedom (3DoF) frontal head, which restricts their applications in real-world scenarios. This study presents a framework designed to estimate a head pose without landmark localization. The novelty of our framework is to estimate the 6DoF head poses under full-range angles in real-time. The proposed framework leverages deep neural networks to detect human heads and predict their angles using single shot multibox detector (SSD) and RepVGG-b1g4 backbone, respectively. This work uses red, green, blue, and depth (RGB-D) data to estimate the rotational and translational components relative to the camera pose. The proposed framework employs a continuous representation to predict the angles and a multi-loss approach to update the loss functions for the training strategy. The regression function combines the geodesic loss with the mean squared error. The ground-truth labels were extracted from the public dataset Carnegie Mellon university (CMU) Panoptic for full head angles. This study provides a comprehensive comparison with state-of-the-art methods using public benchmark datasets. Experiments demonstrate that the proposed method achieves or outperforms state-of-the-art methods. The code and datasets are available at: (https://github.com/Redhwan-A/6DoFHPE).
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Combined Framework for Real-time Head Pose Estimation using Facial Landmark Detection and Salient Feature Tracking
    Barros, Jilliam Maria Diaz
    Garcia, Frederic
    Mirbach, Bruno
    Varanasi, Kiran
    Stricker, Didier
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP, 2018, : 123 - 133
  • [32] Real-time masked face classification and head pose estimation for RGB facial image via knowledge distillation
    Chien Thai
    Viet Tran
    Minh Bui
    Dat Nguyen
    Huong Ninh
    Hai Tran
    INFORMATION SCIENCES, 2022, 616 : 330 - 347
  • [33] Real-Time Energy Efficient Hand Pose Estimation: A Case Study
    Al Koutayni, Mhd Rashed
    Rybalkin, Vladimir
    Malik, Jameel
    Elhayek, Ahmed
    Weis, Christian
    Reis, Gerd
    Wehn, Norbert
    Stricker, Didier
    SENSORS, 2020, 20 (10)
  • [34] Real-time yoga pose classification with 3-D pose estimation model with LSTM
    Ratnesh Prasad Srivastava
    Lokendra Singh Umrao
    Ramjeet Singh Yadav
    Multimedia Tools and Applications, 2024, 83 : 33019 - 33030
  • [35] Real-time yoga pose classification with 3-D pose estimation model with LSTM
    Srivastava, Ratnesh Prasad
    Umrao, Lokendra Singh
    Yadav, Ramjeet Singh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 33019 - 33030
  • [36] Refining Weights for Enhanced Object Similarity in Multi-perspective 6Dof Pose Estimation and 3D Object Detection
    Kusumo, Budiarianto Suryo
    Thomas, Ulrike
    DEEP LEARNING THEORY AND APPLICATIONS, PT I, DELTA 2024, 2024, 2171 : 310 - 327
  • [37] Towards Real-Time Head Pose Estimation: Exploring Parameter-Reduced Residual Networks on In-the-wild Datasets
    Rieger, Ines
    Hauenstein, Thomas
    Hettenkofer, Sebastian
    Garbas, Jens-Uwe
    ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: FROM THEORY TO PRACTICE, 2019, 11606 : 123 - 134
  • [38] Real-Time Accurate 3D Head Tracking and Pose Estimation with Consumer RGB-D Cameras
    David Joseph Tan
    Federico Tombari
    Nassir Navab
    International Journal of Computer Vision, 2018, 126 : 158 - 183
  • [39] Real-time Head Pose Estimation for Driver Assistance System Using Low-Cost On-Board Computer
    Yin, Chao
    Yang, Xubo
    PROCEEDINGS VRCAI 2016: 15TH ACM SIGGRAPH CONFERENCE ON VIRTUAL-REALITY CONTINUUM AND ITS APPLICATIONS IN INDUSTRY, 2016, : 43 - 46
  • [40] Real-Time Accurate 3D Head Tracking and Pose Estimation with Consumer RGB-D Cameras
    Tan, David Joseph
    Tombari, Federico
    Navab, Nassir
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (2-4) : 158 - 183