Multi-level structured hybrid forest for joint head detection and pose estimation

Cited by: 18
Authors
Liu, Yuanyuan [1]
Xie, Zhong [1]
Yuan, Xiaohui [2]
Chen, Jingying [3]
Song, Wu [3]
Affiliations
[1] China Univ Geosci, Fac Informat Engn, Wuhan, Hubei, Peoples R China
[2] Univ North Texas, Dept Comp Sci & Engn, Denton, TX USA
[3] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan, Hubei, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
Multi-level structured hybrid forest; Head pose estimation; Head detection; Joint detection-estimation; Multiple structured features; FRAMEWORK;
DOI
10.1016/j.neucom.2017.05.033
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In real-world applications, factors such as illumination variation, occlusion, and poor image quality make head detection and pose estimation much more challenging. In this paper, we propose a multi-level structured hybrid forest (MSHF) for joint head detection and pose estimation. Our method extends the hybrid framework of classification and regression forests by introducing multi-level splitting functions and multi-structured features. Multi-level splitting functions are used to construct trees in different layers of MSHF. Multi-structured features are extracted from randomly selected image patches, which belong either to the head region or to the background. The head contour is derived from these patches by MSHF regression on the signed distance of the patch center to the head contour. Randomly selected sub-regions from the patches within the head contour are used to develop the MSHF for head pose estimation in a coarse-to-fine manner. The weighted neighbor structured aggregation integrates votes from trees to achieve an estimation of continuous pose angles. Experiments were conducted using public datasets and video streams. Compared to state-of-the-art methods, MSHF achieved improved performance and great robustness, with an average accuracy of 90% and an average angular error of 6.6 degrees. The average time for performing joint head detection and pose estimation is about 0.44 s. © 2017 Elsevier B.V. All rights reserved.
Pages: 206-215
Page count: 10