Robust Head Pose Estimation Using Extreme Gradient Boosting Machine on Stacked Autoencoders Neural Network

被引:10
作者
Minh Thanh Vo [1 ]
Trang Nguyen [2 ]
Tuong Le [3 ]
机构
[1] Duy Tan Univ, Inst Res & Dev, Da Nang 550000, Vietnam
[2] Ho Chi Minh City Open Univ, Fac Informat Technol, Ho Chi Minh City 700000, Vietnam
[3] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh 700000, Vietnam
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Head pose estimation; autoencoder; feature reduction; gradient boosting; global features; DIMENSIONALITY REDUCTION; SMILE DETECTION; LEAST-SQUARES; REGRESSION;
D O I
10.1109/ACCESS.2019.2962974
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Head pose estimation is an important sign in helping robots and other intelligence machines understand human. It plays a vital role in designing human computer interaction systems because many applications rely on precise results of head pose angles such as human behavior analysis, gaze estimation, 3D head reconstruction etc. This study presents a robust approach for estimating the head pose angles in a single image. More specifically, the proposed system first encodes the global features extracted from Histogram of Oriented Gradients in a multi stacked autoencoders neural network. Based on the hidden nodes in deep layers, Autoencoder has been proposed for feature reduction while maintaining the key information of data. A scalable gradient boosting machine is then employed to train and classify the embedded features. Experiences have evaluated on the Pointing 04 dataset and show that the proposed approach outperforms the state-of-the-art methods with the low head pose angle errors in pitch and yaw as 6.16 degrees and 7.17 degrees, respectively.
引用
收藏
页码:3687 / 3694
页数:8
相关论文
共 53 条
  • [1] Abdi H., 2003, Encyclopedia for research methods for the social sciences, P792
  • [2] Al Haj M, 2012, PROC CVPR IEEE, P2602, DOI 10.1109/CVPR.2012.6247979
  • [3] A membrane-inspired bat algorithm to recognize faces in unconstrained scenarios
    Alsalibi, Bisan
    Venkat, Ibrahim
    Al-Betar, Mohammed Azmi
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 64 : 242 - 260
  • [4] [Anonymous], P INT WORKSH VIS OBS
  • [5] [Anonymous], 2008, Advances in neural information processing systems
  • [6] Bishop Christopher M., 2006, BISHOP PATTERN RECOG, DOI DOI 10.1117/1.2819119
  • [7] Chen T., 2016, KDD16 P 22 ACM, P785, DOI [DOI 10.1145/2939672.2939785, 10.1145/2939672.2939785]
  • [8] View-based active appearance models
    Cootes, TF
    Wheeler, GV
    Walker, KN
    Taylor, CJ
    [J]. IMAGE AND VISION COMPUTING, 2002, 20 (9-10) : 657 - 664
  • [9] ELM based smile detection using Distance Vector
    Cui, Dongshun
    Huang, Guang-Bin
    Liu, Tianchi
    [J]. PATTERN RECOGNITION, 2018, 79 : 356 - 369
  • [10] Robust Head-Pose Estimation Based on Partially-Latent Mixture of Linear Regressions
    Drouard, Vincent
    Horaud, Radu
    Deleforge, Antoine
    Ba, Sileye
    Evangelidis, Georgios
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (03) : 1428 - 1440