Improving head pose estimation using two-stage ensembles with top-k regression

被引:37
作者
Huang, Bin [1 ]
Chen, Renwen [1 ]
Xu, Wang [1 ]
Zhou, Qinbang [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, State Key Lab Mech & Control Mech Struct, Nanjing 210016, Peoples R China
基金
美国国家科学基金会;
关键词
3D head pose estimation; Average top-k regression; Task-dependent weights; Two-stage ensembles;
D O I
10.1016/j.imavis.2019.11.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional head pose estimation methods are regarded as a classification or regression paradigm, individually. The accuracy of classification-based approaches is limited to pose quantized interval and regression-based methods are fragile due to extremely large pose in non-ideal conditions. On the contrary to these methods, this paper introduces a novel head pose estimation method using two-stage ensembles with average top-k regression. The first stage is a binned classification subtask with the optimal pose partition. The second stage achieves average top-k regression based on the former prediction. Then we combine the two subtasks by considering the task-dependent weights instead of setting coefficients by grid search. We conduct several experiments to analyze the optimal pose partition for classification part and to validate the average top-k loss for regression part. Furthermore, we report the performance of proposed method on MW, AFLW2000 and BIWI datasets and results show rather competitive performance in head pose prediction. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:8
相关论文
共 37 条
  • [1] Real-time head pose estimation using multi-task deep neural network
    Ahn, Byungtae
    Choi, Dong-Geol
    Park, Jaesik
    Kweon, In So
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 103 : 1 - 12
  • [2] Real-Time Head Orientation from a Monocular Camera Using Deep Neural Network
    Ahn, Byungtae
    Park, Jaesik
    Kweon, In So
    [J]. COMPUTER VISION - ACCV 2014, PT III, 2015, 9005 : 82 - 96
  • [3] [Anonymous], ARXIV150703148
  • [4] [Anonymous], P 1 IEEE INT WORKSH
  • [5] BenAbdelkader C, 2010, LECT NOTES COMPUT SC, V6316, P518, DOI 10.1007/978-3-642-15567-3_38
  • [6] Brown LM, 2002, IEEE WORKSHOP ON MOTION AND VIDEO COMPUTING (MOTION 2002), PROCEEDINGS, P125, DOI 10.1109/MOTION.2002.1182224
  • [7] How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks)
    Bulat, Adrian
    Tzimiropoulos, Georgios
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1021 - 1030
  • [8] Fan Y., 2017, ARXIV170508826
  • [9] Random Forests for Real Time 3D Face Analysis
    Fanelli, Gabriele
    Dantone, Matthias
    Gall, Juergen
    Fossati, Andrea
    Van Gool, Luc
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 101 (03) : 437 - 458
  • [10] A Two-Layer Framework for Piecewise Linear Manifold-Based Head Pose Estimation
    Foytik, Jacob
    Asari, Vijayan K.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 101 (02) : 270 - 287