A Vector-based Representation to Enhance Head Pose Estimation

Cited by: 78
Authors
Cao, Zhiwen [1]
Chu, Zongcheng [1]
Liu, Dongfang [1]
Chen, Yingjie [1]
Affiliations
[1] Purdue Univ, Dept Comp Graph Technol, W Lafayette, IN 47907 USA
Source
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021) | 2021
Keywords
DOI
10.1109/WACV48630.2021.00123
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes using the three vectors of a rotation matrix as the representation for head pose estimation and develops a new neural network based on the characteristics of this representation. We address two potential issues in current head pose estimation work: (1) Public datasets for head pose estimation annotate data samples with either Euler angles or quaternions. Both annotations are discontinuous, which can degrade neural network training. (2) Most works report the Mean Absolute Error (MAE) of Euler angles as the performance measure. We show that MAE may not reflect actual behavior, especially for profile views. To solve these two problems, we propose a new annotation method that uses three vectors to describe head poses and a new measure, the Mean Absolute Error of Vectors (MAEV), to assess performance. We also train a new neural network to predict the three vectors under orthogonality constraints. Our proposed method achieves state-of-the-art results on both the AFLW2000 and BIWI datasets. Experiments show that our vector-based annotation method can effectively reduce prediction errors for large pose angles.
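The gap between Euler-angle MAE and a vector-based error can be illustrated with a minimal sketch. It is not the authors' implementation: the Euler convention (R = Rz(roll) · Ry(yaw) · Rx(pitch)), the use of the matrix columns as the three vectors, and the reading of MAEV as the mean angle between corresponding vectors are all assumptions made here for illustration, since the abstract only names the metric. The function names are hypothetical.

```python
import numpy as np


def euler_to_matrix(yaw_deg, pitch_deg, roll_deg):
    """Rotation matrix from Euler angles in degrees.

    Assumed convention (not taken from the source):
    R = Rz(roll) @ Ry(yaw) @ Rx(pitch).
    """
    y, p, r = np.radians([yaw_deg, pitch_deg, roll_deg])
    rx = np.array([[1.0, 0.0, 0.0],
                   [0.0, np.cos(p), -np.sin(p)],
                   [0.0, np.sin(p), np.cos(p)]])
    ry = np.array([[np.cos(y), 0.0, np.sin(y)],
                   [0.0, 1.0, 0.0],
                   [-np.sin(y), 0.0, np.cos(y)]])
    rz = np.array([[np.cos(r), -np.sin(r), 0.0],
                   [np.sin(r), np.cos(r), 0.0],
                   [0.0, 0.0, 1.0]])
    return rz @ ry @ rx


def euler_mae(pred_deg, true_deg):
    """Mean absolute error over the three Euler angles, in degrees."""
    return float(np.mean(np.abs(np.asarray(pred_deg) - np.asarray(true_deg))))


def maev(r_pred, r_true):
    """Assumed MAEV: mean angle (degrees) between corresponding columns."""
    dots = np.sum(r_pred * r_true, axis=0)  # column-wise dot products
    norms = np.linalg.norm(r_pred, axis=0) * np.linalg.norm(r_true, axis=0)
    cosines = np.clip(dots / norms, -1.0, 1.0)  # guard against rounding
    return float(np.degrees(np.arccos(cosines)).mean())


# Profile view (yaw = 90 degrees): under the assumed convention, only the
# difference pitch - roll affects the rotation, so these two annotations
# describe the same head pose.
pred = (90.0, 30.0, 30.0)   # (yaw, pitch, roll)
true = (90.0, 0.0, 0.0)

print("Euler MAE:", euler_mae(pred, true))   # 20 degrees of apparent error
print("MAEV:", maev(euler_to_matrix(*pred), euler_to_matrix(*true)))  # near 0
```

Under these assumptions the two annotations differ by 20 degrees of Euler MAE while describing the same rotation, which is the kind of profile-view mismatch the abstract refers to. A vector-based error computed on the rotation matrices does not have this ambiguity.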
Pages: 1187-1196
Page count: 10