Head Pose Estimation Based on Multivariate Label Distribution

Cited by: 142
Authors
Geng, Xin [1]
Xia, Yu [1]
Affiliations
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
Source
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014
Keywords
AGE
DOI
10.1109/CVPR.2014.237
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Accurate ground truth pose is essential to the training of most existing head pose estimation algorithms. However, in many cases, the "ground truth" pose is obtained in rather subjective ways, such as asking the human subjects to stare at different markers on the wall. In such cases, it is better to use soft labels rather than explicit hard labels. Therefore, this paper proposes to associate a multivariate label distribution (MLD) with each image. An MLD covers a neighborhood around the original pose. Labeling the images with MLDs can not only alleviate the problem of inaccurate pose labels, but also increase the number of training examples associated with each pose without actually increasing the total number of training examples. Two algorithms are proposed to learn from the MLD by minimizing the weighted Jeffrey's divergence between the predicted MLD and the ground truth MLD. Experimental results show that the MLD-based methods perform significantly better than the compared state-of-the-art head pose estimation algorithms.
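The two ideas in the abstract, a soft label spread over neighboring poses and a Jeffrey's divergence between two such labels, can be illustrated with a minimal sketch. This is not the authors' implementation. The discretized Gaussian shape, the value of `sigma`, the 15-degree yaw/pitch grid, and the function names `make_mld` and `jeffreys_divergence` are all assumptions made for illustration. The abstract does not say how the divergence is weighted, so the optional `weights` argument is a placeholder that defaults to uniform weights.

```python
import math


def make_mld(yaw, pitch, yaws, pitches, sigma=15.0):
    """Build a soft label over a (yaw, pitch) grid.

    Assumption: a discretized 2D Gaussian centred on the annotated pose,
    normalised so that the description degrees sum to 1.
    """
    raw = {
        (y, p): math.exp(-((y - yaw) ** 2 + (p - pitch) ** 2) / (2.0 * sigma ** 2))
        for y in yaws
        for p in pitches
    }
    total = sum(raw.values())
    return {pose: value / total for pose, value in raw.items()}


def jeffreys_divergence(p, q, weights=None, eps=1e-12):
    """Sum over poses of w * (p - q) * ln(p / q).

    With uniform weights this equals KL(p || q) + KL(q || p).
    The weights are a hypothetical hook; the abstract does not define them.
    """
    total = 0.0
    for pose in p:
        w = 1.0 if weights is None else weights[pose]
        a = max(p[pose], eps)  # clip to avoid log(0)
        b = max(q[pose], eps)
        total += w * (a - b) * math.log(a / b)
    return total


# Hypothetical pose grid: yaw and pitch from -90 to 90 degrees in 15-degree steps.
yaws = range(-90, 91, 15)
pitches = range(-90, 91, 15)

truth = make_mld(0, 0, yaws, pitches)    # annotated pose: frontal
near = make_mld(15, 0, yaws, pitches)    # prediction one grid step away
far = make_mld(60, 30, yaws, pitches)    # prediction far from the annotation

print(jeffreys_divergence(truth, near))
print(jeffreys_divergence(truth, far))
```

Because each label covers a neighborhood, an image annotated at one pose also contributes, with a smaller degree, to the adjacent poses, which is the sense in which the examples per pose grow without adding images. Under these assumptions, a prediction centred near the annotated pose yields a smaller divergence than one centred far away.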
Pages: 1837-1842
Page count: 6