Combined Framework for Real-time Head Pose Estimation using Facial Landmark Detection and Salient Feature Tracking

被引:7
作者
Barros, Jilliam Maria Diaz [1 ,2 ]
Garcia, Frederic [1 ]
Mirbach, Bruno [1 ]
Varanasi, Kiran [2 ]
Stricker, Didier [2 ]
机构
[1] IEE SA, PTU Opt, Contern, Luxembourg
[2] German Res Ctr Artificial Intelligence DFKI, Kaiserslautern, Germany
来源
PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP | 2018年
关键词
Head Pose Estimation; Real Time; Fusion; MODEL;
D O I
10.5220/0006628701230133
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel approach to address the head pose estimation (HPE) problem in real world and demanding applications. We propose a new framework that combines the detection of facial landmarks with the tracking of salient features within the head region. That is, rigid facial landmarks are detected from a given face image, while at the same time, salient features are detected within the head region. The 3D coordinates of both set of features result from their intersection on a simple geometric head model (e.g., cylinder or ellipsoid). We then formulate the HPE problem as a perspective-n-point problem that we separately solve by minimizing the reprojection error of each 3D features set and their corresponding facial or salient features in the next face image. The resulting head pose estimations are then combined using Kalman Filter, which allows us to take advantage of the high accuracy when using facial landmarks while enabling us to handle extreme head poses by using salient features. Results are comparable to those from the related literature, with the advantage of being robust under real world situations that might not be covered in the evaluated datasets.
引用
收藏
页码:123 / 133
页数:11
相关论文
共 43 条
[1]   Real-Time Head Orientation from a Monocular Camera Using Deep Neural Network [J].
Ahn, Byungtae ;
Park, Jaesik ;
Kweon, In So .
COMPUTER VISION - ACCV 2014, PT III, 2015, 9005 :82-96
[2]   3D Head Tracking and Pose-Robust 2D Texture Map-Based Face Recognition using a Simple Ellipsoid Model [J].
An, Kwang Ho ;
Chung, Myung Jin .
2008 IEEE/RSJ INTERNATIONAL CONFERENCE ON ROBOTS AND INTELLIGENT SYSTEMS, VOLS 1-3, CONFERENCE PROCEEDINGS, 2008, :307-312
[3]  
[Anonymous], 2010, WORKSH EYE GAZ INT H
[4]  
[Anonymous], 2004, 11 WORLD C INT TRANS
[5]  
[Anonymous], 2015, P 2 WORKSH COMP MOD
[6]  
Baltrusaitis T., 2012, INT C COMP VIS PATT
[7]  
Borghi G., 2017, INT C COMP VIS PATT
[8]  
Bouguet J.-Y., 2001, INTEL CORP, V5, P1, DOI DOI 10.1109/HPDC.2004.1323531
[9]   Multi-spectral and multi-perspective video arrays for driver body tracking and activity analysis [J].
Cheng, Shinko Y. ;
Park, Sangho ;
Trivedi, Mohan M. .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2007, 106 (2-3) :245-257
[10]   Robust head tracking using 3D ellipsoidal head model in particle filter [J].
Choi, Sukwon ;
Kim, Daijin .
PATTERN RECOGNITION, 2008, 41 (09) :2901-2915