Head Pose Estimation and Augmented Reality Tracking: An Integrated System and Evaluation for Monitoring Driver Awareness

被引:194
作者
Murphy-Chutorian, Erik [1 ]
Trivedi, Mohan Manubhai [1 ]
机构
[1] Univ Calif San Diego, Dept Elect & Comp Engn, Comp Vis & Robot Res Lab, La Jolla, CA 92093 USA
关键词
Active safety; graphics programming units; head pose estimation; human-computer interface; intelligent driver assistance; performance metrics and evaluation; real-time machine vision; support vector classifiers; 3-D face models and tracking; COMPUTER-VISION; ATTENTION;
D O I
10.1109/TITS.2010.2044241
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Driver distraction and inattention are prominent causes of automotive collisions. To enable driver-assistance systems to address these problems, we require new sensing approaches to infer a driver's focus of attention. In this paper, we present a new procedure for static head-pose estimation and a new algorithm for visual 3-D tracking. They are integrated into the novel real-time (30 fps) system for measuring the position and orientation of a driver's head. This system consists of three interconnected modules that detect the driver's head, provide initial estimates of the head's pose, and continuously track its position and orientation in six degrees of freedom. The head-detection module consists of an array of Haar-wavelet Adaboost cascades. The initial pose estimation module employs localized gradient orientation (LGO) histograms as input to support vector regressors (SVRs). The tracking module provides a fine estimate of the 3-D motion of the head using a new appearance-based particle filter for 3-D model tracking in an augmented reality environment. We describe our implementation that utilizes OpenGL-optimized graphics hardware to efficiently compute particle samples in real time. To demonstrate the suitability of this system for real driving situations, we provide a comprehensive evaluation with drivers of varying ages, race, and sex spanning daytime and nighttime conditions. To quantitatively measure the accuracy of system, we compare its estimation results to a marker-based cinematic motion-capture system installed in the automotive testbed.
引用
收藏
页码:300 / 311
页数:12
相关论文
共 45 条
  • [21] Parametrized structured from motion for 3D adaptive feedback tracking of faces
    Jebara, TS
    Pentland, A
    [J]. 1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, : 144 - 150
  • [22] Kessenich J., 2006, OPENGL SHADING LANGU
  • [23] The influence of head contour and nose angle on the perception of eye-gaze direction
    Langton, SRH
    Honeyman, H
    Tessler, E
    [J]. PERCEPTION & PSYCHOPHYSICS, 2004, 66 (05): : 752 - 771
  • [24] LI Y, 2000, P IEEE INT C AUT FAC, P300
  • [25] Lienhart R, 2002, IEEE IMAGE PROC, P900
  • [26] Distinctive image features from scale-invariant keypoints
    Lowe, DG
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) : 91 - 110
  • [27] Lane change intent analysis using robust operators and sparse Bayesian learning
    McCall, Joel C.
    Wipf, David P.
    Trivedi, Mohan M.
    Rao, Bhaskar D.
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2007, 8 (03) : 431 - 440
  • [28] A performance evaluation of local descriptors
    Mikolajczyk, K
    Schmid, C
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (10) : 1615 - 1630
  • [29] Morency LP, 2003, PROC CVPR IEEE, P803
  • [30] Murphy-Chutorian E., 2008, Proc. Computer Vision and Pattern Recognition Workshop, P1