Robust classification of face and head gestures in video

Cited by: 28
Authors
Akakin, Hatice Cinar [1]
Sankur, Bulent [1]
Affiliations
[1] Bogazici Univ, Istanbul, Turkey
Keywords
Face and head gesture classification; Facial landmark tracking; Time series analysis; Fusion of classifiers; Facial action; Expression recognition; Tracking; Pose
DOI
10.1016/j.imavis.2011.03.001
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Automatic analysis of head gestures and facial expressions is a challenging research area with significant applications in human-computer interfaces. We develop a face and head gesture detector for video streams. The detector follows the facial landmark paradigm in that it uses both the appearance and the configuration of landmarks. First, we detect and accurately track facial landmarks using adaptive templates, a Kalman predictor and subspace regularization. Then the trajectories (time series) of facial landmark positions over the course of the head gesture or facial expression are converted into various discriminative features. Features can be landmark coordinate time series, facial geometric features or patches on expressive regions of the face. We compare two feature sequence classifiers, namely Hidden Markov Models (HMM) and Hidden Conditional Random Fields (HCRF), and feature subspace classifiers, namely Independent Component Analysis (ICA) and Non-negative Matrix Factorization (NMF), on the spatiotemporal data. We achieve 87.3% correct gesture classification on a seven-gesture test database, and performance reaches 98.2% correct detection under a fusion scheme. Promising and competitive results are also achieved in classifying naturally occurring gesture clips from the LILiR TwoTalk Corpus. (C) 2011 Elsevier B.V. All rights reserved.
Pages: 470-483
Number of pages: 14
Related papers
56 in total
  • [1] AKAKIN HC, 2007, 3DTV C KOS ISL GREEC, P1
  • [2] Salah AA, 2007, ANN TELECOMMUN, V62, P83
  • [3] [Anonymous], P SPIE
  • [4] [Anonymous], 1 COST 2101 WORKSH B
  • [5] A multi-class classification strategy for Fisher scores: Application to signer independent sign language recognition
    Aran, Oya
    Akarun, Lale
    [J]. PATTERN RECOGNITION, 2010, 43 (05) : 1776 - 1788
  • [6] ARI I, 2008, FACIAL FEATURE TRACK, P1
  • [7] Effects of damping head movement and facial expression in dyadic conversation using real-time facial expression tracking and synthesized avatars
    Boker, Steven M.
    Cohn, Jeffrey F.
    Theobald, Barry-John
    Matthews, Iain
    Brick, Timothy R.
    Spies, Jeffrey R.
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2009, 364 (1535) : 3485 - 3495
  • [8] BOWDEN R, 2010, LILIR TWOTALK CORPUS
  • [9] ACTIVE SHAPE MODELS - THEIR TRAINING AND APPLICATION
    COOTES, TF
    TAYLOR, CJ
    COOPER, DH
    GRAHAM, J
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 1995, 61 (01) : 38 - 59
  • [10] Active appearance models
    Cootes, TF
    Edwards, GJ
    Taylor, CJ
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (06) : 681 - 685