Robust full-motion recovery of head by dynamic templates and re-registration techniques

被引：102

作者：

Xiao, J ^{[1
]}

Moriyama, T

Kanade, T

Cohn, JF

机构：

[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA

[2] Univ Pittsburgh, Pittsburgh, PA 15260 USA

来源：

INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY | 2003年 / 13卷 / 01期

关键词：

3D full motion recovery; dynamic templates; compensated IRLS; facial expression analysis;

D O I：

10.1002/ima.10048

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This article presents a method to recover the full-motion (3 rotations and 3 translations) of the head from an input video using a cylindrical head model. Given an initial reference template of the head image and the corresponding head pose, the head model is created and full head motion is recovered automatically. The robustness of the approach is achieved by a combination of three techniques. First, we use the iteratively reweighted least squares (IRLS) technique in conjunction with the image gradient to accommodate nonrigid motion and occlusion. Second, while tracking, the templates are dynamically updated to diminish the effects of self-occlusion and gradual lighting changes and to maintain accurate tracking even when the face moves out of view of the camera. Third, to minimize error accumulation inherent in the use of dynamic templates, we re-register images to a reference template whenever head pose is close to that in the template. The performance of the method, which runs in real time, was evaluated in three separate experiments using image sequences (both synthetic and real) for which ground truth head motion was known. The real sequences included pitch and yaw as large as 40degrees and 75degrees, respectively. The average recovery accuracy of the 3D rotations was about 3degrees. In a further test, the method was used as part of a facial expression analysis system intended for use with spontaneous facial behavior in which moderate head motion is common. Image data consisted of 1-min of video from each of 10 subjects while engaged in a two-person interview. The method successfully stabilized face and eye images allowing for 98% accuracy in automatic blink recognition. (C) 2003 Wiley Periodicals, Inc.

引用

页码：85 / 94

页数：10

共 25 条

[1] Measuring facial expressions by computer image analysis [J].

Bartlett, MS ;

Hager, JC ;

Ekman, P ;

Sejnowski, TJ .

PSYCHOPHYSIOLOGY, 1999, 36 (02) :253-263

[2]

BASU S, 1996, ICPR96

[3]

Black M, 1992, THESIS YALE U

[4] Recognizing facial expressions in image sequences using local parameterized models of image motion [J].

Black, MJ ;

Yacoob, Y .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 1997, 25 (01) :23-48

[5] Tracking people with twists and exponential maps [J].

Bregler, C ;

Malik, J .

1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, :8-15

[6] Infant ''surprise'' expressions as coordinative motor structures [J].

Camras, LA ;

Lambrecht, L ;

Michel, GF .

JOURNAL OF NONVERBAL BEHAVIOR, 1996, 20 (03) :183-195

[7] The integration of optical flow and deformable models with applications to human face shape and motion estimation [J].

DeCarlo, D ;

Metaxas, D .

1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, :231-238

[8] Classifying facial actions [J].

Donato, G ;

Bartlett, MS ;

Hager, JC ;

Ekman, P ;

Sejnowski, TJ .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (10) :974-989

[9]

Ekman P., 1978, Facial action coding system: manual, DOI 10.1037/t27734-000

[10] Coding, analysis, interpretation, and recognition of facial expressions [J].

Essa, IA ;

Pentland, AP .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) :757-763

← 1 2 3 →