Tracking a face for knowledge-based coding of videophone sequences

被引:10
作者
Zhang, L
机构
关键词
face tracking; face model; global head motion compensation; shape update;
D O I
10.1016/S0923-5965(97)00020-9
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Tracking a face is one of the important topics for knowledge-based coding of videophone sequences and also for the representation of 3D objects within MPEG-4 Synthetic/Natural Hybrid Coding (SNHC). Up to now, the face model has been tracked by global head motion compensation. Because the 3D head model shape affects the accuracy of motion estimation, an inaccurate head model shape reduces the accuracy of face tracking. In this paper, a new algorithm for tracking a face combining global head motion compensation and the update of the face model Candide during the sequence is proposed. As a first stage of the proposed algorithm, face tracking only by global head motion compensation is used. After that, the 2D center positions of the eyes and the mouth of a person in the image sequence are estimated using template matching and feature point extraction techniques. Then, the shape of the face model Candide is updated during the sequence using these estimated 2D center positions. This proposed algorithm has been applied to typical videophone sequences with a spatial resolution corresponding to CIF and a frame rate of 10 Hz. For evaluation, error criteria have been introduced which give position errors of the eyes and the mouth averaged over a whole sequence. The experimental results show that the proposed algorithm reduces the average position errors for the eyes and the mouth by 48% and 53%, respectively, compared to face tracking by global head motion compensation only. (C) 1997 Elsevier Science B.V.
引用
收藏
页码:93 / 114
页数:22
相关论文
共 25 条
[1]   ON THE COMPUTATION OF MOTION FROM SEQUENCES OF IMAGES - A REVIEW [J].
AGGARWAL, JK ;
NANDHAKUMAR, N .
PROCEEDINGS OF THE IEEE, 1988, 76 (08) :917-935
[2]   MODEL-BASED IMAGE-CODING - ADVANCED VIDEO CODING TECHNIQUES FOR VERY-LOW BIT-RATE APPLICATIONS [J].
AIZAWA, K ;
HUANG, TS .
PROCEEDINGS OF THE IEEE, 1995, 83 (02) :259-271
[3]   3-D MOTION ESTIMATION AND WIREFRAME ADAPTATION INCLUDING PHOTOMETRIC EFFECTS FOR MODEL-BASED CODING OF FACIAL IMAGE SEQUENCES [J].
BOZDAGI, G ;
TEKALP, AM ;
ONURAL, L .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1994, 4 (03) :246-256
[4]   AN IMPROVEMENT TO MBASIC ALGORITHM FOR 3-D MOTION AND DEPTH ESTIMATION [J].
BOZDAGI, G ;
TEKALP, AM ;
ONURAL, L .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1994, 3 (05) :711-716
[5]  
CHOI CS, 1994, IEEE T CIRC SYST VID, V4, P257, DOI 10.1109/76.305871
[6]   MOTION AND STRUCTURE FROM FEATURE CORRESPONDENCES - A REVIEW [J].
HUANG, TS ;
NETRAVALI, AN .
PROCEEDINGS OF THE IEEE, 1994, 82 (02) :252-268
[7]  
*ISO IEC, 1996, JTC1SC29WG11 ISOIEC
[8]   Automatic adaptation of a face model in a layered coder with an object-based analysis-synthesis layer and a knowledge-based layer [J].
Kampmann, M ;
Ostermann, J .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 1997, 9 (03) :201-220
[9]  
KAPPEI F, 1987, P SOC PHOTO-OPT INS, V860, P126
[10]  
KAPPEI F, PICT COD S PCS 88 TO