Accurate and quasi-automatic lip tracking

Cited by: 78
Authors
Eveno, N [1]
Caplier, A [1]
Coulon, PY [1]
Affiliation
[1] Lab Images & Signaux, F-38031 Grenoble, France
Keywords
active contour; deformable model; points tracking; segmentation
DOI
10.1109/TCSVT.2004.826754
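Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract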
Lip segmentation is an essential stage in many multimedia systems such as videoconferencing, lip reading, or low-bit-rate coding communication systems. In this paper, we propose an accurate and robust quasi-automatic lip segmentation algorithm. First, the upper mouth boundary and several characteristic points are detected in the first frame by using a new kind of active contour: the "jumping snake." Unlike classic snakes, it can be initialized far from the final edge and the adjustment of its parameters is easy and intuitive. Then, to achieve the segmentation, we propose a parametric model composed of several cubic curves. Its high flexibility enables accurate lip contour extraction even in the challenging case of a very asymmetric mouth. Compared to existing models, it brings a significant improvement in accuracy and realism. The segmentation in the following frames is achieved by using an interframe tracking of the keypoints and the model parameters. However, we show that, with a usual tracking algorithm, the keypoints' positions become unreliable after a few frames. We therefore propose an adjustment process that enables an accurate tracking even after hundreds of frames. Finally, we show that the mean keypoints' tracking errors of our algorithm are comparable to manual points' selection errors.
Pages: 706-715
Page count: 10