Lip Tracking Using Particle Filter and Geometric Model for Visual Speech Recognition

被引:0
作者
Jarraya, Islem [1 ]
Werda, Salah [2 ]
Mahdi, Walid [1 ]
机构
[1] Univ Sfax, Multimedia InfoRmat Syst & Adv Comp Lab MIRACL, ISIMS, Route Tunis Km 10, Sfax, Tunisia
[2] Univ Sfax, Multimedia InfoRmat Syst & Adv Comp Lab MIRACL, ISGI, Route Tunis Km 10, Sfax, Tunisia
来源
2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS (SIGMAP) | 2014年
关键词
Lip Localization; Geometric Lip Model; Lip Tracking; Lip Descriptors Extraction; Viseme Classification and Recognition;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The automatic lip-reading is a technology which helps understanding messages exchanged in the case of a noisy environment or of elderly hearing impairment. To carry out this system, we need to implement three subsystems. There is a locating and tracking lips system, labial descriptors extraction system and a classification and speech recognition system. In this work, we present a spatio-temporal approach to track and characterize lip movements for the automatic recognition of visemes of the French language. First, we segment lips using the color information and a geometric model of lips. Then, we apply a particle filter to track lip movements. Finally, we propose to extract and classify the visual informations to recognize the pronounced viseme. This approach is applied with multiple speakers in natural conditions.
引用
收藏
页码:172 / 179
页数:8
相关论文
共 13 条
[1]  
Beaumesnil B., 2006, ICPR INT C PATT REC
[2]  
Bouvier C., 2010, SEGMENTATION REGION
[3]  
Kalbkhani H., 2012, J WORLDS ELECT ENG T
[4]  
Liu X., 2011, ICASSP IEEE INT C AC
[5]  
Mahdi W., 2008, ICAE 03 INTEGRATED C
[6]  
Majumdar J., 2013, INT J EMERGING TECHN
[7]  
Meng J., 2014, J COMPUTATIONAL INFO
[8]  
Nicolas E., 2003, SEGMENTATION LEVRES
[9]  
Segura C., 2014, SENSORS TRANSDUCERS
[10]  
Shirinzadeh F., 2012, INT J COMPUTER THEOR