Automatic Lip Reading in the Dutch Language Using Active Appearance Models on High Speed Recordings

Cited by: 0
Authors
Chitu, Alin Gavril [1]
Driel, Karin [1]
Rothkrantz, Leon J. M. [1]
Affiliations
[1] Delft Univ Technol, Man Machine Interact Grp, Dept Mediamat, NL-2628 CD Delft, Netherlands
Source
TEXT, SPEECH AND DIALOGUE | 2010 / Vol. 6231
Keywords
lip reading; active appearance models; high speed recordings; data corpus; NDUTAVSC; Dutch;
DOI
Not available
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents our work on lip reading in the Dutch language. The results are based on a new data corpus recorded at 100 Hz in our group. The NDUTAVSC corpus is to date the largest corpus built for lip reading in Dutch. To parameterise the input data we use Active Appearance Models (AAM). From the AAM results we define a set of high-level geometric features, which are used to train recognizers for different recognition tasks: fixed-length digit strings, random-length letter strings, random word sequences, fixed-topic continuous speech, and random continuous speech. We show that our approach gives great improvements over previous results. We also investigate the influence of high-speed recordings on recognition performance, and show that at high speech rates the use of higher-speed recordings is necessary.
Pages: 259-266
Page count: 8