A multiple deformable template approach for visual speech recognition

Cited by: 0
Authors
Chandramohan, D
Silsbee, PL
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
In this paper, we propose an improved deformable template algorithm for modeling the shape of a talker's mouth. We use a two-step approach that begins by classifying mouth images into broad categories. The classification procedure yields both a set of template parameters (in effect, a unique template) and a set of initial conditions. In the second step, the deformable template is allowed to converge using standard techniques. This multiple-template approach is significantly more flexible than single-template approaches and consistently provides better solutions. We present examples of single- and multiple-template solutions that support this statement. In a small recognition experiment based only on visual information, recognition of consonants improved from 16% to 33% when multiple templates were used.
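The two-step procedure described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the `TemplateModel` fields, the nearest-centroid classifier, the two-number shape (width, height), and the quadratic toy energy minimized by plain gradient descent. A real deformable template would derive its energy from image features rather than from a known `target`.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class TemplateModel:
    """One entry of a hypothetical template bank (all fields are assumptions)."""
    centroid: List[float]       # feature-space prototype of a broad mouth category
    weights: List[float]        # template parameters: per-term energy weights
    initial_shape: List[float]  # initial conditions, here [width, height]


def classify(features: List[float], bank: Dict[str, TemplateModel]) -> str:
    """Step 1: pick the category whose centroid is nearest to the features."""
    def sq_dist(a: List[float], b: List[float]) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(bank, key=lambda name: sq_dist(features, bank[name].centroid))


def converge(shape: List[float], target: List[float], weights: List[float],
             step: float = 0.1, iters: int = 200) -> List[float]:
    """Step 2: gradient descent on a toy energy sum(w * (s - t)^2).

    `target` stands in for image evidence; it is a placeholder, not the
    energy formulation used in the paper.
    """
    shape = list(shape)
    for _ in range(iters):
        grad = [2 * w * (s - t) for w, s, t in zip(weights, shape, target)]
        shape = [s - step * g for s, g in zip(shape, grad)]
    return shape


def fit_mouth(features: List[float], target: List[float],
              bank: Dict[str, TemplateModel]) -> Tuple[str, List[float]]:
    """Classify first, then let the selected template converge."""
    name = classify(features, bank)
    model = bank[name]
    return name, converge(model.initial_shape, target, model.weights)


# Hypothetical two-category bank; the numbers are arbitrary.
BANK = {
    "closed": TemplateModel([0.1, 0.2], [1.0, 1.0], [40.0, 2.0]),
    "open": TemplateModel([0.8, 0.9], [1.0, 0.5], [45.0, 20.0]),
}

category, shape = fit_mouth(features=[0.7, 1.0], target=[50.0, 25.0], bank=BANK)
```

The point of the sketch is only the control flow: the classifier selects both the energy weights and the starting shape, so the descent in step 2 starts close to a plausible solution for that category instead of from a single generic template.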
Pages: 50-53
Page count: 4