A multiple deformable template approach for visual speech recognition

被引：0

作者：

Chandramohan, D

Silsbee, PL

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we propose an improved deformable template algorithm for modeling the shape of a talker's mouth. We use a two step approach which begins by classifying mouth images into broad categories. The classification procedure yields both a set of template parameters (in effect, a unique template) and a set of initial conditions. The second step is to allow the deformable template to converge using standard techniques. The multi-model approach is significantly more flexible than single-model approaches and consistently provides better solutions. We present examples of single and multiple template solutions which support this statement. In a small recognition experiment, recognition of consonants improved from 16% to 33%, based only on visual information, when multiple templates were used.

引用

页码：50 / 53

页数：4

共 50 条

[1] Deformable template recognition of multiple occluded objects
Mardia, KV
Qian, W
Shah, D
deSouza, KMA
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (09) : 1035 - 1042
[2] Visual speech recognition for multiple languages in the wild
Ma, Pingchuan
Petridis, Stavros
Pantic, Maja
NATURE MACHINE INTELLIGENCE, 2022, 4 (11) : 930 - 939
[3] Visual speech recognition for multiple languages in the wild
Pingchuan Ma
Stavros Petridis
Maja Pantic
Nature Machine Intelligence, 2022, 4 : 930 - 939
[4] A novel approach of road recognition based on deformable template and genetic algorithm
Liu, T
Zheng, NN
Cheng, H
Xing, ZB
2003 IEEE INTELLIGENT TRANSPORTATION SYSTEMS PROCEEDINGS, VOLS. 1 & 2, 2003, : 1251 - 1256
[5] A TEMPLATE POLYNOMIAL APPROACH FOR IMAGE-PROCESSING AND VISUAL RECOGNITION
QIAN, K
BHATTACHARYA, P
PATTERN RECOGNITION, 1992, 25 (12) : 1505 - 1515
[6] USING MULTIPLE VISUAL TANDEM STREAMS IN AUDIO-VISUAL SPEECH RECOGNITION
Topkaya, Ibrahim Saygin
Erdogan, Hakan
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4988 - 4991
[7] On deformable models for visual pattern recognition
Cheung, KW
Yeung, DY
Chin, RT
PATTERN RECOGNITION, 2002, 35 (07) : 1507 - 1526
[8] Visual Servo with Template Update for a Deformable Moving Object
Cheng, Chi-Cheng
Chou, Cheng-Te
26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 5302 - 5307
[9] Recognition of human shapes by deformable template and neural network
Tate, S
Oka, S
Takefuji, Y
ARTIFICIAL INTELLIGENCE IN REAL-TIME CONTROL 1998, 1999, : 137 - 141
[10] Object detection and recognition based on multiscale deformable template
Yu, Li
Wang, Run-Sheng
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2002, 39 (10):

← 1 2 3 4 5 →