Acoustic-labial speaker verification

被引:37
作者
Jourlin, P
Luettin, J
Genoud, D
Wassner, H
机构
[1] IDIAP, CH-1920 Martigny, Switzerland
[2] LIA, F-84911 Avignon 9, France
关键词
person authentication; bimodal speech; decision fusion; lip feature extraction; hidden Markov models;
D O I
10.1016/S0167-8655(97)00070-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a multimodal approach for speaker verification. The system consists of two classifiers, one using visual features, the other using acoustic features. A lip tracker is used to extract visual information from the speaking face which provides shape and intensity features. We describe an approach for normalizing and mapping different modalities onto a common confidence interval. We also describe a novel method for integrating the scores of multiple classifiers. Verification experiments are reported for the individual modalities and for the combined classifier. The integrated system outperformed each sub-system and reduced the false acceptance rate of the acoustic sub-system from 2.3% to 0.5%. (C) 1997 Elsevier Science B.V.
引用
收藏
页码:853 / 858
页数:6
相关论文
共 14 条
[1]  
Acheroy M., 1996, Proceedings of the European Conference on Multimedia Applications, Services and Techniques, P747
[2]   PERSON IDENTIFICATION USING MULTIPLE CUES [J].
BRUNELLI, R ;
FALAVIGNA, D .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (10) :955-966
[3]   HUMAN AND MACHINE RECOGNITION OF FACES - A SURVEY [J].
CHELLAPPA, R ;
WILSON, CL ;
SIROHEY, S .
PROCEEDINGS OF THE IEEE, 1995, 83 (05) :705-740
[4]  
CHOLLET G, 1995, SWISS FRENCH POLYPHO
[5]   USE OF ACTIVE SHAPE MODELS FOR LOCATING STRUCTURE IN MEDICAL IMAGES [J].
COOTES, TF ;
HILL, A ;
TAYLOR, CJ ;
HASLAM, J .
IMAGE AND VISION COMPUTING, 1994, 12 (06) :355-365
[6]  
FURUI S, 1994, P ESCA WORKSH AUT SP, P1
[7]  
Genoud D, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P1756, DOI 10.1109/ICSLP.1996.607968
[8]  
JOURLIN P, 1996, P EUR SIGN PROC C TR, P133
[9]  
JOURLIN P, 1995, P INT WORKSH AUT FAC, P320
[10]  
Luettin J, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P58, DOI 10.1109/ICSLP.1996.607024