English-Chinese bilingual text-independent speaker verification

Cited by: 0

Authors:

Ma, B [1]

Meng, H [1]

Affiliations:

[1] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Human Comp Commun Lab, Hong Kong, Hong Kong, Peoples R China

Source:

2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION | 2004

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper describes the development of a text-independent speaker verification (TISV) system for English and Chinese utterances. We designed and collected a bilingual database that contains spoken responses and commands of short, medium and long durations. The TISV system uses Gaussian mixtures for speaker models. Our experiments indicate that language mismatch between enrolment and verification data leads to significant degradation in verification performance (between 40% and 49%). To maximize robustness to language change in test utterances, speaker models were trained with utterances from both languages. Results indicate that this effectively closes the performance degradation gap caused by language mismatch.


Pages: 293-296

Number of pages: 4
