Discriminative training for speaker identification based on maximum model distance algorithm

被引：0

作者：

Hong, QY ^{[1
]}

Kwong, S ^{[1
]}

机构：

[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China

来源：

2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING | 2004年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper we apply the Maximum model distance (MMD) training [4] to speaker identification and a new selection strategy of competitive speakers is proposed to it. The traditional ML method only utilizes the utterances for each speaker model, which probably leads to a local optimization solution. By maximizing the dissimilarities among those similar speaker models, MMD could add the discriminative capability into the training procedure and then improve the identification performance. Based on the TIMIT corpus, we designed the word and sentence experiments to evaluate this proposed training approach. The results show that the identification performance can be improved greatly when the training data is limited.

引用

页码：25 / 28

页数：4

共 50 条

[1] Maximum Model Distance Discriminative Training for Text-Independent Speaker Verification
Hong, Q. Y.
Kwong, S.
IECON 2004: 30TH ANNUAL CONFERENCE OF IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOL 2, 2004, : 1769 - 1774
[2] A discriminative training algorithm for VQ-based speaker identification
He, JL
Liu, L
Palm, G
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (03): : 353 - 356
[3] Discriminative training for speaker identification
Hong, QY
Kwong, S
ELECTRONICS LETTERS, 2004, 40 (04) : 280 - 281
[4] Discriminative training of GMM for speaker identification
delAlamo, CM
Gil, FJC
Munilla, CDL
Gomez, LH
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 89 - 92
[5] Improved Speaker Identification Algorithm based on Discriminative Weighted Method
Li Shaomei
Guo Yunfei
Liu Lixiong
MINES 2009: FIRST INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY, VOL 1, PROCEEDINGS, 2009, : 642 - 644
[6] Chinese dialect identification based on genetic algorithm for discriminative training of bigram model
Tsai, Wuei-He, 2000, IEICE of Japan, Tokyo, Japan (E83-D)
[7] Chinese dialect identification based on genetic algorithm for discriminative training of bigram model
Tsai, WH
Chang, WW
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2000, E83D (12) : 2183 - 2185
[8] Incremental speaker adaptation with minimum error discriminative training for speaker identification
delAlamo, CM
Alvarez, J
delaTorre, C
Poyatos, FJ
Hernandez, L
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1760 - 1763
[9] Discriminative training of GMM based on Maximum Mutual Information for language identification
Qu Dan
Wang Bingxi
Yan Honggang
Dai Guannan
WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 1576 - +
[10] Speaker Identification based on Discriminative Vector Quantization
Zhou, GY
Mikhael, WB
Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 617 - 620

← 1 2 3 4 5 →