Discriminative training for speaker identification based on maximum model distance algorithm

被引:0
|
作者
Hong, QY [1 ]
Kwong, S [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
来源
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING | 2004年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we apply the Maximum model distance (MMD) training [4] to speaker identification and a new selection strategy of competitive speakers is proposed to it. The traditional ML method only utilizes the utterances for each speaker model, which probably leads to a local optimization solution. By maximizing the dissimilarities among those similar speaker models, MMD could add the discriminative capability into the training procedure and then improve the identification performance. Based on the TIMIT corpus, we designed the word and sentence experiments to evaluate this proposed training approach. The results show that the identification performance can be improved greatly when the training data is limited.
引用
收藏
页码:25 / 28
页数:4
相关论文
共 50 条
  • [1] Maximum Model Distance Discriminative Training for Text-Independent Speaker Verification
    Hong, Q. Y.
    Kwong, S.
    IECON 2004: 30TH ANNUAL CONFERENCE OF IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOL 2, 2004, : 1769 - 1774
  • [2] A discriminative training algorithm for VQ-based speaker identification
    He, JL
    Liu, L
    Palm, G
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (03): : 353 - 356
  • [3] Discriminative training for speaker identification
    Hong, QY
    Kwong, S
    ELECTRONICS LETTERS, 2004, 40 (04) : 280 - 281
  • [4] Discriminative training of GMM for speaker identification
    delAlamo, CM
    Gil, FJC
    Munilla, CDL
    Gomez, LH
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 89 - 92
  • [5] Improved Speaker Identification Algorithm based on Discriminative Weighted Method
    Li Shaomei
    Guo Yunfei
    Liu Lixiong
    MINES 2009: FIRST INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY, VOL 1, PROCEEDINGS, 2009, : 642 - 644
  • [7] Chinese dialect identification based on genetic algorithm for discriminative training of bigram model
    Tsai, WH
    Chang, WW
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2000, E83D (12) : 2183 - 2185
  • [8] Incremental speaker adaptation with minimum error discriminative training for speaker identification
    delAlamo, CM
    Alvarez, J
    delaTorre, C
    Poyatos, FJ
    Hernandez, L
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1760 - 1763
  • [9] Discriminative training of GMM based on Maximum Mutual Information for language identification
    Qu Dan
    Wang Bingxi
    Yan Honggang
    Dai Guannan
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 1576 - +
  • [10] Speaker Identification based on Discriminative Vector Quantization
    Zhou, GY
    Mikhael, WB
    Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 617 - 620