Modeling long-range dependencies in speech data for text-independent speaker recognition

Authors
Ming, Ji [1]
Lin, Jie [2]
Affiliations
[1] Queens Univ Belfast, Inst ECIT, Belfast BT7 1NN, Antrim, North Ireland
[2] Univ Elect Sci & Technol China, Sch Comp Sci, Chengdu, Peoples R China
Source
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008
Keywords
time dependence; segment modeling; speaker modeling; speaker recognition
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper proposes a new approach to modeling and matching long-range dependencies in free-text speech data for speaker recognition. The approach consists of a sentence model that captures dependencies in the training data up to the sentence level, and a search algorithm that can locate matches of arbitrary-length segments between the training and testing sentences. The search algorithm is optimized to increase the probability of matching long, continuous segments rather than short, separated segments, on the assumption that long, continuous segments carry more speaker-specific information. The approach was evaluated on the NIST 1998 Speaker Recognition Evaluation database and showed improved performance.
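The idea of preferring one long, continuous matching segment over several short, separated ones can be illustrated with a small sketch. This is not the authors' algorithm: the abstract describes a probabilistic search, whereas the code below swaps in a plain dynamic-programming search for the longest run of consecutive frame pairs that lie within a distance threshold. The function name `longest_matching_segment`, the Euclidean distance, the `threshold` parameter, and the toy one-dimensional "frames" are all assumptions made for illustration.

```python
import math


def longest_matching_segment(train, test, threshold):
    """Find the longest run of consecutive, pairwise-close frames.

    train, test: sequences of feature vectors (hypothetical toy data here).
    threshold:   assumed Euclidean distance below which two frames "match".
    Returns (length, train_start, test_start) of the longest continuous match.
    """
    best = (0, 0, 0)
    # prev[j] holds the length of the matching run ending at
    # train frame i-1 and test frame j (1-based positions).
    prev = [0] * (len(test) + 1)
    for i, x in enumerate(train, start=1):
        curr = [0] * (len(test) + 1)
        for j, y in enumerate(test, start=1):
            if math.dist(x, y) < threshold:
                # Extend the run along the diagonal: consecutive frames
                # in both sentences must match for the run to grow.
                curr[j] = prev[j - 1] + 1
                if curr[j] > best[0]:
                    best = (curr[j], i - curr[j], j - curr[j])
        prev = curr
    return best


# Toy example: one-dimensional "frames"; values are made up.
train_frames = [[0.0], [10.0], [20.0], [30.0], [40.0], [50.0]]
test_frames = [[100.0], [20.0], [30.0], [40.0], [-50.0]]
result = longest_matching_segment(train_frames, test_frames, threshold=0.5)
```

In this toy case the three-frame run 20, 30, 40 is reported as a single segment rather than as three unrelated frame matches, which is the kind of long, continuous match the abstract says the search is meant to favor.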
Pages: 4825 / +
Page count: 2