Modeling long-range dependencies in speech data for text-independent speaker recognition

Cited by: 0

Authors:

Ming, Ji [1]

Lin, Jie [2]

Affiliations:

[1] Queens Univ Belfast, Inst ECIT, Belfast BT7 1NN, Antrim, North Ireland

[2] Univ Elect Sci & Technol China, Sch Comp Sci, Chengdu, Peoples R China

Source:

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008

Keywords:

time dependence; segment modeling; speaker modeling; speaker recognition

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

This paper proposes a new approach to modeling and matching long-range dependencies in free-text speech data for speaker recognition. The approach consists of a sentence model that captures dependencies in the training data up to the sentence level, and a search algorithm capable of locating matches of arbitrary-length segments between the training and testing sentences. The search algorithm is optimized to increase the probability of matching long, continuous segments rather than short, separated segments, on the assumption that long, continuous segments contain more speaker-specific information. The approach was evaluated on the NIST 1998 Speaker Recognition Evaluation database and showed improved performance.


Pages: 4825 / +

Page count: 2

