Automatic text-independent speaker verification using convolutional deep belief network

被引：4

作者：

Rakhmanenko, I. A. ^{[1
]}

Shelupanov, A. A. ^{[1
]}

Kostyuchenko, E. Y. ^{[1
]}

机构：

[1] Tomsk State Univ Control Syst & Radioelect, Prospect Lenina 40, Tomsk 634050, Russia

来源：

COMPUTER OPTICS | 2020年 / 44卷 / 04期

关键词：

speaker recognition; speaker verification; Gaussian mixture models; GMM-UBM system; speech features; speech processing; deep learning; neural networks; pattern recognition; RECOGNITION;

D O I：

10.18287/2412-6179-CO-621

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

This paper is devoted to the use of the convolutional deep belief network as a speech feature extractor for automatic text-independent speaker verification. The paper describes the scope and problems of automatic speaker verification systems. Types of modern speaker verification systems and types of speech features used in speaker verification systems are considered. The structure and learning algorithm of convolutional deep belief networks is described. The use of speech features extracted from three layers of a trained convolution deep belief network is proposed. Experimental studies of the proposed features were performed on two speech corpora: own speech corpus including audio recordings of 50 speakers and TIMIT speech corpus including audio recordings of 630 speakers. The accuracy of the proposed features was assessed using different types of classifiers. Direct use of these features did not increase the accuracy compared to the use of traditional spectral speech features, such as mel-frequency cepstral coefficients. However, the use of these features in the classifiers ensemble made it possible to achieve a reduction of the equal error rate to 0.21% on 50-speaker speech corpus and to 0.23% on the TIMIT speech corpus.

引用

页码：596 / +

页数：12

共 50 条

[21] Text-independent speaker verification using Support Vector Machines
Kharroubi, J
Chollet, G
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4017 - 4017
[22] Language dependency in text-independent speaker verification
Auckenthaler, R
Carey, MJ
Mason, JSD
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 441 - 444
[23] Robust text-independent speaker verification using genetic programming
Day, Peter
Nandi, Asoke K.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 285 - 295
[24] A ROBUST TEXT-INDEPENDENT SPEAKER VERIFICATION METHOD BASED ON SPEECH SEPARATION AND DEEP SPEAKER
Zhao, Fei
Li, Hao
Zhang, Xueliang
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6101 - 6105
[25] Text-independent speaker verification in embedded environments
Tydlitat, Borivoj
Navratil, Jiri
Pelecanos, Jason W.
Ramaswamy, Ganesh N.
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 293 - +
[26] ORTHOGONAL TRAINING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Zhu, Yingke
Mak, Brian
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6584 - 6588
[27] Adaptive method for text-independent speaker verification
Zhang, Yiying, 2000, (11):
[28] Collaborative and adversarial network for text-independent speaker verification in domain adaptation
Qiang, Junhao
Yang, Qun
Gao, Jie
Liu, Shaohan
ELECTRONICS LETTERS, 2023, 59 (02)
[29] Text-independent speaker verification using speaker clustering and support vector machines
Hou, FL
Wang, BX
2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 456 - 459
[30] Analysis-Based Optimization of Temporal Dynamic Convolutional Neural Network for Text-Independent Speaker Verification
Kim, Seong-Hu
Nam, Hyeonuk
Park, Yong-Hwa
IEEE ACCESS, 2023, 11 : 60646 - 60659

← 1 2 3 4 5 →