Automatic speaker recognition from speech signal using bidirectional long-short-term memory recurrent neural network

被引:12
作者
Devi, Kharibam Jilenkumari [1 ]
Thongam, Khelchandra [2 ]
机构
[1] Natl Inst Technol Manipur, Dept Elect & Commun Engn, Imphal 795004, Manipur, India
[2] Natl Inst Technol Manipur, Dept Comp Sci & Engn, Imphal, Manipur, India
关键词
Mel-frequency cepstral coefficient; probabilistic principal component analysis; recurrent neural network-bidirectional long short term memory; Wiener filter algorithm; IDENTIFICATION; VERIFICATION;
D O I
10.1111/coin.12278
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speaker recognition is a major challenge in various languages for researchers. For programmed speaker recognition structure prepared by utilizing ordinary speech, shouting creates a confusion between the enlistment and test, henceforth minimizing the identification execution as extreme vocal exertion is required during shouting. Speaker recognition requires more time for classification of data, accuracy is optimized, and the low root-mean-square error rate is the major problem. The objective of this work is to develop an efficient system of speaker recognition. In this work, an improved method of Wiener filter algorithm is applied for better noise reduction. To obtain the essential feature vector values, Mel-frequency cepstral coefficient feature extraction method is used on the noise-removed signals. Furthermore, input samples are created by using these extracted features after the dimensions have been reduced using probabilistic principal component analysis. Finally, recurrent neural network-bidirectional long-short-term memory is used for the classification to improve the prediction accuracy. For checking the effectiveness, the proposed work is compared with the existing methods based on accuracy, sensitivity, and error rate. The results obtained with the proposed method demonstrate an accuracy of 95.77%.
引用
收藏
页码:170 / 193
页数:24
相关论文
共 49 条
[41]   Real-Time Parameter Estimation of an Electrochemical Lithium-Ion Battery Model Using a Long Short-Term Memory Network [J].
Chun, Huiyong ;
Kim, Jungsoo ;
Yu, Jungwook ;
Han, Soohee .
IEEE ACCESS, 2020, 8 :81789-81799
[42]   Utilization of Micro-Doppler Radar to Classify Gait Patterns of Young and Elderly Adults: An Approach Using a Long Short-Term Memory Network [J].
Hayashi, Sora ;
Saho, Kenshi ;
Shioiri, Keitaro ;
Fujimoto, Masahiro ;
Masugi, Masao .
SENSORS, 2021, 21 (11)
[43]   Data-driven modeling of bridge buffeting in the time domain using long short-term memory network based on structural health monitoring [J].
Li, Shanwu ;
Li, Suchao ;
Laima, Shujin ;
Li, Hui .
STRUCTURAL CONTROL & HEALTH MONITORING, 2021, 28 (08)
[44]   DeepDRBP-2L: A New Genome Annotation Predictor for Identifying DNA-Binding Proteins and RNA-Binding Proteins Using Convolutional Neural Network and Long Short-Term Memory [J].
Zhang, Jun ;
Chen, Qingcai ;
Liu, Bin .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (04) :1451-1463
[45]   Qualitative and quantitative studies of phthalates in extra virgin olive oil (EVOO) by surface-enhanced Raman spectroscopy (SERS) combined with long short term memory (LSTM) neural network [J].
Wu, Xijun ;
Du, Zherui ;
Ma, Renqi ;
Zhang, Xin ;
Yang, Daolin ;
Liu, Hailong ;
Zhang, Yungang .
FOOD CHEMISTRY, 2024, 433
[46]   A novel long short-term memory neural-network-based self-excited force model of limit cycle oscillations of nonlinear flutter for various aerodynamic configurations [J].
Li, Wenjie ;
Laima, Shujin ;
Jin, Xiaowei ;
Yuan, Wenyong ;
Li, Hui .
NONLINEAR DYNAMICS, 2020, 100 (03) :2071-2087
[47]   Spatio-Temporal Feature Extraction for Pipeline Leak Detection in Smart Cities Using Acoustic Emission Signals: A One-Dimensional Hybrid Convolutional Neural Network-Long Short-Term Memory Approach [J].
Ullah, Saif ;
Ullah, Niamat ;
Siddique, Muhammad Farooq ;
Ahmad, Zahoor ;
Kim, Jong-Myon .
APPLIED SCIENCES-BASEL, 2024, 14 (22)
[48]   Polar Vortex Multi-Day Intensity Prediction Relying on New Deep Learning Model: A Combined Convolution Neural Network with Long Short-Term Memory Based on Gaussian Smoothing Method [J].
Peng, Kecheng ;
Cao, Xiaoqun ;
Liu, Bainian ;
Guo, Yanan ;
Xiao, Chaohao ;
Tian, Wenlong .
ENTROPY, 2021, 23 (10)
[49]   Inception 1D-convolutional neural network for accurate prediction of electrical insulator leakage current from environmental data during its normal operation using long-term recording [J].
Bueno-Barrachina, Jose-M. ;
Ye-Lin, Yiyao ;
Nieto-del-Amor, Felix ;
Fuster-Roig, Vicente .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119