Automatic speaker recognition from speech signal using bidirectional long-short-term memory recurrent neural network

被引：12

作者：

Devi, Kharibam Jilenkumari ^{[1
]}

Thongam, Khelchandra ^{[2
]}

机构：

[1] Natl Inst Technol Manipur, Dept Elect & Commun Engn, Imphal 795004, Manipur, India

[2] Natl Inst Technol Manipur, Dept Comp Sci & Engn, Imphal, Manipur, India

来源：

COMPUTATIONAL INTELLIGENCE | 2023年 / 39卷 / 02期

关键词：

Mel-frequency cepstral coefficient; probabilistic principal component analysis; recurrent neural network-bidirectional long short term memory; Wiener filter algorithm; IDENTIFICATION; VERIFICATION;

D O I：

10.1111/coin.12278

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Speaker recognition is a major challenge in various languages for researchers. For programmed speaker recognition structure prepared by utilizing ordinary speech, shouting creates a confusion between the enlistment and test, henceforth minimizing the identification execution as extreme vocal exertion is required during shouting. Speaker recognition requires more time for classification of data, accuracy is optimized, and the low root-mean-square error rate is the major problem. The objective of this work is to develop an efficient system of speaker recognition. In this work, an improved method of Wiener filter algorithm is applied for better noise reduction. To obtain the essential feature vector values, Mel-frequency cepstral coefficient feature extraction method is used on the noise-removed signals. Furthermore, input samples are created by using these extracted features after the dimensions have been reduced using probabilistic principal component analysis. Finally, recurrent neural network-bidirectional long-short-term memory is used for the classification to improve the prediction accuracy. For checking the effectiveness, the proposed work is compared with the existing methods based on accuracy, sensitivity, and error rate. The results obtained with the proposed method demonstrate an accuracy of 95.77%.

引用

页码：170 / 193

页数：24

共 49 条

[31] Seismic damage state predictions of reinforced concrete structures using stacked long short-term memory neural networks [J].

Ahmed, Bilal ;

Mangalathu, Sujith ;

Jeon, Jong-Su .

JOURNAL OF BUILDING ENGINEERING, 2022, 46

[32] A Hybrid Vibration Signal Prediction Model Using Autocorrelation Local Characteristic-Scale Decomposition and Improved Long Short Term Memory [J].

Tian, Hui-Xin ;

Ren, Dai-Xu ;

Li, Kun .

IEEE ACCESS, 2019, 7 :60995-61007

[33] Capacity estimation of lithium-ion batteries based on adaptive empirical wavelet transform and long short-term memory neural network [J].

El-Dalahmeh, Ma'd ;

Al-Greer, Maher ;

El-Dalahmeh, Moath ;

Bashir, Imran .

JOURNAL OF ENERGY STORAGE, 2023, 70

[34] Modeling of the dynamic hysteresis in DEAP actuator using an empirical mode decomposition based long-short term memory network [J].

Jiang, Zhaoguo ;

Li, Yuan ;

Wang, Qinglin .

JOURNAL OF INTELLIGENT MATERIAL SYSTEMS AND STRUCTURES, 2021, 32 (17) :2108-2123

[35] An Ensemble Hybrid Forecasting Model for Annual Runoff Based on Sample Entropy, Secondary Decomposition, and Long Short-Term Memory Neural Network [J].

Wang, Wen-chuan ;

Du, Yu-jin ;

Chau, Kwok-wing ;

Xu, Dong-mei ;

Liu, Chang-jun ;

Ma, Qiang .

WATER RESOURCES MANAGEMENT, 2021, 35 (14) :4695-4726

[36] Fault Detection Strategy of Vehicle Wheel Angle Signal via Long Short-Term Memory Network and Improved Sequential Probability Ratio Test [J].

Zou, Songchun ;

Zhao, Wanzhong ;

Wang, Chunyan ;

Chen, Feng .

IEEE SENSORS JOURNAL, 2021, 21 (15) :17290-17299

[37] Weather radar echo prediction method based on convolution neural network and Long Short-Term memory networks for sustainable e-agriculture [J].

Zhang, Lei ;

Huang, Zhenyue ;

Liu, Wei ;

Guo, Zhongli ;

Zhang, Zhe .

JOURNAL OF CLEANER PRODUCTION, 2021, 298

[38] Probabilistic Principal Component Analysis and Long Short-Term Memory Classifier for Automatic Detection of Alzheimer’s Disease using MRI Brain Images [J].

Suresha H.S. ;

Parthasarathy S.S. .

Journal of The Institution of Engineers (India): Series B, 2021, 102 (04) :807-818

[39] Probabilistic principal component analysis and long short-term memory classifier for automatic detection of Alzheimer's disease using MRI brain images [J].

Suresha, Halebeedu Subbaraya ;

Parthasarathy, Srirangapatna Sampathkumaran .

INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2022, 26 (01) :53-64

[40] ORI-Deep: improving the accuracy for predicting origin of replication sites by using a blend of features and long short-term memory network [J].

Shahid, Mahwish ;

Ilyas, Maham ;

Hussain, Waqar ;

Khan, Yaser Daanial .

BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)

← 1 2 3 4 5 →