Hybrid machine learning classification scheme for speaker identification

被引:6
|
作者
Karthikeyan, V [1 ]
Priyadharsini, Suja S. [2 ]
机构
[1] Kalasalingam Inst Technol, Dept Elect & Commun Engn, Srivilliputhur 626126, Tamil Nadu, India
[2] Dept Elect & Commun Engn, Anna Univ, Reg Campus, Tirunelveli 627007, Tamil Nadu, India
关键词
equal error rate; machine learning; random forest; RF-SVM; speaker identification; support vector machine; RANDOM FOREST; RECOGNITION; FEATURES;
D O I
10.1111/1556-4029.15006
中图分类号
DF [法律]; D9 [法律]; R [医药、卫生];
学科分类号
0301 ; 10 ;
摘要
Motivated by the requirement to prepare for the next generation of "Automatic Spokesperson Recognition" (ASR) system, this paper applied the fused spectral features with hybrid machine learning (ML) strategy to the speech communication field. This strategy involved the combined spectral features such as mel-frequency cepstral coefficients (MFCCs), spectral kurtosis, spectral skewness, normalized pitch frequency (NPF), and formants. The characterization of suggested classification method could possibly serve in advanced speaker identification scenarios. Special attention was given to hybrid ML scheme capable of finding unknown speakers equipped with speaker id-detecting classifier technique, known as "Random Forest-Support Vector Machine" (RF-SVM). The extracted speaker precise spectral attributes are applied to the hybrid RF-SVM classifier to identify/verify the particular speaker. This work aims to construct an ensemble decision tree on a bounded area with minimal misclassification error using a hybrid ensemble RF-SVM strategy. A series of standard, real-time speaker databases, and noise conditions are functionally tested to validate its performance with other state-of-the-art mechanisms. The proposed fusion method succeeds in the speaker identification task with a high identification rate (97% avg) and lower equal error rate (EER) (<2%), compared with the individual schemes for the recorded experimental dataset. The robustness of the classifier is validated using the standard ELSDSR, TIMIT, and NIST audio datasets. Experiments on ELSDSR, TIMIT, and NIST datasets show that the hybrid classifier produces 98%, 99%, and 94% accuracy, and EERs were 2%, 1%, and 2% respectively. The findings are then compared with well-known other speaker recognition schemes and found to be superior.
引用
收藏
页码:1033 / 1048
页数:16
相关论文
共 50 条
  • [31] Identification and classification of materials using machine vision and machine learning in the context of industry 4.0
    Durga Prasad Penumuru
    Sreekumar Muthuswamy
    Premkumar Karumbu
    Journal of Intelligent Manufacturing, 2020, 31 : 1229 - 1241
  • [32] Identification and classification of materials using machine vision and machine learning in the context of industry 4.0
    Penumuru, Durga Prasad
    Muthuswamy, Sreekumar
    Karumbu, Premkumar
    JOURNAL OF INTELLIGENT MANUFACTURING, 2020, 31 (05) : 1229 - 1241
  • [33] The identification and localization of speaker using fusion techniques and machine learning techniques
    Ali, Rasha H.
    Abdullah, Mohammed Najm
    Abed, Buthainah F.
    EVOLUTIONARY INTELLIGENCE, 2024, 17 (01) : 133 - 149
  • [34] Hybrid Feature Extraction and Machine Learning Approach for Fruits and Vegetable Classification
    Bahia, Nimratveer Kaur
    Rani, Rajneesh
    Kamboj, Aman
    Kakkar, Deepti
    PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY, 2019, 27 (04): : 1693 - 1708
  • [35] An Intelligent Fault Detection and Classification Scheme for Distribution Lines Using Machine Learning
    Ponukumati, Balamurali Krishna
    Sinha, Pampa
    Maharana, Manoj Kumar
    Kumar, A. V. Pavan
    Karthik, Akkenaguntla
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2022, 12 (04) : 8972 - 8977
  • [36] A novel whispered speaker identification system based on extreme learning machine
    Sangeetha J.
    Jayasankar T.
    International Journal of Speech Technology, 2018, 21 (1) : 157 - 165
  • [37] Automatic Speaker Recognition System based on Optimised Machine Learning Algorithms
    Mokgonyane, Tumisho Billson
    Sefara, Tshephisho Joseph
    Modipa, Thipe Isaiah
    Manamela, Madimetja Jonas
    2019 IEEE AFRICON, 2019,
  • [38] The identification and localization of speaker using fusion techniques and machine learning techniques
    Rasha H. Ali
    Mohammed Najm Abdullah
    Buthainah F. Abed
    Evolutionary Intelligence, 2024, 17 : 133 - 149
  • [39] Enhancing Classification and Prediction through the Application of Hybrid Machine Learning Models
    Banda, Misheck
    Ngassam, Ernest Ketcha
    Mnkandla, Ernest
    2024 IST-AFRICA CONFERENCE, 2024,
  • [40] Machine learning in thermoelectric materials identification: Feature selection and analysis
    Xu, Yijing
    Jiang, Lu
    Qi, Xiang
    COMPUTATIONAL MATERIALS SCIENCE, 2021, 197