Hybrid machine learning classification scheme for speaker identification

Cited by: 6
Authors
Karthikeyan, V [1]
Priyadharsini, Suja S. [2]
Affiliations
[1] Kalasalingam Inst Technol, Dept Elect & Commun Engn, Srivilliputhur 626126, Tamil Nadu, India
[2] Dept Elect & Commun Engn, Anna Univ, Reg Campus, Tirunelveli 627007, Tamil Nadu, India
Keywords
equal error rate; machine learning; random forest; RF-SVM; speaker identification; support vector machine; recognition; features
DOI
10.1111/1556-4029.15006
Chinese Library Classification
DF [Law]; D9 [Law]; R [Medicine and Health]
Subject classification code
0301; 10
Abstract
Motivated by the need to prepare for the next generation of "Automatic Spokesperson Recognition" (ASR) systems, this paper applies fused spectral features with a hybrid machine learning (ML) strategy to the speech communication field. The strategy combines spectral features such as mel-frequency cepstral coefficients (MFCCs), spectral kurtosis, spectral skewness, normalized pitch frequency (NPF), and formants. The proposed classification method could serve in advanced speaker identification scenarios. Special attention is given to a hybrid ML scheme, the "Random Forest-Support Vector Machine" (RF-SVM) classifier, which is capable of identifying unknown speakers. The extracted speaker-specific spectral attributes are fed to the hybrid RF-SVM classifier to identify or verify a particular speaker. The work aims to construct an ensemble decision tree on a bounded area with minimal misclassification error using the hybrid ensemble RF-SVM strategy. The method is tested on a series of standard and real-time speaker databases under noise conditions, and its performance is compared with other state-of-the-art mechanisms. On the recorded experimental dataset, the proposed fusion method achieves a high identification rate (97% on average) and a lower equal error rate (EER) (<2%) than the individual schemes. The robustness of the classifier is validated on the standard ELSDSR, TIMIT, and NIST audio datasets, where the hybrid classifier reaches accuracies of 98%, 99%, and 94% with EERs of 2%, 1%, and 2%, respectively. Compared with other well-known speaker recognition schemes, the proposed method is found to be superior.
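The equal error rate quoted in the abstract is the operating point at which the false acceptance rate (impostors accepted) equals the false rejection rate (genuine speakers rejected). The following is a minimal sketch of how such a figure can be estimated from verification scores; it is not the authors' evaluation code. The function name `equal_error_rate`, the assumption that a higher score means "more likely the claimed speaker", and the toy scores are all illustrative assumptions.

```python
def equal_error_rate(genuine, impostor):
    """Estimate the EER from two lists of similarity scores.

    Assumption: higher score = more likely the claimed speaker.
    Returns (eer, threshold), where threshold is the candidate at which
    the false acceptance and false rejection rates are closest.
    """
    if not genuine or not impostor:
        raise ValueError("both score lists must be non-empty")

    best = None  # (gap, eer, threshold)
    # Every observed score is a candidate decision threshold.
    for t in sorted(set(genuine) | set(impostor)):
        # FAR: fraction of impostor trials accepted at threshold t.
        far = sum(s >= t for s in impostor) / len(impostor)
        # FRR: fraction of genuine trials rejected at threshold t.
        frr = sum(s < t for s in genuine) / len(genuine)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


# Hypothetical toy scores, for illustration only.
genuine_scores = [0.9, 0.8, 0.7, 0.4]
impostor_scores = [0.1, 0.2, 0.3, 0.6]
eer, threshold = equal_error_rate(genuine_scores, impostor_scores)
print(f"EER = {eer:.2%} at threshold {threshold}")
```

With a finite set of trials the two error rates rarely cross exactly, so the sketch reports the midpoint of the two rates at the closest threshold; interpolating between neighbouring thresholds is a common refinement.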
Pages: 1033-1048
Page count: 16