Hybrid machine learning classification scheme for speaker identification

被引:6
|
作者
Karthikeyan, V [1 ]
Priyadharsini, Suja S. [2 ]
机构
[1] Kalasalingam Inst Technol, Dept Elect & Commun Engn, Srivilliputhur 626126, Tamil Nadu, India
[2] Dept Elect & Commun Engn, Anna Univ, Reg Campus, Tirunelveli 627007, Tamil Nadu, India
关键词
equal error rate; machine learning; random forest; RF-SVM; speaker identification; support vector machine; RANDOM FOREST; RECOGNITION; FEATURES;
D O I
10.1111/1556-4029.15006
中图分类号
DF [法律]; D9 [法律]; R [医药、卫生];
学科分类号
0301 ; 10 ;
摘要
Motivated by the requirement to prepare for the next generation of "Automatic Spokesperson Recognition" (ASR) system, this paper applied the fused spectral features with hybrid machine learning (ML) strategy to the speech communication field. This strategy involved the combined spectral features such as mel-frequency cepstral coefficients (MFCCs), spectral kurtosis, spectral skewness, normalized pitch frequency (NPF), and formants. The characterization of suggested classification method could possibly serve in advanced speaker identification scenarios. Special attention was given to hybrid ML scheme capable of finding unknown speakers equipped with speaker id-detecting classifier technique, known as "Random Forest-Support Vector Machine" (RF-SVM). The extracted speaker precise spectral attributes are applied to the hybrid RF-SVM classifier to identify/verify the particular speaker. This work aims to construct an ensemble decision tree on a bounded area with minimal misclassification error using a hybrid ensemble RF-SVM strategy. A series of standard, real-time speaker databases, and noise conditions are functionally tested to validate its performance with other state-of-the-art mechanisms. The proposed fusion method succeeds in the speaker identification task with a high identification rate (97% avg) and lower equal error rate (EER) (<2%), compared with the individual schemes for the recorded experimental dataset. The robustness of the classifier is validated using the standard ELSDSR, TIMIT, and NIST audio datasets. Experiments on ELSDSR, TIMIT, and NIST datasets show that the hybrid classifier produces 98%, 99%, and 94% accuracy, and EERs were 2%, 1%, and 2% respectively. The findings are then compared with well-known other speaker recognition schemes and found to be superior.
引用
收藏
页码:1033 / 1048
页数:16
相关论文
共 50 条
  • [11] Classification of land use and land cover through machine learning algorithms: a literature review
    Tobar-Diaz, Rene
    Gao, Yan
    Mas, Jean Francois
    Cambron-Sandoval, Victor Hugo
    REVISTA DE TELEDETECCION, 2023, (62): : 1 - 19
  • [12] Classification of WatSan Technologies Using Machine Learning Techniques
    Al Nuaimi, Hala
    Abdelmagid, Mohamed
    Bouabid, Ali
    Chrysikopoulos, Constantinos V. V.
    Maalouf, Maher
    WATER, 2023, 15 (15)
  • [13] Machine Learning Techniques for Diabetes Classification: A Comparative Study
    Mustafa, Hiri
    Mohamed, Chrayah
    Nabil, Ourdani
    Noura, Aknin
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 785 - 790
  • [14] Classification of Product Review Sentiment by NLP and Machine Learning
    Das, Rely
    Hossain, Forhad
    Ahmed, Taufiq
    Devanath, Ananyna
    Akter, Shahnaz
    Sattar, Abdus
    2022 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL, COMPUTING, COMMUNICATION AND SUSTAINABLE TECHNOLOGIES (ICAECT), 2022,
  • [15] Patient care classification using machine learning techniques
    Melhem, Shatha
    Al-Aiad, Ahmad
    Al-Ayyad, Muhammad Saleh
    2021 12TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2021, : 57 - 62
  • [16] Hypertension Classification using Machine Learning Part II
    Nasir, Nida
    Oswald, Paul
    Barneih, Feras
    Alshaltone, Omar
    AlShabi, Mohammad
    Bonny, Talal
    Al Shammaa, Ahmed
    2021 14TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2021, : 459 - 463
  • [17] Identification and Classification of Urban PLES Spatial Functions Based on Multisource Data and Machine Learning
    Fu, Jingying
    Bu, Ziqiang
    Jiang, Dong
    Lin, Gang
    LAND, 2022, 11 (10)
  • [18] Automatic Speaker Recognition System based on Machine Learning Algorithms
    Mokgonyane, Tumisho Billson
    Sefara, Tshephisho Joseph
    Modipa, Thipe Isaiah
    Mogale, Mercy Mosibudi
    Manamela, Madimetja Jonas
    Manamela, Phuti John
    2019 SOUTHERN AFRICAN UNIVERSITIES POWER ENGINEERING CONFERENCE/ROBOTICS AND MECHATRONICS/PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA (SAUPEC/ROBMECH/PRASA), 2019, : 141 - 146
  • [19] Classification and prediction of diabetes disease using machine learning paradigm
    Maniruzzaman, Md.
    Rahman, Md. Jahanur
    Ahammed, Benojir
    Abedin, Md. Menhazul
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2020, 8 (01)
  • [20] Classification Prediction of Lung Cancer Based on Machine Learning Method
    Li, Dantong
    Li, Guixin
    Li, Shuang
    Bang, Ashley
    INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS, 2024, 19 (01)