Hybrid machine learning classification scheme for speaker identification

被引:6
|
作者
Karthikeyan, V [1 ]
Priyadharsini, Suja S. [2 ]
机构
[1] Kalasalingam Inst Technol, Dept Elect & Commun Engn, Srivilliputhur 626126, Tamil Nadu, India
[2] Dept Elect & Commun Engn, Anna Univ, Reg Campus, Tirunelveli 627007, Tamil Nadu, India
关键词
equal error rate; machine learning; random forest; RF-SVM; speaker identification; support vector machine; RANDOM FOREST; RECOGNITION; FEATURES;
D O I
10.1111/1556-4029.15006
中图分类号
DF [法律]; D9 [法律]; R [医药、卫生];
学科分类号
0301 ; 10 ;
摘要
Motivated by the requirement to prepare for the next generation of "Automatic Spokesperson Recognition" (ASR) system, this paper applied the fused spectral features with hybrid machine learning (ML) strategy to the speech communication field. This strategy involved the combined spectral features such as mel-frequency cepstral coefficients (MFCCs), spectral kurtosis, spectral skewness, normalized pitch frequency (NPF), and formants. The characterization of suggested classification method could possibly serve in advanced speaker identification scenarios. Special attention was given to hybrid ML scheme capable of finding unknown speakers equipped with speaker id-detecting classifier technique, known as "Random Forest-Support Vector Machine" (RF-SVM). The extracted speaker precise spectral attributes are applied to the hybrid RF-SVM classifier to identify/verify the particular speaker. This work aims to construct an ensemble decision tree on a bounded area with minimal misclassification error using a hybrid ensemble RF-SVM strategy. A series of standard, real-time speaker databases, and noise conditions are functionally tested to validate its performance with other state-of-the-art mechanisms. The proposed fusion method succeeds in the speaker identification task with a high identification rate (97% avg) and lower equal error rate (EER) (<2%), compared with the individual schemes for the recorded experimental dataset. The robustness of the classifier is validated using the standard ELSDSR, TIMIT, and NIST audio datasets. Experiments on ELSDSR, TIMIT, and NIST datasets show that the hybrid classifier produces 98%, 99%, and 94% accuracy, and EERs were 2%, 1%, and 2% respectively. The findings are then compared with well-known other speaker recognition schemes and found to be superior.
引用
收藏
页码:1033 / 1048
页数:16
相关论文
共 50 条
  • [1] Adaptive boosted random forest-support vector machine based classification scheme for speaker identification
    V K.
    S S.P.
    Applied Soft Computing, 2022, 131
  • [2] A strong hybrid AdaBoost classification algorithm for speaker recognition
    V Karthikeyan
    S Suja Priyadharsini
    Sādhanā, 2021, 46
  • [3] A strong hybrid AdaBoost classification algorithm for speaker recognition
    Karthikeyan, V
    Suja Priyadharsini, S.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2021, 46 (03):
  • [4] Deep Learning and Machine Learning Techniques Applied to Speaker Identification on Small Datasets
    Manfron, Enrico
    Teixeira, Joao Paulo
    Minetto, Rodrigo
    OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, PT II, OL2A 2023, 2024, 1982 : 195 - 210
  • [5] Hybrid Machine Learning Scheme for Classification of BECTS and TLE Patients Using EEG Brain Signals
    Yang, Wonsik
    Joo, Minsoo
    Kim, Yujaung
    Kim, Se Hee
    Chung, Jong-Moon
    IEEE ACCESS, 2020, 8 : 218924 - 218935
  • [6] Development of a regional voice dataset and speaker classification based on machine learning
    Ismail, Muhammad
    Memon, Shahzad
    Dhomeja, Lachhman Das
    Shah, Shahid Munir
    Hussain, Dostdar
    Rahim, Sabit
    Ali, Imran
    JOURNAL OF BIG DATA, 2021, 8 (01)
  • [7] Fetal health classification from cardiotocographic data using machine learning
    Mehbodniya, Abolfazl
    Lazar, Arokia Jesu Prabhu
    Webber, Julian
    Sharma, Dilip Kumar
    Jayagopalan, Santhosh
    Kousalya, K.
    Singh, Pallavi
    Rajan, Regin
    Pandya, Sharnil
    Sengan, Sudhakar
    EXPERT SYSTEMS, 2022, 39 (06)
  • [8] Diabetes Classification Using Machine Learning Techniques
    Phongying, Methaporn
    Hiriote, Sasiprapa
    COMPUTATION, 2023, 11 (05)
  • [9] Machine Learning for identification and classification of Foraminifera: Testing on monothalamids
    Sabbatini, Anna
    Caridi, Francesca
    Potena, Domenico
    Negri, Alessandra
    MARINE MICROPALEONTOLOGY, 2025, 195
  • [10] Speaker Identification Under Noisy Conditions Using Hybrid Deep Learning Model
    Lambamo, Wondimu
    Srinivasagan, Ramasamy
    Jifara, Worku
    PAN-AFRICAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PT I, PANAFRICON AI 2023, 2024, 2068 : 154 - 175