Hybrid machine learning classification scheme for speaker identification

被引:6
|
作者
Karthikeyan, V [1 ]
Priyadharsini, Suja S. [2 ]
机构
[1] Kalasalingam Inst Technol, Dept Elect & Commun Engn, Srivilliputhur 626126, Tamil Nadu, India
[2] Dept Elect & Commun Engn, Anna Univ, Reg Campus, Tirunelveli 627007, Tamil Nadu, India
关键词
equal error rate; machine learning; random forest; RF-SVM; speaker identification; support vector machine; RANDOM FOREST; RECOGNITION; FEATURES;
D O I
10.1111/1556-4029.15006
中图分类号
DF [法律]; D9 [法律]; R [医药、卫生];
学科分类号
0301 ; 10 ;
摘要
Motivated by the requirement to prepare for the next generation of "Automatic Spokesperson Recognition" (ASR) system, this paper applied the fused spectral features with hybrid machine learning (ML) strategy to the speech communication field. This strategy involved the combined spectral features such as mel-frequency cepstral coefficients (MFCCs), spectral kurtosis, spectral skewness, normalized pitch frequency (NPF), and formants. The characterization of suggested classification method could possibly serve in advanced speaker identification scenarios. Special attention was given to hybrid ML scheme capable of finding unknown speakers equipped with speaker id-detecting classifier technique, known as "Random Forest-Support Vector Machine" (RF-SVM). The extracted speaker precise spectral attributes are applied to the hybrid RF-SVM classifier to identify/verify the particular speaker. This work aims to construct an ensemble decision tree on a bounded area with minimal misclassification error using a hybrid ensemble RF-SVM strategy. A series of standard, real-time speaker databases, and noise conditions are functionally tested to validate its performance with other state-of-the-art mechanisms. The proposed fusion method succeeds in the speaker identification task with a high identification rate (97% avg) and lower equal error rate (EER) (<2%), compared with the individual schemes for the recorded experimental dataset. The robustness of the classifier is validated using the standard ELSDSR, TIMIT, and NIST audio datasets. Experiments on ELSDSR, TIMIT, and NIST datasets show that the hybrid classifier produces 98%, 99%, and 94% accuracy, and EERs were 2%, 1%, and 2% respectively. The findings are then compared with well-known other speaker recognition schemes and found to be superior.
引用
收藏
页码:1033 / 1048
页数:16
相关论文
共 50 条
  • [21] The Use of Machine Learning Algorithms in Urban Tree Species Classification
    Cetin, Zehra
    Yastikli, Naci
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2022, 11 (04)
  • [22] Value of Geologically Derived Features in Machine Learning Facies Classification
    Halotel, Julie
    Demyanov, Vasily
    Gardiner, Andy
    MATHEMATICAL GEOSCIENCES, 2020, 52 (01) : 5 - 29
  • [23] Validation and Implementation of Customer Classification System using Machine Learning
    Yoon, Hyemin
    Kim, HyunJin
    Kim, Sangjin
    MEASUREMENT-INTERDISCIPLINARY RESEARCH AND PERSPECTIVES, 2024, 22 (02) : 131 - 140
  • [24] Rice Disease Classification Using Supervised Machine Learning Approach
    Jena, Kalyan Kumar
    Bhoi, Sourav Kumar
    Mohapatra, Debasis
    Mallick, Chittaranjan
    Swain, Prachi
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 328 - 333
  • [25] Fingerprints Classification through Image Analysis and Machine Learning Method
    Huong Thu Nguyen
    Long The Nguyen
    ALGORITHMS, 2019, 12 (11)
  • [26] Value of Geologically Derived Features in Machine Learning Facies Classification
    Julie Halotel
    Vasily Demyanov
    Andy Gardiner
    Mathematical Geosciences, 2020, 52 : 5 - 29
  • [27] Comparative Analysis of Network Fault Classification Using Machine Learning
    Kawasaki, Junichi
    Mouri, Genichi
    Suzuki, Yusuke
    NOMS 2020 - PROCEEDINGS OF THE 2020 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2020: MANAGEMENT IN THE AGE OF SOFTWARIZATION AND ARTIFICIAL INTELLIGENCE, 2020,
  • [28] Comparison of Performance of Machine Learning Algorithms for Cervical Cancer Classification
    Karani, Hamza
    Gangurde, Ashish
    Dhumal, Gauri
    Gautam, Waidehi
    Hiran, Samiksha
    Marathe, Abha
    2022 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL, COMPUTING, COMMUNICATION AND SUSTAINABLE TECHNOLOGIES (ICAECT), 2022,
  • [29] Machine Learning and Zombie Firms Classification
    Minami, Koutaroh
    Yasuda, Yukihiro
    APPLIED SCIENCES-BASEL, 2024, 14 (23):
  • [30] A Feature Level Fusion Scheme for Robust Speaker Identification
    Sekkate, Sara
    Khalil, Mohammed
    Adib, Abdellah
    BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018, 2018, 872 : 289 - 300