Deep Learning and Machine Learning Techniques Applied to Speaker Identification on Small Datasets

Cited by: 0
Authors
Manfron, Enrico [1 ,2 ,3 ]
Teixeira, Joao Paulo [1 ,2 ]
Minetto, Rodrigo [3 ]
Affiliations
[1] Inst Politecn Braganca, Res Ctr Digitalizat & Intelligent Robot CeDRI, Campus Santa Apolonia, P-5300253 Braganca, Portugal
[2] Inst Politecn Braganca, Associate Lab Sustainabil & Technol SusTEC, Campus Santa Apolonia, P-5300253 Braganca, Portugal
[3] Univ Tecnol Fed Parana, BR-80230901 Curitiba, Parana, Brazil
Source
OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, PT II, OL2A 2023 | 2024 / Vol. 1982
Keywords
Speaker Identification; Convolutional Neural Network; Deep Learning; RECOGNITION;
DOI
10.1007/978-3-031-53036-4_14
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Subject classification codes
081203; 0835;
Abstract
In this study, we explore the capabilities of speaker recognition technology for biometric authentication, with the aim of developing speaker recognition-based access control systems and providing a resource for future research and improvements in secure and efficient speaker identification solutions. We focused on developing and evaluating machine learning and deep learning models for speaker identification. The models were trained and tested on private datasets with 32 speakers and public datasets with 1251 to 6112 speakers. The Gaussian Mixture Model performed well on our private datasets, correctly identifying the speakers with accuracies of 93.10% and 95%. The Multilayer Perceptron achieved a peak accuracy of 93.33% on the Framed Trim private dataset. The VGGM model, after initial training on larger datasets, achieved accuracies of 90.34% and 98.33% on our private datasets. Finally, the ResNet50 model slightly outperformed the other models on two versions of our private dataset, achieving accuracies of 97.93% and 100%.
Pages: 195-210
Page count: 16