Empirical Comparison between Deep and Classical Classifiers for Speaker Verification in Emotional Talking Environments

Cited by: 2
Authors
Nassif, Ali Bou [1]
Shahin, Ismail [2]
Lataifeh, Mohammed [3]
Elnagar, Ashraf [3]
Nemmour, Nawel [1]
Affiliations
[1] Univ Sharjah, Comp Engn Dept, Sharjah 27272, U Arab Emirates
[2] Univ Sharjah, Elect Engn Dept, Sharjah 27272, U Arab Emirates
[3] Univ Sharjah, Comp Sci Dept, Sharjah 27272, U Arab Emirates
Keywords
classical classifiers; deep neural network; emotional speech; feature extraction; speaker verification;
DOI
10.3390/info13100456
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Speech signals carry various kinds of information about the speaker, such as age, gender, accent, language, health, and emotions. Emotions are conveyed through modulations of facial and vocal expressions. This paper conducts an empirical comparison of the performance of classical classifiers, namely Gaussian Mixture Model (GMM), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Artificial Neural Network (ANN), and deep learning classifiers, i.e., Long Short-Term Memory (LSTM), Convolutional Neural Network (CNN), and Gated Recurrent Unit (GRU), in addition to the i-vector approach, for a text-independent speaker verification task in neutral and emotional talking environments. The deep models undergo hyperparameter tuning using the Grid Search optimization algorithm. The models are trained and tested using a private Arabic Emirati Speech Database, the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS), and the public Crowd-Sourced Emotional Multimodal Actors (CREMA) database. Evaluation was carried out using the Equal Error Rate (EER) along with Area Under the Curve (AUC) scores. Experimental results illustrate that deep architectures do not necessarily outperform classical classifiers. The findings reveal that, among the classical classifiers, the GMM yields the lowest EER values and the best AUC scores across all datasets. In addition, the i-vector model surpasses all the fine-tuned deep models (CNN, LSTM, and GRU) on both evaluation metrics in neutral as well as emotional speech. Moreover, the GMM outperforms the i-vector on the Emirati and RAVDESS databases.
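The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. It assumes each verification trial yields one similarity score and a binary label (genuine or impostor). The function names and the threshold sweep below are illustrative choices, not the authors' evaluation code. EER is taken at the threshold where the false acceptance rate and false rejection rate are closest, and AUC is computed as the fraction of genuine/impostor pairs ranked correctly.

```python
import numpy as np


def far_frr(scores, labels):
    """Sweep every distinct score as a threshold (accept if score >= threshold).

    Returns the false acceptance rate and false rejection rate per threshold.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    thresholds = np.unique(scores)
    far = np.array([(scores[~labels] >= t).mean() for t in thresholds])
    frr = np.array([(scores[labels] < t).mean() for t in thresholds])
    return far, frr


def equal_error_rate(scores, labels):
    """Approximate EER: average FAR and FRR where they are closest."""
    far, frr = far_frr(scores, labels)
    i = np.argmin(np.abs(far - frr))
    return (far[i] + frr[i]) / 2.0


def area_under_curve(scores, labels):
    """AUC as the probability that a genuine trial outscores an impostor.

    Ties count as half a correct ranking.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    diff = scores[labels][:, None] - scores[~labels][None, :]
    return ((diff > 0).sum() + 0.5 * (diff == 0).sum()) / diff.size


# Hypothetical scores for four trials: two genuine, two impostor.
example_scores = [0.9, 0.4, 0.6, 0.1]
example_labels = [1, 1, 0, 0]
print(equal_error_rate(example_scores, example_labels))
print(area_under_curve(example_scores, example_labels))
```

A lower EER and a higher AUC both indicate better separation between genuine and impostor trials, which is the sense in which the abstract ranks the classifiers.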
Pages: 23