Deep Learning, Ensemble and Supervised Machine Learning for Arabic Speech Emotion Recognition

被引:0
|
作者
Ismaiel, Wahiba
Alhalangy, Abdalilah [1 ,2 ]
Mohamed, Adil. O. Y. [2 ]
Musa, Abdalla Ibrahim Abdalla [2 ]
机构
[1] Taif Univ, Univ Coll Ranyah, Dept Sci & Technol, Taif, Saudi Arabia
[2] Qassim Univ, Coll Comp, Dept Comp Engn, Buraydah, Saudi Arabia
关键词
Arabic speech emotion recognition; ANAD; SERDNN; SOM; Xgboost; Adaboost; DT; KNN; RANDOM FOREST;
D O I
10.48084/etasr.7134
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Today, automatic emotion recognition in speech is one of the most important areas of research in signal processing. Identifying emotional content in Arabic speech is regarded as a very challenging and intricate task due to several obstacles, such as the wide range of cultures and dialects, the influence of cultural factors on emotional expression, and the scarcity of available datasets. This study used a variety of artificial intelligence models, including Xgboost, Adaboost, KNN, DT, and SOM, and a deep -learning model named SERDNN. ANAD was employed as a training dataset, which contains three emotions, "angry", "happy", and "surprised", with 844 features. This study aimed to present a more efficient and accurate technique for recognizing emotions in Arabic speech. Precision, accuracy, recall, and F1 -score metrics were utilized to evaluate the effectiveness of the proposed techniques. The results showed that the Xgboost, SOM, and KNN classifiers achieved superior performance in recognizing emotions in Arabic speech. The SERDNN deep learning model outperformed the other techniques, achieving the highest accuracy of 97.40% with a loss rate of 0.1457. Therefore, it can be relied upon and deployed to recognize emotions in Arabic speech.
引用
收藏
页码:13757 / 13764
页数:8
相关论文
共 50 条
  • [11] Emotion Recognition in Speech with Deep Learning Architectures
    Erdal, Mehmet
    Kaechele, Markus
    Schwenker, Friedhelm
    ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, 2016, 9896 : 298 - 311
  • [12] Speech Emotion Recognition Using Deep Learning
    Alagusundari, N.
    Anuradha, R.
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843 : 313 - 325
  • [13] Speech Emotion Recognition Using Deep Learning
    Ahmed, Waqar
    Riaz, Sana
    Iftikhar, Khunsa
    Konur, Savas
    ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 : 191 - 197
  • [14] Multimodal Arabic emotion recognition using deep learning
    Al Roken, Noora
    Barlas, Gerassimos
    SPEECH COMMUNICATION, 2023, 155
  • [15] An Ensemble Deep Learning Approach for Emotion Detection in Arabic Tweets
    Mansy, Alaa
    Rady, Sherine
    Gharib, Tarek
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (04) : 980 - 990
  • [16] Speech Emotion Recognition Using Deep Neural Network and Extreme Learning Machine
    Han, Kun
    Yu, Dong
    Tashev, Ivan
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 223 - 227
  • [17] Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition
    Xu, Xinzhou
    Deng, Jun
    Coutinho, Eduardo
    Wu, Chen
    Zhao, Li
    Schuller, Bjoern W.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (03) : 795 - 808
  • [18] Emotion Recognition System via Facial Expressions and Speech Using Machine Learning and Deep Learning Techniques
    Chaudhari A.
    Bhatt C.
    Nguyen T.T.
    Patel N.
    Chavda K.
    Sarda K.
    SN Computer Science, 4 (4)
  • [19] Ensemble Learning of Hybrid Acoustic Features for Speech Emotion Recognition
    Zvarevashe, Kudakwashe
    Olugbara, Oludayo
    ALGORITHMS, 2020, 13 (03)
  • [20] Emotion Recognition On Speech Signals Using Machine Learning
    Ghai, Mohan
    Lal, Shamit
    Duggal, Shivam
    Manik, Shrey
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS AND COMPUTATIONAL INTELLIGENCE (ICBDAC), 2017, : 34 - 39