Cross-lingual deep learning model for gender-based emotion detection

被引:1
|
作者
Bhattacharya, Sudipta [1 ]
Mishra, Brojo Kishore [1 ]
Borah, Samarjeet [2 ]
Das, Nabanita [3 ]
Dey, Nilanjan [4 ]
机构
[1] GIET Univ, Dept Comp Sci & Engn, Gunupur, Odisha, India
[2] Sikkim Manipal Univ, Sikkim Manipal Inst Technol, Dept Comp Applicat, Sikkim, India
[3] Bengal Inst Technol, Kolkata, India
[4] Techno Int New Town, Dept Comp Sci & Engn, Kolkata, India
关键词
Gender-based emotion recognition; RAVDESS; EmoDB; Urdu language; Cross-lingual database; Ensemble soft voting classifier; Deep learning; SPEECH; RECOGNITION;
D O I
10.1007/s11042-023-16304-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In real-world applications, speech recognition is becoming increasingly popular. Human emotion and automatic gender recognition, which aims to identify male and female voices from any available emotional speech database, is an exciting application. It is noticeable that the performance of the automatic speech emotion and gender recognition system diminishes when cross-corpus circumstances exist, such as when multiple languages are present or a previously unknown language is present, such as Urdu. This study focuses on automatic emotion detection and gender identification from publicly available emotional speech databases. For this work, two public western language databases, namely, RAVDESS (English) and EmoDB (German), are combined for training, and the Urdu database is used for test purposes. The research reported that the k-fold ensemble soft-voting model, deep learning model, and augmented deep learning model obtained 79%, 82%, and 97.6% accuracy, respectively. The results are considerably better than those of many existing systems. The performance evaluation results are also encouraging. Many previous studies on speech emotion recognition have focused on various languages. The proposed technique is sufficiently robust and can efficiently detect emotion and identify gender from the Urdu database. The approach can be used in a wide range of applications.
引用
收藏
页码:25969 / 26007
页数:39
相关论文
共 50 条
  • [1] Cross-lingual deep learning model for gender-based emotion detection
    Sudipta Bhattacharya
    Brojo Kishore Mishra
    Samarjeet Borah
    Nabanita Das
    Nilanjan Dey
    Multimedia Tools and Applications, 2024, 83 (9) : 25969 - 26007
  • [2] Cross-lingual Emotion Detection
    Hassan, Sabit
    Shaar, Shaden
    Darwish, Kareem
    2022 Language Resources and Evaluation Conference, LREC 2022, 2022, : 6948 - 6958
  • [3] Cross-lingual Emotion Detection
    Hassan, Sabit
    Shaar, Shaden
    Darwish, Kareem
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6948 - 6958
  • [4] Emotion Detection in Cross-Lingual Text Based on Bidirectional LSTM
    Ren, Han
    Wan, Jing
    Ren, Yafeng
    SECURITY WITH INTELLIGENT COMPUTING AND BIG-DATA SERVICES, 2020, 895 : 838 - 845
  • [5] Cross-Lingual Transfert Learning for Speech Emotion Recognition
    Baklouti, Imen
    Ben Ahmed, Olfa
    Baklouti, Raoudha
    Fernandez, Christine
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES, SIGNAL AND IMAGE PROCESSING, ATSIP 2024, 2024, : 559 - 563
  • [6] Advancements in Bangla Speech Emotion Recognition: A Deep Learning Approach with Cross-Lingual Validation
    Alam, Khorshed
    Bhuiyan, Mahbubul Haq
    Hossain, Md Junayed
    Monir, Md Fahad
    Bin Khaled, Md Asif
    2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024,
  • [7] Advancements in Bangla Speech Emotion Recognition: A Deep Learning Approach with Cross-Lingual Validation
    Alam, Khorshed
    Bhuiyan, Mahbubul Haq
    Hossain, Md Junayed
    Monir, Md Fahad
    Khaled, Md Asif Bin
    IEEE Vehicular Technology Conference, 2024,
  • [8] Towards Cross-Lingual Emotion Transplantation
    Lorenzo-Trueba, Jaime
    Barra-Chicote, Roberto
    Yamagishi, Junichi
    Montero, Juan M.
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2014, 2014, 8854 : 199 - 208
  • [9] Towards cross-lingual emotion transplantation
    1600, Springer Verlag (8854):
  • [10] Cross-lingual Speaker Verification with Deep Feature Learning
    Li, Lantian
    Wang, Dong
    Rozi, Askar
    Zheng, Thomas Fang
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1040 - 1044