Analysis of Linguistic and Prosodic Features of Bilingual Arabic&x2013;English Speakers for Speech Emotion Recognition

被引:12
作者
Abdel-Hamid, Lamiaa [1 ]
Shaker, Nabil H. [1 ]
Emara, Ingy [2 ]
机构
[1] Misr Int Univ, Fac Engn, Dept Elect & Commun, Cairo, Egypt
[2] Misr Int Univ, Fac Al Alsun & Mass Commun, Dept Al Alsun Languages, Cairo, Egypt
关键词
Linguistics; Databases; Speech recognition; Emotion recognition; Feature extraction; Semantics; TV; Arabic speech analysis; bilingual linguistic and prosodic features; classification; speech emotion recognition; FUNDAMENTAL-FREQUENCY; CLASSIFIERS; ENGLISH; WORDS;
D O I
10.1109/ACCESS.2020.2987864
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech emotion recognition (SER) research has usually focused on the analysis of the native language of speakers, most commonly, targeting European and Asian languages. In the present study, a bilingual Arabic/English speech emotion database elicited from 16 male and 16 female Egyptian participants was created in order to investigate how the linguistic and prosodic features were affected by the anger, fear, happiness and sadness emotions across Arabic and English emotional speech. The results of the linguistic analysis indicated that the participants preferred to express their emotions indirectly, mainly using religious references, and that the female participants tended to use language that was more tentative and emotionally expressive, while the male participants tended to use language that was more assertive and independent. As for the prosodic analysis, statistical t-tests showed that the prosodic features of pitch, intensity and speech rate were more indicative of anger and happiness while less relevant to fear and scarcely significant for sadness. Furthermore, speech emotion recognition performed using linear support vector machine (SVM) with AdaBoost also supported these results. In regard to first and second language linguistic features, there was no significant difference in the choice of words and structures expressing the different emotions in the two languages, but in terms of prosodic features, the females & x2019; speech showed higher pitch in Arabic in all cases while both genders showed close intensity values in the two languages and faster speech rate in Arabic than in English.
引用
收藏
页码:72957 / 72970
页数:14
相关论文
共 69 条
  • [1] Recognizing Emotion from Speech Based on Age and Gender Using Hierarchical Models
    Abu Shaqra, Ftoon
    Duwairi, Rehab
    Al-Ayyoub, Mahmoud
    [J]. 10TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2019) / THE 2ND INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40 2019) / AFFILIATED WORKSHOPS, 2019, 151 : 37 - 44
  • [2] Features and classifiers for emotion recognition from speech: a survey from 2000 to 2011
    Anagnostopoulos, Christos-Nikolaos
    Iliou, Theodoros
    Giannoukos, Ioannis
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2015, 43 (02) : 155 - 177
  • [3] Anagnostopoulos CN, 2010, STUD COMPUT INTELL, V279, P127, DOI 10.1007/978-3-642-11684-1_8
  • [4] [Anonymous], 1993, PROC I PHONETIC SCI, DOI DOI 10.1371/JOURNAL.PONE.0069107
  • [5] [Anonymous], 1975, LANGUAGE WOMENS PLAC
  • [6] [Anonymous], 2008, WOMEN FIRE DANGEROUS
  • [7] [Anonymous], 2000, LANGUAGE VARIATION S
  • [8] [Anonymous], IEEE ACCESS
  • [9] Was Darwin Wrong About Emotional Expressions?
    Barrett, Lisa Feldman
    [J]. CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE, 2011, 20 (06) : 400 - 406
  • [10] Batliner A, 2000, ART INTEL, P122