Fear emotion classification in speech by acoustic and behavioral cues

被引:6
|
作者
Yoon, Shin-ae [1 ]
Son, Guiyoung [1 ]
Kwon, Soonil [1 ]
机构
[1] Sejong Univ, Coll Software & Convergence Technol, Dept Software, 209 Neung Dong Ro, Seoul 05006, South Korea
关键词
Emotional speech classification; Emergency situation; Behavioral cue; Disfluency(interjection); Speech signal processing; VOCAL EXPRESSION; RECOGNITION; COMMUNICATION; DISCRETE; FEATURES;
D O I
10.1007/s11042-018-6329-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine-based emotional speech classification has become a requirement for natural and familiar human-computer interactions. Because emotional speech recognition systems use a person's voice to spontaneously detect their emotional status and take subsequent appropriate actions, they can be used widely for various reason in call centers and emotional based media services. Emotional speech recognition systems are primarily developed using emotional acoustic data. While there are several emotional acoustic databases available for emotion recognition systems in other countries, there is currently no real situational data related to the fear emotion available. Thus, in this study, we collected acoustic data recordings which represent real urgent and fearful situations from an emergency call center. To classify callers' emotions more accurately, we also included the additional behavioral feature interjection which can be classified as a type of disfluency which arises due to cognitive dysfunction observed in spontaneous speech when a speaker gets hyperemotional. We used Support Vector Machines (SVM), with the interjections feature, as well as conventionally used acoustic features (i.e., F0 variability, voice intensity variability, and Mel-Frequency Cepstral Coefficients; MFCCs) to identify which emotional category acoustic data fell into. The results of our study revealed that the MFCC was the best acoustic feature for spontaneous fear speech classification. In addition, we demonstrated the validity of behavioral features as an important criteria for emotional classification improvement.
引用
收藏
页码:2345 / 2366
页数:22
相关论文
共 50 条
  • [21] Emotion Classification from Speech and Text in Videos Using a Multimodal Approach
    Caschera, Maria Chiara
    Grifoni, Patrizia
    Ferri, Fernando
    MULTIMODAL TECHNOLOGIES AND INTERACTION, 2022, 6 (04)
  • [22] Bilingualism and children's use of paralinguistic cues to interpret emotion in speech
    Yow, W. Quin
    Markman, Ellen M.
    BILINGUALISM-LANGUAGE AND COGNITION, 2011, 14 (04) : 562 - 569
  • [23] Acoustic Characteristics of Emotional Speech Using Spectrogram Image Classification
    Stolar, Melissa
    Lech, Margaret
    Bolia, Robert S.
    Skinner, Michael
    2018 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2018,
  • [24] Acoustic cues to visual detection: A classification image study
    Pascucci, David
    Megna, Nicola
    Panichi, Michela
    Baldassi, Stefano
    JOURNAL OF VISION, 2011, 11 (06): : 1 - 11
  • [25] Automatic Speech Emotion Detection System using Multi-domain Acoustic Feature Selection and Classification Models
    Semwal, Nancy
    Kumar, Abhijeet
    Narayanan, Sakthivel
    2017 IEEE INTERNATIONAL CONFERENCE ON IDENTITY, SECURITY AND BEHAVIOR ANALYSIS (ISBA), 2017,
  • [26] Real Life Emotion Classification from Speech Using Gaussian Mixture Models
    Koolagudi, Shashidhar G.
    Barthwal, Anurag
    Devliyal, Swati
    Rao, K. Sreenivasa
    CONTEMPORARY COMPUTING, 2012, 306 : 250 - +
  • [27] Study of Wavelet Packet Energy Entropy for Emotion Classification in Speech and Glottal Signals
    He, Ling
    Lech, Margaret
    Zhang, Jing
    Ren, Xiaomei
    Deng, Lihua
    FIFTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2013), 2013, 8878
  • [28] Hybrid Approach for Emotion Classification of Audio Conversation Based on Text and Speech Mining
    Bhaskar, Jasmine
    Sruthi, K.
    Nedungadi, Prema
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 635 - 643
  • [29] A New Amharic Speech Emotion Dataset and Classification Benchmark
    Retta, Ephrem Afele
    Almekhlafi, Eiad
    Sutcliffe, Richard
    Mhamed, Mustafa
    Ali, Haider
    Feng, Jun
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (01)
  • [30] Multistage classification scheme to enhance speech emotion recognition
    Poorna, S. S.
    Nair, G. J.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (02) : 327 - 340