Fear emotion classification in speech by acoustic and behavioral cues

被引：6

作者：

Yoon, Shin-ae ^{[1
]}

Son, Guiyoung ^{[1
]}

Kwon, Soonil ^{[1
]}

机构：

[1] Sejong Univ, Coll Software & Convergence Technol, Dept Software, 209 Neung Dong Ro, Seoul 05006, South Korea

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2019年 / 78卷 / 02期

关键词：

Emotional speech classification; Emergency situation; Behavioral cue; Disfluency(interjection); Speech signal processing; VOCAL EXPRESSION; RECOGNITION; COMMUNICATION; DISCRETE; FEATURES;

D O I：

10.1007/s11042-018-6329-2

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Machine-based emotional speech classification has become a requirement for natural and familiar human-computer interactions. Because emotional speech recognition systems use a person's voice to spontaneously detect their emotional status and take subsequent appropriate actions, they can be used widely for various reason in call centers and emotional based media services. Emotional speech recognition systems are primarily developed using emotional acoustic data. While there are several emotional acoustic databases available for emotion recognition systems in other countries, there is currently no real situational data related to the fear emotion available. Thus, in this study, we collected acoustic data recordings which represent real urgent and fearful situations from an emergency call center. To classify callers' emotions more accurately, we also included the additional behavioral feature interjection which can be classified as a type of disfluency which arises due to cognitive dysfunction observed in spontaneous speech when a speaker gets hyperemotional. We used Support Vector Machines (SVM), with the interjections feature, as well as conventionally used acoustic features (i.e., F0 variability, voice intensity variability, and Mel-Frequency Cepstral Coefficients; MFCCs) to identify which emotional category acoustic data fell into. The results of our study revealed that the MFCC was the best acoustic feature for spontaneous fear speech classification. In addition, we demonstrated the validity of behavioral features as an important criteria for emotional classification improvement.

引用

页码：2345 / 2366

页数：22

共 50 条

[21] Emotion Classification from Speech and Text in Videos Using a Multimodal Approach
Caschera, Maria Chiara
Grifoni, Patrizia
Ferri, Fernando
MULTIMODAL TECHNOLOGIES AND INTERACTION, 2022, 6 (04)
[22] Bilingualism and children's use of paralinguistic cues to interpret emotion in speech
Yow, W. Quin
Markman, Ellen M.
BILINGUALISM-LANGUAGE AND COGNITION, 2011, 14 (04) : 562 - 569
[23] Acoustic Characteristics of Emotional Speech Using Spectrogram Image Classification
Stolar, Melissa
Lech, Margaret
Bolia, Robert S.
Skinner, Michael
2018 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2018,
[24] Acoustic cues to visual detection: A classification image study
Pascucci, David
Megna, Nicola
Panichi, Michela
Baldassi, Stefano
JOURNAL OF VISION, 2011, 11 (06): : 1 - 11
[25] Automatic Speech Emotion Detection System using Multi-domain Acoustic Feature Selection and Classification Models
Semwal, Nancy
Kumar, Abhijeet
Narayanan, Sakthivel
2017 IEEE INTERNATIONAL CONFERENCE ON IDENTITY, SECURITY AND BEHAVIOR ANALYSIS (ISBA), 2017,
[26] Real Life Emotion Classification from Speech Using Gaussian Mixture Models
Koolagudi, Shashidhar G.
Barthwal, Anurag
Devliyal, Swati
Rao, K. Sreenivasa
CONTEMPORARY COMPUTING, 2012, 306 : 250 - +
[27] Study of Wavelet Packet Energy Entropy for Emotion Classification in Speech and Glottal Signals
He, Ling
Lech, Margaret
Zhang, Jing
Ren, Xiaomei
Deng, Lihua
FIFTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2013), 2013, 8878
[28] Hybrid Approach for Emotion Classification of Audio Conversation Based on Text and Speech Mining
Bhaskar, Jasmine
Sruthi, K.
Nedungadi, Prema
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 635 - 643
[29] A New Amharic Speech Emotion Dataset and Classification Benchmark
Retta, Ephrem Afele
Almekhlafi, Eiad
Sutcliffe, Richard
Mhamed, Mustafa
Ali, Haider
Feng, Jun
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (01)
[30] Multistage classification scheme to enhance speech emotion recognition
Poorna, S. S.
Nair, G. J.
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (02) : 327 - 340

← 1 2 3 4 5 →