Annotating and modeling empathy in spoken conversations

Cited by: 47
Authors
Alam, Firoj [1]
Danieli, Morena [1]
Riccardi, Giuseppe [1]
Affiliations
[1] Univ Trento, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
Keywords
Empathy; Emotion; Spoken conversation; Behavior analysis; Affective scene; Affect; Call center; Human-Human conversation; CLASSIFICATION; SENTIMENT; DIFFUSION; AGREEMENT; EMOTIONS; PROSODY; SPEECH; WORDS; MOOD;
DOI
10.1016/j.csl.2017.12.003
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Empathy, as defined in behavioral sciences, expresses the ability of human beings to recognize, understand and react to emotions, attitudes and beliefs of others. In this paper, we address two related problems in automatic affective behavior analysis: the design of the annotation protocol and the automatic recognition of empathy from human-human dyadic spoken conversations. We propose and evaluate an annotation scheme for empathy inspired by the modal model of emotions. The annotation scheme was evaluated on a corpus of real-life, dyadic spoken conversations. In the context of behavioral analysis, we designed an automatic segmentation and classification system for empathy. Given the different speech and language levels of representation where empathy may be communicated, we investigated features derived from the lexical and acoustic spaces. The feature development process was designed to support both the fusion and automatic selection of relevant features from a high-dimensional space. The automatic classification system was evaluated on call center conversations, where it showed significantly better performance than the baseline. (C) 2017 Elsevier Ltd. All rights reserved.
Pages: 40-61
Page count: 22