Sincerity in Acted Speech: Presenting the Sincere Apology Corpus and Results

被引:2
作者
Baird, Alice [1 ]
Coutinho, Eduardo [2 ]
Hirschberg, Julia [3 ]
Schuller, Bjoern [1 ,4 ]
机构
[1] Univ Augsburg, ZDB Chair Embedded Intelligence Hlth Care & Wellb, Augsburg, Germany
[2] Univ Liverpool, Dept Mus, Liverpool, Merseyside, England
[3] Columbia Univ, Comp Sci Dept, New York, NY 10027 USA
[4] Imperial Coll London, GLAM Grp Language Audio & Mus, London, England
来源
INTERSPEECH 2019 | 2019年
关键词
sincerity; acoustic features; deep data-representations; acted speech; speech corpus; DECEPTION; FORGIVENESS;
D O I
10.21437/Interspeech.2019-1349
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
The ability to discern an individual's level of sincerity varies from person to person and across cultures. Sincerity is typically a key indication of personality traits such as trustworthiness, and portraying sincerity can be integral to an abundance of scenarios, e. g., when apologising. Speech signals are one important factor when discerning sincerity and, with more modern interactions occurring remotely, automatic approaches for the recognition of sincerity from speech are beneficial during both interpersonal and professional scenarios. In this study we present details of the Sincere Apology Corpus (SINA-C). Annotated by 22 individuals for their perception of sincerity, SINA-C is an English acted-speech corpus of 32 speakers, apologising in multiple ways. To provide an updated baseline for the corpus, various machine learning experiments are conducted. Finding that extracting deep data-representations (utilising the DEEP SPECTRUM toolkit) from the speech signals is best suited. Classification results on the binary (sincere / not sincere) task are at best 79.2% Unweighted Average Recall and for regression, in regards to the degree of sincerity, a Root Mean Square Error of 0.395 from the standardised range [-1.51; 1.72] is obtained.
引用
收藏
页码:539 / 543
页数:5
相关论文
共 40 条
[1]  
Akehurst L, 1996, APPL COGNITIVE PSYCH, V10, P461, DOI 10.1002/(SICI)1099-0720(199612)10:6<461::AID-ACP413>3.0.CO
[2]  
2-2
[3]  
Amiriparian S., 2018, PROC CHALLENGE DETEC
[4]   Snore Sound Classification Using Image-based Deep Spectrum Features [J].
Amiriparian, Shahin ;
Gerczuk, Maurice ;
Ottl, Sandra ;
Cummins, Nicholas ;
Freitag, Michael ;
Pugachevskiy, Sergey ;
Baird, Alice ;
Schuller, Bjoern .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :3512-3516
[5]   Is deception emotional? An emotion-driven predictive approach [J].
Amiriparian, Shahin ;
Pohjalainen, Jouni ;
Marchi, Erik ;
Pugachevskiy, Sergey ;
Schuller, Bjorn .
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, :2011-2015
[6]   Western 'sincerity' and Confucian 'cheng' [J].
An, Y .
ASIAN PHILOSOPHY, 2004, 14 (02) :155-169
[7]  
[Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
[8]  
[Anonymous], 2016, INTERSPEECH, DOI DOI 10.21437/Interspeech.2016-956
[9]  
[Anonymous], 2009, SINCERITY AUTHENTICI
[10]   Forgiveness, Apology, and Communicative Responses to Hurtful Events [J].
Bachman, Guy Foster ;
Guerrero, Laura K. .
COMMUNICATION REPORTS, 2006, 19 (01) :45-56