Speech Emotion Recognition among Couples using the Peak-End Rule and Transfer Learning

被引：4

作者：

Boateng, George ^{[1
]}

Sels, Laura ^{[2
]}

Kuppens, Peter ^{[3
]}

Hilpert, Peter ^{[4
]}

Kowatsch, Tobias ^{[1
]}

机构：

[1] Swiss Fed Inst Technol, Zurich, Switzerland

[2] Univ Ghent, Ghent, Belgium

[3] Katholieke Univ Leuven, Leuven, Belgium

[4] Univ Surrey, Surrey, England

来源：

COMPANION PUBLICATON OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION (ICMI '20 COMPANION) | 2020年

关键词：

Speech emotion recognition; Speech processing; Affective computing; Couples; Transfer Learning; Peak-end rule; Convolutional neural network; Support vector machine; MODEL;

D O I：

10.1145/3395035.3425253

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Extensive couples' literature shows that how couples feel after a conflict is predicted by certain emotional aspects of that conversation. Understanding the emotions of couples leads to a better understanding of partners' mental well-being and consequently their relationships. Hence, automatic emotion recognition among couples could potentially guide interventions to help couples improve their emotional well-being and their relationships. It has been shown that people's global emotional judgment after an experience is strongly influenced by the emotional extremes and ending of that experience, known as the peak-end rule. In this work, we leveraged this theory and used machine learning to investigate, which audio segments can be used to best predict the end-of-conversation emotions of couples. We used speech data collected from 101 Dutch-speaking couples in Belgium who engaged in 10-minute long conversations in the lab. We extracted acoustic features from (1) the audio segments with the most extreme positive and negative ratings, and (2) the ending of the audio. We used transfer learning in which we extracted these acoustic features with a pre-trained convolutional neural network (YAMNet). We then used these features to train machine learning models - support vector machines - to predict the end-of-conversation valence ratings (positive vs negative) of each partner. The results of this work could inform how to best recognize the emotions of couples after conversation-sessions and eventually, lead to a better understanding of couples' relationships either in therapy or in everyday life.

引用

页码：17 / 21

页数：5

共 50 条

[21] Exploring Transfer Learning between Scripted and Spontaneous Speech for Emotion Recognition
Li, Qingqing
Chaspari, Theodora
ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 435 - 439
[22] A web crowdsourcing framework for transfer learning and personalized Speech Emotion Recognition
Vryzas, Nikolaos
Vrysis, Lazaros
Kotsakis, Rigas
Dimoulas, Charalampos
MACHINE LEARNING WITH APPLICATIONS, 2021, 6
[23] Feature Selection Based Transfer Subspace Learning for Speech Emotion Recognition
Song, Peng
Zheng, Wenming
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2020, 11 (03) : 373 - 382
[24] Analysis of thermal comfort experience using peak-end rule with air conditioner in heating season
Yang, Zixu
Cui, Mengdi
Xiao, Hansong
Sun, Hongli
Wang, Baolong
Lin, Borong
Shi, Wenxing
BUILDING AND ENVIRONMENT, 2023, 229
[25] Speech Emotion Recognition Using Deep Neural Networks, Transfer Learning, and Ensemble Classification Techniques
Mihalache, Serban
Burileanu, Dragos
ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2023, 26 (3-4): : 375 - 387
[26] Speech Emotion Recognition Using Deep Learning Transfer Models and Explainable Techniques
Kim, Tae-Wan
Kwak, Keun-Chang
APPLIED SCIENCES-BASEL, 2024, 14 (04):
[27] Speech Emotion Recognition Using Deep Learning
Ahmed, Waqar
Riaz, Sana
Iftikhar, Khunsa
Konur, Savas
ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 : 191 - 197
[28] Multi-branch feature learning based speech emotion recognition using SCAR-NET
Mao, Keji
Wang, Yuxiang
Ren, Ligang
Zhang, Jinhong
Qiu, Jiefan
Dai, Guanglin
CONNECTION SCIENCE, 2023, 35 (01)
[29] Forming Global Retrospective Evaluation: Duration Neglect and the Peak-End Rule
Bovan, Kosta
DRUSTVENA ISTRAZIVANJA, 2013, 22 (02): : 325 - 345
[30] Towards end-to-end speech recognition with transfer learning
Qin, Chu-Xiong
Qu, Dan
Zhang, Lian-Hai
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,

← 1 2 3 4 5 →