Speech Emotion Recognition among Couples using the Peak-End Rule and Transfer Learning

被引:4
|
作者
Boateng, George [1 ]
Sels, Laura [2 ]
Kuppens, Peter [3 ]
Hilpert, Peter [4 ]
Kowatsch, Tobias [1 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] Univ Ghent, Ghent, Belgium
[3] Katholieke Univ Leuven, Leuven, Belgium
[4] Univ Surrey, Surrey, England
来源
COMPANION PUBLICATON OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION (ICMI '20 COMPANION) | 2020年
关键词
Speech emotion recognition; Speech processing; Affective computing; Couples; Transfer Learning; Peak-end rule; Convolutional neural network; Support vector machine; MODEL;
D O I
10.1145/3395035.3425253
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Extensive couples' literature shows that how couples feel after a conflict is predicted by certain emotional aspects of that conversation. Understanding the emotions of couples leads to a better understanding of partners' mental well-being and consequently their relationships. Hence, automatic emotion recognition among couples could potentially guide interventions to help couples improve their emotional well-being and their relationships. It has been shown that people's global emotional judgment after an experience is strongly influenced by the emotional extremes and ending of that experience, known as the peak-end rule. In this work, we leveraged this theory and used machine learning to investigate, which audio segments can be used to best predict the end-of-conversation emotions of couples. We used speech data collected from 101 Dutch-speaking couples in Belgium who engaged in 10-minute long conversations in the lab. We extracted acoustic features from (1) the audio segments with the most extreme positive and negative ratings, and (2) the ending of the audio. We used transfer learning in which we extracted these acoustic features with a pre-trained convolutional neural network (YAMNet). We then used these features to train machine learning models - support vector machines - to predict the end-of-conversation valence ratings (positive vs negative) of each partner. The results of this work could inform how to best recognize the emotions of couples after conversation-sessions and eventually, lead to a better understanding of couples' relationships either in therapy or in everyday life.
引用
收藏
页码:17 / 21
页数:5
相关论文
共 50 条
  • [21] Exploring Transfer Learning between Scripted and Spontaneous Speech for Emotion Recognition
    Li, Qingqing
    Chaspari, Theodora
    ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 435 - 439
  • [22] A web crowdsourcing framework for transfer learning and personalized Speech Emotion Recognition
    Vryzas, Nikolaos
    Vrysis, Lazaros
    Kotsakis, Rigas
    Dimoulas, Charalampos
    MACHINE LEARNING WITH APPLICATIONS, 2021, 6
  • [23] Feature Selection Based Transfer Subspace Learning for Speech Emotion Recognition
    Song, Peng
    Zheng, Wenming
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2020, 11 (03) : 373 - 382
  • [24] Analysis of thermal comfort experience using peak-end rule with air conditioner in heating season
    Yang, Zixu
    Cui, Mengdi
    Xiao, Hansong
    Sun, Hongli
    Wang, Baolong
    Lin, Borong
    Shi, Wenxing
    BUILDING AND ENVIRONMENT, 2023, 229
  • [25] Speech Emotion Recognition Using Deep Neural Networks, Transfer Learning, and Ensemble Classification Techniques
    Mihalache, Serban
    Burileanu, Dragos
    ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2023, 26 (3-4): : 375 - 387
  • [26] Speech Emotion Recognition Using Deep Learning Transfer Models and Explainable Techniques
    Kim, Tae-Wan
    Kwak, Keun-Chang
    APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [27] Speech Emotion Recognition Using Deep Learning
    Ahmed, Waqar
    Riaz, Sana
    Iftikhar, Khunsa
    Konur, Savas
    ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 : 191 - 197
  • [28] Multi-branch feature learning based speech emotion recognition using SCAR-NET
    Mao, Keji
    Wang, Yuxiang
    Ren, Ligang
    Zhang, Jinhong
    Qiu, Jiefan
    Dai, Guanglin
    CONNECTION SCIENCE, 2023, 35 (01)
  • [29] Forming Global Retrospective Evaluation: Duration Neglect and the Peak-End Rule
    Bovan, Kosta
    DRUSTVENA ISTRAZIVANJA, 2013, 22 (02): : 325 - 345
  • [30] Towards end-to-end speech recognition with transfer learning
    Qin, Chu-Xiong
    Qu, Dan
    Zhang, Lian-Hai
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,