Switch fusion for continuous emotion estimation from multiple physiological signals

Cited by: 0
Authors
Vu, Ngoc Tu [1]
Huynh, Van Thong [3]
Kim, Seung-Won [1]
Shin, Ji-eun [2]
Yang, Hyung-Jeong [1]
Kim, Soo-Hyung [1]
Affiliations
[1] Chonnam Natl Univ, Dept AI Convergence, Gwangju 61186, South Korea
[2] Chonnam Natl Univ, Dept Psychol, Gwangju 61186, South Korea
[3] FPT Univ, Dept ITS, HoChiMinh City 71216, Vietnam
Funding
National Research Foundation, Singapore
Keywords
Continuous emotion estimation; Multimodal dynamic fusion; Physiological signals; Affective computing; FACIAL EXPRESSION; RECOGNITION
DOI
10.1016/j.bspc.2025.107831
Chinese Library Classification (CLC) number
R318 [Biomedical engineering]
Discipline classification code
0831
Abstract
Physiological signals provide a robust foundation for affective computing, primarily because subjects cannot easily manipulate them consciously. With the proliferation of applications such as safe driving, mental health treatment, and wearable wellness technologies, emotion recognition based on physiological signals has attracted substantial attention. However, the growing variety of signals captured by diverse sensors makes it difficult for models to integrate these inputs and predict emotional states both accurately and efficiently, and determining an optimized fusion strategy becomes increasingly complex as the number of signals grows. To address this, we propose switch fusion, a dynamic allocation fusion algorithm that enables models to learn optimal fusion strategies for multiple modalities. Building on the mixture-of-experts framework, our approach employs a gating mechanism to route modalities to specialized experts, which serve as fusion encoder modules. Furthermore, to enhance computational efficiency, we demonstrate the effectiveness of time series-based models in processing physiological signals for continuous emotion estimation. Experiments on the Continuously Annotated Signals of Emotion (CASE) dataset highlight the effectiveness of switch fusion, which achieves root mean square errors of 1.064 and 1.089 for arousal and valence scores, respectively, surpassing state-of-the-art methods in three of four experimental scenarios. This study underscores the critical role of dynamic fusion strategies in continuous emotion estimation from diverse physiological signals, addressing the challenges posed by the increasing complexity of sensor inputs.
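The gating idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes top-1 routing, uses plain linear maps as stand-ins for the fusion encoder experts, and fills all weights with random values. The modality names (`ecg`, `gsr`, `emg`), the dimensions, and every function name are hypothetical choices made for illustration only.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vector):
    """Multiply a row-major matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def random_matrix(rows, cols, rng):
    """Random weights; a stand-in for learned parameters (assumption)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def switch_fuse(modalities, gate, experts):
    """Route each modality embedding to one expert and average the results.

    modalities: dict mapping a modality name to its embedding vector
    gate:       matrix of shape (n_experts, dim) producing routing scores
    experts:    list of (dim, dim) matrices; linear stand-ins for encoders
    """
    dim = len(experts[0])
    fused = [0.0] * dim
    routes = {}
    for name, x in modalities.items():
        probs = softmax(matvec(gate, x))
        # Top-1 routing (assumed): pick the expert with the highest gate probability.
        k = max(range(len(probs)), key=probs.__getitem__)
        y = matvec(experts[k], x)
        # Scale the expert output by its gate probability, then average over modalities.
        fused = [f + probs[k] * v / len(modalities) for f, v in zip(fused, y)]
        routes[name] = k
    return fused, routes


rng = random.Random(0)
DIM, N_EXPERTS = 4, 3  # hypothetical sizes
gate = random_matrix(N_EXPERTS, DIM, rng)
experts = [random_matrix(DIM, DIM, rng) for _ in range(N_EXPERTS)]
modalities = {
    name: [rng.uniform(-1.0, 1.0) for _ in range(DIM)]
    for name in ("ecg", "gsr", "emg")  # hypothetical modality names
}
fused, routes = switch_fuse(modalities, gate, experts)
print("expert chosen per modality:", routes)
print("fused representation:", fused)
```

In a trained model the gate and the experts would be learned jointly, so the routing table would reflect which encoder suits which signal rather than random weights.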
Pages: 13