Physiological signals provide a robust foundation for affective computing, largely because they resist conscious manipulation by subjects. With the proliferation of applications such as safe driving, mental health treatment, and wearable wellness technologies, emotion recognition from physiological signals has attracted substantial attention. However, the growing variety of signals captured by diverse sensors makes it difficult for models to integrate these inputs efficiently and predict emotional states accurately, and determining an optimal fusion strategy becomes increasingly complex as the number of signals grows. To address this, we propose switch fusion, a dynamic-allocation fusion algorithm that enables models to learn optimal fusion strategies over multiple modalities. Building on the mixture-of-experts framework, our approach employs a gating mechanism to route modalities to specialized experts, using these experts as fusion encoder modules. We further demonstrate the effectiveness of time-series models in processing physiological signals for continuous emotion estimation while improving computational efficiency. Experiments on the Continuously Annotated Signals of Emotion (CASE) dataset highlight the effectiveness of switch fusion, which achieves root mean square errors of 1.064 for arousal and 1.089 for valence, surpassing state-of-the-art methods in 3 of 4 experimental scenarios. This study underscores the critical role of dynamic fusion strategies in continuous emotion estimation from diverse physiological signals, addressing the challenges posed by increasingly complex sensor inputs.
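To make the routing idea concrete, the following is a minimal, hypothetical sketch of a switch-style fusion gate: each modality embedding is scored by a learned gate, sent to its top-1 expert encoder, and the expert outputs are summed into a single fused representation. All class and function names here are illustrative assumptions, not the paper's actual implementation, and the "experts" are reduced to plain linear maps for brevity.

```python
import math
import random

def softmax(xs):
    # Numerically stable softmax over a list of scores.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

class SwitchFusion:
    """Toy switch-fusion gate (illustrative sketch, not the paper's code).

    A gating network scores each modality embedding against every expert;
    the top-1 expert encodes that modality, weighted by its gate probability,
    and all expert outputs are summed into one fused vector.
    """
    def __init__(self, n_experts, dim, seed=0):
        rng = random.Random(seed)
        # Gating weights: one score vector per expert.
        self.gate = [[rng.gauss(0, 0.1) for _ in range(dim)]
                     for _ in range(n_experts)]
        # Each "expert" is a simple dim x dim linear map standing in
        # for a fusion encoder module.
        self.experts = [[[rng.gauss(0, 0.1) for _ in range(dim)]
                         for _ in range(dim)]
                        for _ in range(n_experts)]
        self.dim = dim

    def route(self, x):
        # Return the top-1 expert index and the gate probabilities.
        scores = softmax([sum(w * v for w, v in zip(row, x))
                          for row in self.gate])
        return max(range(len(scores)), key=lambda i: scores[i]), scores

    def fuse(self, modalities):
        # Route each modality embedding to one expert and sum the outputs.
        fused = [0.0] * self.dim
        for x in modalities:
            idx, scores = self.route(x)
            W = self.experts[idx]
            y = [scores[idx] * sum(w * v for w, v in zip(row, x))
                 for row in W]
            fused = [a + b for a, b in zip(fused, y)]
        return fused

# Example: fuse two 4-dimensional modality embeddings
# (e.g., features from two different physiological sensors).
sf = SwitchFusion(n_experts=3, dim=4)
fused = sf.fuse([[0.1, 0.2, 0.3, 0.4], [1.0, -1.0, 0.5, 0.0]])
```

In a trained system the gate and experts would be learned jointly, so the router discovers which expert should encode which modality; the hard top-1 choice is what keeps per-sample compute roughly constant as the number of experts grows.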