MSF-Net: Multi-stage fusion network for emotion recognition from multimodal signals in scalable healthcare

Cited by: 0
Authors
Islam, Md. Milon [1 ]
Karray, Fakhri [1 ,2 ]
Muhammad, Ghulam [3 ]
Affiliations
[1] Univ Waterloo, Ctr Pattern Anal & Machine Intelligence, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[2] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
[3] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Riyadh 11543, Saudi Arabia
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
Multimodal emotion recognition; Multi-stage fusion; Vision transformer; Bi-directional Gated Recurrent Unit; Triplet attention; Scalable healthcare;
DOI
10.1016/j.inffus.2025.103028
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Discipline Classification Codes
081104; 0812; 0835; 1405;
Abstract
Automatic emotion recognition has attracted significant interest in healthcare, driven by recent advances in smart and innovative technologies. A real-time emotion recognition system enables continuous monitoring, understanding, and enhancement of an individual's physical capacities, along with ongoing guidance for improving quality of life and well-being in the context of personalized healthcare. A key challenge in multimodal emotion recognition is efficiently exploiting the diverse modalities present in the data. In this article, we introduce a Multi-Stage Fusion Network (MSF-Net) for emotion recognition that extracts multimodal information and achieves strong performance. We propose a transformer-based structure to extract deep features from facial expressions, and we exploit two visual descriptors, Local Binary Pattern (LBP) and Oriented FAST and Rotated BRIEF (ORB), to retrieve computer-vision-based features from the facial videos. A feature-level fusion network integrates the features extracted by these modules and directs the output to a triplet attention module, which employs a three-branch architecture to compute attention weights and efficiently capture cross-dimensional interactions. The temporal dependencies in physiological signals are modeled by a Bi-directional Gated Recurrent Unit (Bi-GRU) in the forward and backward directions at each time step. Lastly, the feature representations output by the triplet attention module and the high-level patterns extracted by the Bi-GRU are fused and fed into the classification module to recognize emotion. Extensive experimental evaluations show that the proposed MSF-Net outperforms state-of-the-art approaches on two popular datasets, BioVid Emo DB and MGEED. Finally, we tested the proposed MSF-Net in an Internet of Things environment to facilitate real-world scalable smart healthcare applications.
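To make the data flow described in the abstract concrete, below is a minimal PyTorch sketch of the multi-stage fusion pipeline: transformer (ViT) features and LBP/ORB descriptors are fused at the feature level, gated by a triplet-attention-style module, combined with Bi-GRU features from physiological signals, and passed to a classifier. This is an illustration only; the class names (MSFNetSketch, TripletAttention1D), feature dimensions, and the simplified linear-gate form of the three-branch attention are assumptions, not the authors' implementation, and the actual ViT backbone, LBP/ORB extraction, and training procedure are not reproduced here.

```python
# Hypothetical sketch of the MSF-Net fusion flow described in the abstract.
# Layer sizes, backbones, and training details are not given in this record;
# every module below is a simplified stand-in.
import torch
import torch.nn as nn


class TripletAttention1D(nn.Module):
    """Simplified stand-in for the three-branch triplet attention step:
    each branch produces a sigmoid gate over the fused visual features,
    loosely mirroring the three cross-dimensional interaction branches."""
    def __init__(self, dim: int):
        super().__init__()
        self.branches = nn.ModuleList(
            [nn.Sequential(nn.Linear(dim, dim), nn.Sigmoid()) for _ in range(3)]
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Average the three gated views of the input.
        return torch.stack([x * branch(x) for branch in self.branches], dim=0).mean(dim=0)


class MSFNetSketch(nn.Module):
    def __init__(self, vit_dim=768, lbp_dim=256, orb_dim=256,
                 phys_dim=8, hidden=128, num_emotions=5):
        super().__init__()
        # Stage 1: feature-level fusion of transformer (ViT) features with
        # the LBP / ORB handcrafted descriptors extracted from facial video.
        self.visual_fusion = nn.Sequential(
            nn.Linear(vit_dim + lbp_dim + orb_dim, hidden), nn.ReLU()
        )
        self.triplet_attention = TripletAttention1D(hidden)
        # Stage 2: Bi-GRU models temporal dependencies in physiological signals
        # in the forward and backward directions.
        self.bigru = nn.GRU(phys_dim, hidden // 2, batch_first=True,
                            bidirectional=True)
        # Stage 3: final fusion followed by the emotion classification module.
        self.classifier = nn.Sequential(
            nn.Linear(hidden * 2, hidden), nn.ReLU(),
            nn.Linear(hidden, num_emotions)
        )

    def forward(self, vit_feat, lbp_feat, orb_feat, phys_seq):
        visual = self.visual_fusion(torch.cat([vit_feat, lbp_feat, orb_feat], dim=-1))
        visual = self.triplet_attention(visual)
        _, h_n = self.bigru(phys_seq)                 # h_n: (2, batch, hidden // 2)
        phys = torch.cat([h_n[0], h_n[1]], dim=-1)    # forward + backward final states
        return self.classifier(torch.cat([visual, phys], dim=-1))


if __name__ == "__main__":
    model = MSFNetSketch()
    logits = model(torch.randn(4, 768), torch.randn(4, 256),
                   torch.randn(4, 256), torch.randn(4, 100, 8))
    print(logits.shape)  # torch.Size([4, 5])
```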
Pages: 15