IIFDD: Intra and inter-modal fusion for depression detection with multi-modal information from Internet of Medical Things

被引:34
作者
Chen, Jian [1 ,2 ]
Hu, Yuzhu [1 ,2 ]
Lai, Qifeng [1 ,2 ]
Wang, Wei [1 ,3 ]
Chen, Junxin [4 ]
Liu, Han [4 ]
Srivastava, Gautam [5 ,6 ]
Bashir, Ali Kashif [7 ,8 ,9 ]
Hu, Xiping [1 ,3 ]
机构
[1] Shenzhen MSU BIT Univ, Artificial Intelligence Res Inst, Guangdong Hong Kong Macao Joint Lab Emot Intellige, Shenzhen 518172, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen 518000, Guangdong, Peoples R China
[3] Beijing Inst Technol, Sch Med Technol, Beijing 100081, Peoples R China
[4] Dalian Univ Technol, Sch Software, Dalian 116024, Liaoning, Peoples R China
[5] Brandon Univ, Dept Math & Comp Sci, Brandon, MB R7A 6A9, Canada
[6] China Med Univ Hosp, Res Ctr Interneural Comp, Taichung 40402, Taiwan
[7] Manchester Metropolitan Univ, Dept Comp & Math, Manchester, England
[8] Woxsen Univ, Woxsen Sch Business, Hyderabad 502345, India
[9] Lebanese Amer Univ, Dept Comp Sci & Math, Beirut, Lebanon
基金
中国国家自然科学基金;
关键词
IoMT; Depression detection; Multimedia affective computing; Multi-modal data fusion; Attention mechanism; CLASSIFICATION;
D O I
10.1016/j.inffus.2023.102017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Depression is now a prevalent mental illness and multimodal data-based depression detection is an essential topic of research. Internet of Medical Things devices can provide data resources such as text, audio, and vision, which is valuable for depression detection. Moreover, previous studies have concentrated on using single characteristics of each modality, such as low-dimensional pre-designed features and high-level deep representation, which cannot completely capture the emotional information included in the data. Against this background, we design an intra-modal and inter-modal fusion framework called IIFDD for Corpus-based depression detection. Intra-modal fusion module is designed to integrate low-dimensional pre-designed features and high-dimension deep representation from the same modality for better learning of the semantics information. Then, the inter-modal fusion module is proposed to fuse features from different modalities with attention mechanisms and use the fused result to complete the depression classification. Experiments on two Chinese depression corpus datasets with acoustics, textual, and visual features show that IIFDD can achieve state-of-the-art performance for depression detection.
引用
收藏
页数:11
相关论文
共 67 条
[1]  
Al Asad Nafiz, 2019, 2019 IEEE International Conference on Signal Processing, Information, Communication & Systems (SPICSCON), P13, DOI 10.1109/SPICSCON48833.2019.9065101
[2]   Intelligence in the Internet of Medical Things era: A systematic review of current and future trends [J].
Al-Turjman, Fadi ;
Nawaz, Muhammad Hassan ;
Ulusar, Umit Deniz .
COMPUTER COMMUNICATIONS, 2020, 150 :644-660
[3]   Towards video streaming in IoT Environments: Vehicular communication perspective [J].
Aliyu, Ahmed ;
Abdullah, Abdul H. ;
Kaiwartya, Omprakash ;
Cao, Yue ;
Lloret, Jaime ;
Aslam, Nauman ;
Joda, Usman Mohammed .
COMPUTER COMMUNICATIONS, 2018, 118 :93-119
[4]   Machine Learning-Based Approach for Depression Detection in Twitter Using Content and Activity Features [J].
Alsagri, Hatoon S. ;
Ykhlef, Mourad .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (08) :1825-1832
[5]  
[Anonymous], 2013, Diagnostic and Statistical Manual of Mental Disorders: DSM-5, V5
[6]   Automated Depression Detection Using Deep Representation and Sequence Learning with EEG Signals [J].
Ay, Betul ;
Yildirim, Ozal ;
Talo, Muhammed ;
Baloglu, Ulas Baran ;
Aydin, Galip ;
Puthankattil, Subha D. ;
Acharya, U. Rajendra .
JOURNAL OF MEDICAL SYSTEMS, 2019, 43 (07)
[7]   OpenFace 2.0: Facial Behavior Analysis Toolkit [J].
Baltrusaitis, Tadas ;
Zadeh, Amir ;
Lim, Yao Chong ;
Morency, Louis-Philippe .
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :59-66
[8]  
Bertasius G, 2021, PR MACH LEARN RES, V139
[9]   A text classification framework for simple and effective early depression detection over social media streams [J].
Burdisso, Sergio G. ;
Errecalde, Marcelo ;
Montes-y-Gomez, Manuel .
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 133 :182-197
[10]  
Carreira J., 2018, arXiv