A Novel Audio-Visual Information Fusion System for Mental Disorders Detection

被引:0
作者
Li, Yichun [1 ]
Li, Shuanglin [1 ]
Naqvi, Syed Mohsen [1 ]
机构
[1] Newcastle Univ, Intelligent Sensing & Commun Res Grp, Newcastle Upon Tyne, Tyne & Wear, England
来源
2024 27TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, FUSION 2024 | 2024年
关键词
mental disorder; machine learning; depression; ADHD; multimodal;
D O I
10.23919/FUSION59988.2024.10706499
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mental disorders are among the foremost contributors to the global healthcare challenge. Research indicates that timely diagnosis and intervention are vital in treating various mental disorders. However, the early somatization symptoms of certain mental disorders may not be immediately evident, often resulting in their oversight and misdiagnosis. Additionally, the traditional diagnosis methods incur high time and cost. Deep learning methods based on fMRI and EEG have improved the efficiency of the mental disorder detection process. However, the cost of the equipment and trained staff are generally huge. Moreover, most systems are only trained for a specific mental disorder and are not general-purpose. Recently, physiological studies have shown that there are some speech and facial-related symptoms in a few mental disorders (e.g., depression and ADHD). In this paper, we focus on the emotional expression features of mental disorders and introduce a multimodal mental disorder diagnosis system based on audio-visual information input. Our proposed system is based on spatial-temporal attention networks and innovative uses a less computationally intensive pre-train audio recognition network to fine-tune the video recognition module for better results. We also apply the unified system for multiple mental disorders (ADHD and depression) for the first time. The proposed system achieves over 80% accuracy on the real multimodal ADHD dataset and achieves state-of-the-art results on the depression dataset AVEC 2014.
引用
收藏
页数:7
相关论文
共 34 条
[1]  
[Anonymous], About us
[2]   The Neuro Bureau ADHD-200 Preprocessed repository [J].
Bellec, Pierre ;
Chu, Carlton ;
Chouinard-Decorte, Francois ;
Benhajali, Yassine ;
Margulies, Daniel S. ;
Craddock, R. Cameron .
NEUROIMAGE, 2017, 144 :275-286
[3]   ADHD classification by dual subspace learning using resting-state functional connectivity [J].
Chen, Ying ;
Tang, Yibin ;
Wang, Chun ;
Liu, Xiaofeng ;
Zhao, Li ;
Wang, Zhishun .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2020, 103
[4]  
de Melo WC, 2020, INT CONF ACOUST SPEE, P1080, DOI [10.1109/icassp40776.2020.9054375, 10.1109/ICASSP40776.2020.9054375]
[5]   Learning Spatiotemporal Features with 3D Convolutional Networks [J].
Du Tran ;
Bourdev, Lubomir ;
Fergus, Rob ;
Torresani, Lorenzo ;
Paluri, Manohar .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497
[6]   Encoding Visual Behaviors with Attentive Temporal Convolution for Depression Prediction [J].
Du, Zhengyin ;
Li, Weixin ;
Huang, Di ;
Wang, Yunhong .
2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019, :562-568
[7]   Diagnosis of Major Depressive Disorder Using Machine Learning Based on Multisequence MRI Neuroimaging Features [J].
Li, Qinghe ;
Dong, Fanghui ;
Gai, Qun ;
Che, Kaili ;
Ma, Heng ;
Zhao, Feng ;
Chu, Tongpeng ;
Mao, Ning ;
Wang, Peiyuan .
JOURNAL OF MAGNETIC RESONANCE IMAGING, 2023, 58 (05) :1420-1430
[8]  
Li S., 2024, IEEE Journal of Translational Engineering in Health and Medicine, P1
[9]  
Li S., 2023, IEEE INT PERF COMP C
[10]  
Li Y., 2023, Journal of Neurology, Neurosurgery & Psychiatry, V94, pe2