A Novel Audio-Visual Information Fusion System for Mental Disorders Detection

被引:0
作者
Li, Yichun [1 ]
Li, Shuanglin [1 ]
Naqvi, Syed Mohsen [1 ]
机构
[1] Newcastle Univ, Intelligent Sensing & Commun Res Grp, Newcastle Upon Tyne, Tyne & Wear, England
来源
2024 27TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, FUSION 2024 | 2024年
关键词
mental disorder; machine learning; depression; ADHD; multimodal;
D O I
10.23919/FUSION59988.2024.10706499
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mental disorders are among the foremost contributors to the global healthcare challenge. Research indicates that timely diagnosis and intervention are vital in treating various mental disorders. However, the early somatization symptoms of certain mental disorders may not be immediately evident, often resulting in their oversight and misdiagnosis. Additionally, the traditional diagnosis methods incur high time and cost. Deep learning methods based on fMRI and EEG have improved the efficiency of the mental disorder detection process. However, the cost of the equipment and trained staff are generally huge. Moreover, most systems are only trained for a specific mental disorder and are not general-purpose. Recently, physiological studies have shown that there are some speech and facial-related symptoms in a few mental disorders (e.g., depression and ADHD). In this paper, we focus on the emotional expression features of mental disorders and introduce a multimodal mental disorder diagnosis system based on audio-visual information input. Our proposed system is based on spatial-temporal attention networks and innovative uses a less computationally intensive pre-train audio recognition network to fine-tune the video recognition module for better results. We also apply the unified system for multiple mental disorders (ADHD and depression) for the first time. The proposed system achieves over 80% accuracy on the real multimodal ADHD dataset and achieves state-of-the-art results on the depression dataset AVEC 2014.
引用
收藏
页数:7
相关论文
共 34 条
[21]   Automatic Depression Level Detection via lp-norm Pooling [J].
Niu, Mingyue ;
Tao, Jianhua ;
Liu, Bin ;
Fan, Cunhang .
INTERSPEECH 2019, 2019, :4559-4563
[22]   Evaluating Therapeutic Effects of ADHD Medication Objectively by Movement Quantification with a Video-Based Skeleton Analysis [J].
Ouyang, Chen-Sen ;
Chiu, Yi-Hung ;
Chiang, Ching-Tai ;
Wu, Rong-Ching ;
Lin, Ying-Tong ;
Yang, Rei-Cheng ;
Lin, Lung-Chang .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (17)
[23]   Spatial-Temporal Attention Network for Depression Recognition from facial videos [J].
Pan, Yuchen ;
Shang, Yuanyuan ;
Liu, Tie ;
Shao, Zhuhong ;
Guo, Guodong ;
Ding, Hui ;
Hu, Qiang .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
[24]   Efficacy of novel Summation-based Synergetic Artificial Neural Network in ADHD diagnosis [J].
Peng, Jian ;
Debnath, Madhuri ;
Biswas, Ashis Kumer .
MACHINE LEARNING WITH APPLICATIONS, 2021, 6
[25]   A Survey of Computational Methods for Online Mental State Assessment on Social Media [J].
Rissola, Esteban A. ;
Losada, David E. ;
Crestani, Fabio .
ACM TRANSACTIONS ON COMPUTING FOR HEALTHCARE, 2021, 2 (02)
[26]  
Russell J. A., 1989, EMOTION THEORY RES E, P83, DOI DOI 10.1016/B978-0-12-558704-4.50010-4
[27]   Machine learning in the prediction of depression treatment outcomes: a systematic review and meta-analysis [J].
Sajjadian, Mehri ;
Lam, Raymond W. ;
Milev, Roumen ;
Rotzinger, Susan ;
Frey, Benicio N. ;
Soares, Claudio N. ;
Parikh, Sagar V. ;
Foster, Jane A. ;
Turecki, Gustavo ;
Muller, Daniel J. ;
Strother, Stephen C. ;
Farzan, Faranak ;
Kennedy, Sidney H. ;
Uher, Rudolf .
PSYCHOLOGICAL MEDICINE, 2021, 51 (16) :2742-2751
[28]   The Impact of Walking and Resting on Wrist Motion for Automated Detection of Meals [J].
Sharma, Surya ;
Jasper, Phillip ;
Muth, Eric ;
Hoover, Adam .
ACM TRANSACTIONS ON COMPUTING FOR HEALTHCARE, 2020, 1 (04)
[29]   ADHD classification using auto-encoding neural network and binary hypothesis testing [J].
Tang, Yibin ;
Sun, Jia ;
Wang, Chun ;
Zhong, Yuan ;
Jiang, Aimin ;
Liu, Gang ;
Liu, Xiaofeng .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 123
[30]   A Closer Look at Spatiotemporal Convolutions for Action Recognition [J].
Tran, Du ;
Wang, Heng ;
Torresani, Lorenzo ;
Ray, Jamie ;
LeCun, Yann ;
Paluri, Manohar .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6450-6459