DPD (DePression Detection) Net: a deep neural network for multimodal depression detection

被引:0
作者
He, Manlu [1 ]
Bakker, Erwin M. [1 ]
Lew, Michael S. [1 ]
机构
[1] Leiden Univ, Leiden Inst Adv Comp Sci LIACS, Niels Bohrweg 1, NL-2333 CA Leiden, Netherlands
来源
HEALTH INFORMATION SCIENCE AND SYSTEMS | 2024年 / 12卷 / 01期
关键词
Depression detection; Multimodal data; Deep neural network; Transformers; Graph neural networks; Ensemble model; RECOGNITION; FRAMEWORK;
D O I
10.1007/s13755-024-00311-9
中图分类号
R-058 [];
学科分类号
摘要
Depression is one of the most prevalent mental conditions which could impair people's productivity and lead to severe consequences. The diagnosis of this disease is complex as it often relies on a physician's subjective interview-based screening. The aim of our work is to propose deep learning models for automatic depression detection by using different data modalities, which could assist in the diagnosis of depression. Current works on automatic depression detection mostly are tested on a single dataset, which might lack robustness, flexibility and scalability. To alleviate this problem, we design a novel Graph Neural Network-enhanced Transformer model named DePressionDetect Net (DPD Net) that leverages textual, audio and visual features and can work under two different application settings: the clinical setting and the social media setting. The model consists of a unimodal encoder module for encoding single modality, a multimodal encoder module for integrating the multimodal information, and a detection module for producing the final prediction. We also propose a model named DePressionDetect-with-EEG Net (DPD-E Net) to incorporate Electroencephalography (EEG) signals and speech data for depression detection. Experiments across four benchmark datasets show that DPD Net and DPD-E Net can outperform the state-of-the-art models on three datasets (i.e., E-DAIC dataset, Twitter depression dataset and MODMA dataset), and achieve competitive performance on the fourth one (i.e., D-vlog dataset). Ablation studies demonstrate the advantages of the proposed modules and the effectiveness of combining diverse modalities for automatic depression detection.
引用
收藏
页数:17
相关论文
共 44 条
  • [1] Ensemble Hybrid Learning Methods for Automated Depression Detection
    Ansari, Luna
    Ji, Shaoxiong
    Chen, Qian
    Cambria, Erik
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (01) : 211 - 219
  • [2] Baevski A, 2020, ADV NEUR IN, V33
  • [3] Relative power and coherence of EEG series are related to amnestic mild cognitive impairment in diabetes
    Bian, Zhijie
    Li, Qiuli
    Wang, Lei
    Lu, Chengbiao
    Yin, Shimin
    Li, Xiaoli
    [J]. FRONTIERS IN AGING NEUROSCIENCE, 2014, 6
  • [4] Bucur A-M., 2023, Its just a matter of time: detecting depression with time-enriched multimodal transformers, P200, DOI [10.1007/978-3-031-28244-7_13, DOI 10.1007/978-3-031-28244-7_13]
  • [5] A multi-modal open dataset for mental-disorder analysis
    Cai, Hanshu
    Yuan, Zhenqin
    Gao, Yiwen
    Sun, Shuting
    Li, Na
    Tian, Fuze
    Xiao, Han
    Li, Jianxiu
    Yang, Zhengwu
    Li, Xiaowei
    Zhao, Qinglin
    Liu, Zhenyu
    Yao, Zhijun
    Yang, Minqiang
    Peng, Hong
    Zhu, Jing
    Zhang, Xiaowei
    Gao, Guoping
    Zheng, Fang
    Li, Rui
    Guo, Zhihua
    Ma, Rong
    Yang, Jing
    Zhang, Lan
    Hu, Xiping
    Li, Yumin
    Hu, Bin
    [J]. SCIENTIFIC DATA, 2022, 9 (01)
  • [6] Cucurull G., 2017, ARXIV
  • [7] de Melo WC, 2020, INT CONF ACOUST SPEE, P1080, DOI [10.1109/ICASSP40776.2020.9054375, 10.1109/icassp40776.2020.9054375]
  • [8] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [9] The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing
    Eyben, Florian
    Scherer, Klaus R.
    Schuller, Bjoern W.
    Sundberg, Johan
    Andre, Elisabeth
    Busso, Carlos
    Devillers, Laurence Y.
    Epps, Julien
    Laukka, Petri
    Narayanan, Shrikanth S.
    Truong, Khiet P.
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2016, 7 (02) : 190 - 202
  • [10] Early depression detection in social media based on deep learning and underlying emotions
    Figueredo, Jose Solenir L.
    Maia, Ana Lucia L. M.
    Calumby, Rodrigo Tripodi
    [J]. ONLINE SOCIAL NETWORKS AND MEDIA, 2022, 31