Depression detection using cascaded attention based deep learning framework using speech data

Cited by: 0
Authors
Gupta, Sachi [1]
Agarwal, Gaurav [2]
Agarwal, Shivani [3]
Pandey, Dilkeshwar [4]
Affiliations
[1] Galgotias Coll Engn & Technol, Dept Comp Sci & Engn, Greater Noida 201310, Uttar Pradesh, India
[2] Galgotias Univ, Sch Comp Sci & Engn, Gr Noida 203201, Uttar Pradesh, India
[3] Ajay Kumar Garg Engn Coll, Dept Informat Technol, Ghaziabad 201009, Uttar Pradesh, India
[4] KIET Grp Inst, Dept Comp Sci & Engn, Ghaziabad 201206, Uttar Pradesh, India
Keywords
Speech signals; Multi-stage Discrete Wavelet Transform; Auction Optimization; Deep convolutional Attention; Depression and Non-depression
DOI
10.1007/s11042-023-18076-w
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Detecting depression efficiently remains a challenging problem in speech signal processing. Because speech signals carry useful diagnostic cues for depression, a reliable detection methodology is required. Manual examination performed by radiologists can be time-consuming and may not be feasible in complex circumstances. Diverse detection methodologies have been proposed previously, but they have been found to be insufficiently accurate, time-consuming and prone to high error rates. This article presents an automatic deep learning-based approach to depression detection from speech signal data. Depression prediction involves five steps: data acquisition, pre-processing, feature extraction, feature selection and classification. In data acquisition, speech signals are collected from the Distress Analysis Interview Corpus (DAIC-WOZ) and Sonde Health-free speech (SH2-FS) datasets. The collected data are pre-processed with a Multi-stage Discrete Wavelet Transform (MS_DWT) to remove noise and improve signal quality. The relevant features are then extracted: the Hilbert-Huang (H-H) transform, linear prediction cepstrum coefficients (LPCC), fundamental frequency, formants, speaking rate and Mel frequency cepstral coefficients (MFCC). From the extracted features, those best suited to improving detection accuracy are selected using the Price Auction optimization algorithm (PAOA). Finally, the depression and non-depression states are classified using a deep convolutional Attention Cascaded two directional long short-term memory (DAttn_Conv 2D LSTM) network with a softmax classifier. The overall accuracy obtained in classifying the depressed and non-depressed classes is 97.82% and 98.91%, respectively.
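The pre-processing step named in the abstract, multi-stage wavelet denoising, can be illustrated with a minimal pure-Python sketch. The abstract does not state the wavelet family, the number of stages or the thresholding rule, so the Haar wavelet, three stages and the universal soft threshold used here are assumptions chosen for illustration only, and the function names (`haar_dwt`, `haar_idwt`, `soft_threshold`, `multistage_dwt_denoise`) are hypothetical rather than taken from the paper.

```python
import math
import random
import statistics

_S = 1 / math.sqrt(2)  # Haar normalisation factor


def haar_dwt(signal):
    """One Haar analysis stage: returns (approximation, detail), each half length."""
    approx = [(signal[i] + signal[i + 1]) * _S for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * _S for i in range(0, len(signal), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt: interleaves the reconstructed sample pairs."""
    out = []
    for a, d in zip(approx, detail):
        out.extend([(a + d) * _S, (a - d) * _S])
    return out


def soft_threshold(coeffs, t):
    """Shrink each coefficient towards zero by t; values within [-t, t] become 0."""
    return [math.copysign(max(abs(c) - t, 0.0), c) for c in coeffs]


def multistage_dwt_denoise(signal, levels=3):
    """Denoise by thresholding detail coefficients over several DWT stages.

    Assumed choices (not from the paper): Haar wavelet, universal threshold
    sigma * sqrt(2 ln n), noise sigma estimated from the finest detail band.
    """
    n = len(signal)
    if n == 0 or n % (2 ** levels) != 0:
        raise ValueError("signal length must be a non-zero multiple of 2**levels")

    # Analysis: repeatedly split the approximation band.
    approx, details = list(signal), []
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        details.append(detail)

    # Robust noise estimate from the finest-scale details (median absolute value).
    sigma = statistics.median(abs(d) for d in details[0]) / 0.6745
    threshold = sigma * math.sqrt(2 * math.log(n))
    details = [soft_threshold(d, threshold) for d in details]

    # Synthesis: rebuild from the coarsest stage back to full resolution.
    for detail in reversed(details):
        approx = haar_idwt(approx, detail)
    return approx


if __name__ == "__main__":
    random.seed(0)
    n = 1024
    clean = [math.sin(2 * math.pi * 4 * i / n) for i in range(n)]
    noisy = [c + random.gauss(0, 0.3) for c in clean]
    denoised = multistage_dwt_denoise(noisy, levels=3)
    print(len(denoised))
```

In practice a library such as PyWavelets with a smoother wavelet would be a more likely choice for speech; the hand-written Haar transform is used here only to keep the sketch free of dependencies.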
Pages: 66135-66173
Page count: 39