Depression detection using cascaded attention based deep learning framework using speech data

被引:0
作者
Gupta, Sachi [1 ]
Agarwal, Gaurav [2 ]
Agarwal, Shivani [3 ]
Pandey, Dilkeshwar [4 ]
机构
[1] Galgotias Coll Engn & Technol, Dept Comp Sci & Engn, Greater Noida 201310, Uttar Pradesh, India
[2] Galgotias Univ, Sch Comp Sci & Engn, Gr Noida 203201, Uttar Pradesh, India
[3] Ajay Kumar Garg Engn Coll, Dept Informat Technol, Ghaziabad 201009, Uttar Pradesh, India
[4] KIET Grp Inst, Dept Comp Sci & Engn, Ghaziabad 201206, Uttar Pradesh, India
关键词
Speech signals; Multi-stage Discrete Wavelet Transform; Auction Optimization; Deep convolutional Attention; Depression; And Non-depression;
D O I
10.1007/s11042-023-18076-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient detection of depression is a challenging scenario in the field of speech signal processing. Since the speech signals provide a better diagnosis of depression, a significant methodology is required for detection. However, manual examination performed by radiologists can be time-consuming and may not be feasible in complex circumstances. Diverse detection methodologies have been proposed previously, but they are found to be less accurate, time-consuming and lead over maximized error rates. The proposed research article presents an effective and automatic deep learning-based depression detection using speech signal data. The steps involved in depression prediction are data acquisition, pre-processing, Feature Extraction, Feature selection and classification. The initial step in depression detection is data acquisition, which aims at collecting speech signals from the Distress Analysis Interview Corpus (DAIC-WOZ) and Sonde Health-free speech (SH2-FS) datasets. The collected data are pre-processed through MS_DWT (Multi-stage Discrete Wavelet Transform) to offer noise-free signals and improved signal quality. The relevant features required for processing the speech signal are extracted through Hilbert Huang (H-H) transform linear prediction cepstrum coefficient (LPCC), fundamental frequency, formants, speaking rate and Mel frequency cepstral coefficients (MFCC). From the extracted features, ideal features required for enhancing the detection accuracy are selected using the Price Auction optimization algorithm (PAOA). Finally, the depression and non-depression states are classified using deep convolutional Attention Cascaded two directional long short-term memory (DAttn_Conv 2D LSTM) with a softmax classifier. The overall accuracy obtained in classifying the depressed and non-depressed classes is 97.82% and 98.91%, respectively.
引用
收藏
页码:66135 / 66173
页数:39
相关论文
共 50 条
  • [1] Evaluation of deep learning-based depression detection using medical claims data
    Bertl, Markus
    Bignoumba, Nzamba
    Ross, Peeter
    Ben Yahia, Sadok
    Draheim, Dirk
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 147
  • [2] Detection of Depression in Thai Social Media Messages using Deep Learning
    Kumnunt, Boriharn
    Sornil, Ohm
    PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON DEEP LEARNING THEORY AND APPLICATIONS (DELTA), 2020, : 111 - 118
  • [3] An Effective Depression Diagnostic System Using Speech Signal Analysis Through Deep Learning Methods
    Verma, Aman
    Jain, Pooja
    Kumar, Tapan
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2023, 32 (02)
  • [4] Deep learning based depression classification using environmental factor selection
    Nam W.
    Kim B.W.
    Transactions of the Korean Institute of Electrical Engineers, 2020, 69 (07): : 1102 - 1110
  • [5] Early Depression Detection from Social Network Using Deep Learning Techniques
    Shah, Faisal Muhammad
    Ahmed, Farzad
    Joy, Sajib Kumar Saha
    Ahmed, Sifat
    Sadek, Samir
    Shil, Rimon
    Kabir, Md Hasanul
    2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, : 823 - 826
  • [6] Automated speech-based screening of depression using deep convolutional neural networks
    Chlasta, Karol
    Wolk, Krzysztof
    Krejtz, Izabela
    CENTERIS2019--INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS/PROJMAN2019--INTERNATIONAL CONFERENCE ON PROJECT MANAGEMENT/HCIST2019--INTERNATIONAL CONFERENCE ON HEALTH AND SOCIAL CARE INFORMATION SYSTEMS AND TECHNOLOGIES, 2019, 164 : 618 - 628
  • [7] Depression detection from social network data using machine learning techniques
    Islam, Md. Rafiqul
    Kabir, Muhammad Ashad
    Ahmed, Ashir
    Kamal, Abu Raihan M.
    Wang, Hua
    Ulhaq, Anwaar
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2018, 6
  • [8] Diagnostic accuracy of deep learning using speech samples in depression: a systematic review and meta-analysis
    Liu, Lidan
    Liu, Lu
    Wafa, Hatem A.
    Tydeman, Florence
    Xie, Wanqing
    Wang, Yanzhong
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (10) : 2394 - 2404
  • [9] Depression Detection From Social Networks Data Based on Machine Learning and Deep Learning Techniques: An Interrogative Survey
    Hasib, Khan Md
    Islam, Md Rafiqul
    Sakib, Shadman
    Akbar, Md. Ali
    Razzak, Imran
    Alam, Mohammad Shafiul
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (04): : 1568 - 1586
  • [10] Polysomnographic identification of anxiety and depression using deep learning
    Thakre, Tushar P.
    Kulkarni, Hemant
    Adams, Katie S.
    Mischel, Ryan
    Hayes, Ronnie
    Pandurangi, Ananda
    JOURNAL OF PSYCHIATRIC RESEARCH, 2022, 150 : 54 - 63