Depression detection using cascaded attention based deep learning framework using speech data

被引:0
作者
Gupta, Sachi [1 ]
Agarwal, Gaurav [2 ]
Agarwal, Shivani [3 ]
Pandey, Dilkeshwar [4 ]
机构
[1] Galgotias Coll Engn & Technol, Dept Comp Sci & Engn, Greater Noida 201310, Uttar Pradesh, India
[2] Galgotias Univ, Sch Comp Sci & Engn, Gr Noida 203201, Uttar Pradesh, India
[3] Ajay Kumar Garg Engn Coll, Dept Informat Technol, Ghaziabad 201009, Uttar Pradesh, India
[4] KIET Grp Inst, Dept Comp Sci & Engn, Ghaziabad 201206, Uttar Pradesh, India
关键词
Speech signals; Multi-stage Discrete Wavelet Transform; Auction Optimization; Deep convolutional Attention; Depression; And Non-depression;
D O I
10.1007/s11042-023-18076-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient detection of depression is a challenging scenario in the field of speech signal processing. Since the speech signals provide a better diagnosis of depression, a significant methodology is required for detection. However, manual examination performed by radiologists can be time-consuming and may not be feasible in complex circumstances. Diverse detection methodologies have been proposed previously, but they are found to be less accurate, time-consuming and lead over maximized error rates. The proposed research article presents an effective and automatic deep learning-based depression detection using speech signal data. The steps involved in depression prediction are data acquisition, pre-processing, Feature Extraction, Feature selection and classification. The initial step in depression detection is data acquisition, which aims at collecting speech signals from the Distress Analysis Interview Corpus (DAIC-WOZ) and Sonde Health-free speech (SH2-FS) datasets. The collected data are pre-processed through MS_DWT (Multi-stage Discrete Wavelet Transform) to offer noise-free signals and improved signal quality. The relevant features required for processing the speech signal are extracted through Hilbert Huang (H-H) transform linear prediction cepstrum coefficient (LPCC), fundamental frequency, formants, speaking rate and Mel frequency cepstral coefficients (MFCC). From the extracted features, ideal features required for enhancing the detection accuracy are selected using the Price Auction optimization algorithm (PAOA). Finally, the depression and non-depression states are classified using deep convolutional Attention Cascaded two directional long short-term memory (DAttn_Conv 2D LSTM) with a softmax classifier. The overall accuracy obtained in classifying the depressed and non-depressed classes is 97.82% and 98.91%, respectively.
引用
收藏
页码:66135 / 66173
页数:39
相关论文
共 50 条
  • [21] Social Media Multiaspect Detection by Using Unsupervised Deep Active Attention
    Ahmed, Usman
    Lin, Jerry Chun-Wei
    Srivastava, Gautam
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (04) : 2137 - 2145
  • [22] Attention-Enabled Ensemble Deep Learning Models and Their Validation for Depression Detection: A Domain Adoption Paradigm
    Singh, Jaskaran
    Singh, Narpinder
    Fouda, Mostafa M.
    Saba, Luca
    Suri, Jasjit S.
    DIAGNOSTICS, 2023, 13 (12)
  • [23] Ensemble-based Depression Detection in Speech
    Liu, Zhenyu
    Li, Changcong
    Gao, Xiang
    Wang, Gang
    Yang, Jing
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 975 - 980
  • [24] MDD: A Unified Multimodal Deep Learning Approach for Depression Diagnosis Based on Text and Audio Speech
    Mohammad, Farah
    Al Mansoor, Khulood Mohammed
    Computers, Materials and Continua, 2024, 81 (03) : 4125 - 4147
  • [25] Attention-Based Deep Entropy Active Learning Using Lexical Algorithm for Mental Health Treatment
    Ahmed, Usman
    Mukhiya, Suresh Kumar
    Srivastava, Gautam
    Lamo, Yngve
    Lin, Jerry Chun-Wei
    FRONTIERS IN PSYCHOLOGY, 2021, 12
  • [26] Review of Advancements in Depression Detection Using Social Media Data
    Dalal, Sumit
    Jain, Sarika
    Dav, Mayank
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2025, 12 (01): : 77 - 100
  • [27] SPEECH-BASED DEPRESSION PREDICTION USING ENCODER-WEIGHT-ONLY TRANSFER LEARNING AND A LARGE CORPUS
    Harati, Amir
    Shriberg, Elizabeth
    Rutowski, Tomasz
    Chlebek, Piotr
    Lu, Yang
    Oliveira, Ricardo
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7273 - 7277
  • [28] Early depression detection in social media based on deep learning and underlying emotions
    Figueredo, Jose Solenir L.
    Maia, Ana Lucia L. M.
    Calumby, Rodrigo Tripodi
    ONLINE SOCIAL NETWORKS AND MEDIA, 2022, 31
  • [29] Naive Bayes Classifier for depression detection using text data
    Samanvitha, S.
    Bindiya, A. R.
    Sudhanva, Shreya
    Mahanand, B. S.
    2021 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER TECHNOLOGIES AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2021, : 418 - 421
  • [30] Efficacy of novel attention-based gated recurrent units transformer for depression detection using electroencephalogram signals
    Tigga, Neha Prerna
    Garg, Shruti
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2022, 11 (01)