Depression detection using cascaded attention based deep learning framework using speech data

Cited by: 0
Authors
Gupta, Sachi [1]
Agarwal, Gaurav [2]
Agarwal, Shivani [3]
Pandey, Dilkeshwar [4]
Affiliations
[1] Galgotias Coll Engn & Technol, Dept Comp Sci & Engn, Greater Noida 201310, Uttar Pradesh, India
[2] Galgotias Univ, Sch Comp Sci & Engn, Gr Noida 203201, Uttar Pradesh, India
[3] Ajay Kumar Garg Engn Coll, Dept Informat Technol, Ghaziabad 201009, Uttar Pradesh, India
[4] KIET Grp Inst, Dept Comp Sci & Engn, Ghaziabad 201206, Uttar Pradesh, India
Keywords
Speech signals; Multi-stage Discrete Wavelet Transform; Auction Optimization; Deep Convolutional Attention; Depression; Non-depression
DOI
10.1007/s11042-023-18076-w
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Efficient detection of depression is a challenging problem in speech signal processing. Because speech signals can support the diagnosis of depression, an effective detection method is needed. Manual examination by clinicians can be time-consuming and may not be feasible in complex cases. Various detection methods have been proposed previously, but they have been found to be inaccurate, slow, and prone to high error rates. This article presents an automatic deep learning-based method for detecting depression from speech signal data. The method has five steps: data acquisition, pre-processing, feature extraction, feature selection, and classification. In the data acquisition step, speech signals are collected from the Distress Analysis Interview Corpus (DAIC-WOZ) and Sonde Health-free speech (SH2-FS) datasets. The collected data are pre-processed with a Multi-stage Discrete Wavelet Transform (MS_DWT) to remove noise and improve signal quality. The relevant speech features are then extracted: the Hilbert-Huang (H-H) transform, linear prediction cepstrum coefficients (LPCC), fundamental frequency, formants, speaking rate, and Mel frequency cepstral coefficients (MFCC). From the extracted features, those that best improve detection accuracy are selected using the Price Auction optimization algorithm (PAOA). Finally, the depression and non-depression states are classified using a deep convolutional Attention Cascaded two-directional long short-term memory network (DAttn_Conv 2D LSTM) with a softmax classifier. The overall accuracy obtained in classifying the depressed and non-depressed classes is 97.82% and 98.91%, respectively.
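The abstract does not specify how the MS_DWT pre-processing step is configured. As an illustration only, the sketch below shows one common way to denoise a signal with a multi-stage discrete wavelet transform. The Haar wavelet, the three-stage depth, soft thresholding, the universal threshold, and all function names are assumptions made for this example and are not taken from the paper.

```python
import math
import statistics

SQRT2 = math.sqrt(2.0)


def haar_dwt(x):
    """One analysis stage: split an even-length signal into (approximation, detail)."""
    approx = [(x[i] + x[i + 1]) / SQRT2 for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / SQRT2 for i in range(0, len(x), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Invert one stage produced by haar_dwt."""
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) / SQRT2)
        out.append((a - d) / SQRT2)
    return out


def soft_threshold(c, t):
    """Shrink a coefficient toward zero by t; values within [-t, t] become zero."""
    return math.copysign(max(abs(c) - t, 0.0), c)


def multistage_dwt_denoise(signal, stages=3):
    """Denoise by thresholding the detail coefficients of a multi-stage Haar DWT.

    Assumed design (not from the paper): Haar wavelet, soft thresholding,
    and the universal threshold sigma * sqrt(2 * ln(N)).
    """
    n = len(signal)
    if stages < 1 or n == 0 or n % (2 ** stages) != 0:
        raise ValueError("signal length must be a non-zero multiple of 2**stages")

    # Analysis: repeatedly split the approximation band.
    approx = [float(v) for v in signal]
    details = []
    for _ in range(stages):
        approx, detail = haar_dwt(approx)
        details.append(detail)

    # Estimate the noise level from the finest-scale details
    # (median absolute value scaled for Gaussian noise).
    sigma = statistics.median(abs(d) for d in details[0]) / 0.6745
    threshold = sigma * math.sqrt(2.0 * math.log(n))

    # Synthesis: rebuild from the coarsest stage back to the finest.
    for detail in reversed(details):
        shrunk = [soft_threshold(d, threshold) for d in detail]
        approx = haar_idwt(approx, shrunk)
    return approx
```

A call such as `multistage_dwt_denoise(samples, stages=3)` returns a list of the same length as `samples`. In practice a smoother wavelet family and a per-stage threshold may suit speech better than this Haar-based sketch.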
Pages: 66135-66173
Page count: 39