Automated Classification of Depression Severity Using Speech - A Comparison of Two Machine Learning Architectures

Cited by: 4
Authors
Aharonson, Vered [1]
de Nooy, Alexandra [1]
Bulkin, Seth [1]
Sessel, Gareth [1]
Affiliations
[1] Univ Witwatersrand, Sch Elect & Informat Engn, Johannesburg, South Africa
Source
2020 8TH IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2020) | 2020
Keywords
Speech analytics; Speech signal processing for healthcare; Depression recognition; Disease severity discrimination; Multi-stage classifiers
DOI
10.1109/ICHI48887.2020.9374335
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Depression affects approximately 300 million people worldwide, resulting in significant suffering and economic costs. Millions of sufferers remain undiagnosed and untreated because of a shortage of trained personnel, social stigma, and expensive treatments. This study presents and compares two novel machine learning architectures for predicting depression severity from audio recordings. The data were taken from the Distress Analysis Interview Corpus, which contains recordings of 189 participant interviews and the participants' Patient Health Questionnaire 8 (PHQ-8) depression scores. Feature extraction and feature selection were performed on the participants' speech, and two machine learning architectures were designed to provide prediction models for depression severity. In the first architecture, participants' data were initially classified into depressed or not-depressed classes, and a regression model was trained on each class. The second architecture sorted the data into depression severity classes, which were then used in addition to the original features to predict the depression scores. The second architecture outperformed the first in both the classification and regression stages, achieving a root-mean-square error (RMSE) of 4.1, a significant improvement over previous studies that reported RMSE values of 6.32 to 6.94 for the same data. The results demonstrate the potential of a speech-based depression screening tool that could assist healthcare professionals in diagnosing and monitoring patients, and that could provide a scalable screening method enabling individuals to recognise their illness and seek professional help.
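As a rough illustration of two ideas in the abstract, the sketch below shows how a PHQ-8 score could be mapped to a severity class, how that class could be appended to a feature vector (the core idea of the second architecture), and how RMSE is computed. The function names are hypothetical, and the severity bands are the conventional PHQ-8 cut-points; the record does not state which classes, features, or models the authors actually used, so this is not their implementation.

```python
import math

# Conventional PHQ-8 severity bands as (inclusive upper bound, label).
# Assumption: the paper's own class boundaries are not given in this record.
PHQ8_BANDS = [
    (4, "none/minimal"),
    (9, "mild"),
    (14, "moderate"),
    (19, "moderately severe"),
    (24, "severe"),
]


def severity_class(score):
    """Return the index of the severity band containing a PHQ-8 score (0-24)."""
    if not 0 <= score <= 24:
        raise ValueError("PHQ-8 scores range from 0 to 24")
    for index, (upper, _label) in enumerate(PHQ8_BANDS):
        if score <= upper:
            return index


def augment_with_class(features, class_index):
    """Append a severity class to the original features as a regressor input."""
    return list(features) + [float(class_index)]


def rmse(true_scores, predicted_scores):
    """Root-mean-square error between true and predicted scores."""
    if len(true_scores) != len(predicted_scores) or not true_scores:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(t - p) ** 2 for t, p in zip(true_scores, predicted_scores)]
    return math.sqrt(sum(squared) / len(squared))
```

In a real pipeline the class would come from a trained classifier rather than from the true score, and the augmented vector would be passed to a trained regression model; RMSE is expressed in PHQ-8 points, so a value of 4.1 means predictions deviate from the questionnaire score by about four points in the root-mean-square sense.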
Pages: 128-131
Page count: 4