The voice of depression: speech features as biomarkers for major depressive disorder

被引:1
作者
Menne, Felix [1 ]
Doerr, Felix [1 ]
Schraeder, Julia [2 ,3 ]
Troeger, Johannes [1 ]
Habel, Ute [2 ,3 ]
Koenig, Alexandra [1 ,4 ]
Wagels, Lisa [2 ,3 ]
机构
[1] Ki Elements GmbH, Bleichstr 27, D-66111 Saarbrucken, Germany
[2] Rhein Westfal TH Aachen, Dept Psychiat Psychotherapy & Psychosomat, Aachen, Germany
[3] Res Ctr Julich, Inst Neurosci & Med, JARA Inst Brain Struct Funct Relationship INM 10, Julich, Germany
[4] Univ Cote Azur, Ctr Hosp & Univ, Ctr Memoire Ressources & Rech, Clin Geriatr Cerveau & Mouvement, Nice, France
关键词
Precision psychiatry; Depression; Speech biomarkers; Machine learning; POSTTRAUMATIC-STRESS-DISORDER; ACOUSTIC MEASURES; SEVERITY;
D O I
10.1186/s12888-024-06253-6
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
Background Psychiatry faces a challenge due to the lack of objective biomarkers, as current assessments are based on subjective evaluations. Automated speech analysis shows promise in detecting symptom severity in depressed patients. This project aimed to identify discriminating speech features between patients with major depressive disorder (MDD) and healthy controls (HCs) by examining associations with symptom severity measures. Methods Forty-four MDD patients from the Psychiatry Department, University Hospital Aachen, Germany and fifty-two HCs were recruited. Participants described positive and negative life events, which were recorded for analysis. The Beck Depression Inventory (BDI-II) and the Hamilton Rating Scale for Depression gauged depression severity. Transcribed audio recordings underwent feature extraction, including acoustics, speech rate, and content. Machine learning models including speech features and neuropsychological assessments, were used to differentiate between the MDD patients and HCs. Results Acoustic variables such as pitch and loudness differed significantly between the MDD patients and HCs (effect sizes eta 2 between 0.183 and 0.3, p < 0.001). Furthermore, variables pertaining to temporality, lexical richness, and speech sentiment displayed moderate to high effect sizes (eta 2 between 0.062 and 0.143, p < 0.02). A support vector machine (SVM) model based on 10 acoustic features showed a high performance (AUC = 0.93) in differentiating between HCs and patients with MDD, comparable to an SVM based on the BDI-II (AUC = 0.99, p = 0.01). Conclusions This study identified robust speech features associated with MDD. A machine learning model based on speech features yielded similar results to an established pen-and-paper depression assessment. In the future, these findings may shape voice-based biomarkers, enhancing clinical diagnosis and MDD monitoring.
引用
收藏
页数:17
相关论文
共 83 条
  • [41] Detecting subtle signs of depression with automated speech analysis in a non-clinical sample
    Koenig, Alexandra
    Troeger, Johannes
    Mallick, Elisa
    Mina, Mario
    Linz, Nicklas
    Wagnon, Carole
    Karbach, Julia
    Kuhn, Caroline
    Peter, Jessica
    [J]. BMC PSYCHIATRY, 2022, 22 (01)
  • [42] Predicting Depression Severity from Spontaneous Speech as Prompted by a Virtual Agent
    Konig, A.
    Mina, M.
    Schaefer, S.
    Linz, N.
    Troeger, J.
    [J]. EUROPEAN PSYCHIATRY, 2023, 66 : S157 - S158
  • [43] Measuring neuropsychiatric symptoms in patients with early cognitive decline using speech analysis
    Konig, Alexandra
    Mallick, Elisa
    Troger, Johannes
    Linz, Nicklas
    Zeghari, Radia
    Manera, Valeria
    Robert, Philippe
    [J]. EUROPEAN PSYCHIATRY, 2021, 64 (01)
  • [44] Detecting Apathy in Older Adults with Cognitive Disorders Using Automatic Speech Analysis
    Konig, Alexandra
    Linz, Nicklas
    Zeghari, Radia
    Klinge, Xenia
    Troeger, Johannes
    Alexandersson, Jan
    Robert, Philippe
    [J]. JOURNAL OF ALZHEIMERS DISEASE, 2019, 69 (04) : 1183 - 1193
  • [45] Speech as a Biomarker for Depression
    Koops, Sanne
    Brederoo, Sanne G.
    de Boer, Janna N.
    Nadema, Femke G.
    Voppel, Alban E.
    Sommer, Iris E.
    [J]. CNS & NEUROLOGICAL DISORDERS-DRUG TARGETS, 2023, 22 (02) : 152 - 160
  • [46] Kraskov A, 2004, PHYS REV E, V69, DOI 10.1103/PhysRevE.69.066138
  • [47] Depression Severity Detection Using Read Speech with a Divide-and-Conquer Approach
    Kwon, Namhee
    Kim, Samuel
    [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 633 - 637
  • [48] Ladefoged PA., 1975, COURSE PHONETICS
  • [49] Ladefoged Peter., 1996, Elements of Acoustic Phonetics, V2nd
  • [50] Applications of machine learning algorithms to predict therapeutic outcomes in depression: A meta-analysis and systematic review
    Lee, Yena
    Ragguett, Renee-Marie
    Mansur, Rodrigo B.
    Boutilier, Justin J.
    Rosenblat, Joshua D.
    Trevizol, Alisson
    Brietzke, Elisa
    Lin, Kangguang
    Pan, Zihang
    Subramaniapillai, Mehala
    Chan, Timothy C. Y.
    Fus, Dominika
    Park, Caroline
    Musial, Natalie
    Zuckerman, Hannah
    Chen, Vincent Chin-Hung
    Ho, Roger
    Rong, Carola
    McIntyre, Roger S.
    [J]. JOURNAL OF AFFECTIVE DISORDERS, 2018, 241 : 519 - 532