Detection of Suicidal Ideation in Clinical Interviews for Depression Using Natural Language Processing and Machine Learning: Cross-Sectional Study

被引:4
作者
Li, Tim M. H. [1 ,8 ]
Chen, Jie [1 ]
Law, Framenia O. C. [1 ]
Li, Chun-Tung [1 ]
Chan, Ngan Yin [1 ]
Chan, Joey W. Y. [1 ]
Chau, Steven W. H. [1 ]
Liu, Yaping [1 ]
Li, Shirley Xin [2 ,3 ]
Zhang, Jihui [1 ,4 ,5 ]
Leung, Kwong-Sak [6 ,7 ]
Wing, Yun-Kwok [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Psychiat, Li Chiu Kong Family Sleep Assessment Unit, Hong Kong, Peoples R China
[2] Univ Hong Kong, Dept Psychol, Hong Kong, Peoples R China
[3] Univ Hong Kong, State Key Lab Brain & Cognit Sci, Hong Kong, Peoples R China
[4] Guangdong Gen Hosp, Guangdong Mental Hlth Ctr, Guangzhou, Guangdong, Peoples R China
[5] Guangdong Acad Med Sci, Guangzhou, Guangdong, Peoples R China
[6] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[7] Hong Kong Shue Yan Univ, Dept Appl Data Sci, Hong Kong, Peoples R China
[8] Chinese Univ Hong Kong, Dept Psychiat, Li Chiu Kong Family Sleep Assessment Unit, Ma On Shan, 7-F Shatin Hosp,33 A Kung Kok St, Hong Kong, Peoples R China
关键词
depression; suicidal ideation; clinical interview; machine learning; natural language processing; automated detection; METAANALYSIS; WORDS;
D O I
10.2196/50221
中图分类号
R-058 [];
学科分类号
摘要
Background: Assessing patients' suicide risk is challenging, especially among those who deny suicidal ideation. Primary care providers have poor agreement in screening suicide risk. Patients' speech may provide more objective, language-based clues about their underlying suicidal ideation. Text analysis to detect suicide risk in depression is lacking in the literature. Objective: This study aimed to determine whether suicidal ideation can be detected via language features in clinical inter-views for depression using natural language processing (NLP) and machine learning (ML). Methods: This cross-sectional study recruited 305 participants between October 2020 and May 2022 (mean age 53.0, SD 11.77 years; female: n=176, 57%), of which 197 had lifetime depression and 108 were healthy. This study was part of ongoing research on characterizing depression with a case-control design. In this study, 236 participants were nonsuicidal, while 56 and 13 had low and high suicide risks, respectively. The structured interview guide for the Hamilton Depression Rating Scale (HAMD) was adopted to assess suicide risk and depression severity. Suicide risk was clinician rated based on a suicide-related question (H11). The interviews were transcribed and the words in participants' verbal responses were translated into psychologically meaningful categories using Linguistic Inquiry and Word Count (LIWC). Results: Ordinal logistic regression revealed significant suicide-related language features in participants' responses to the HAMD questions. Increased use of anger words when talking about work and activities posed the highest suicide risk (odds ratio [OR] 2.91, 95% CI 1.22-8.55; P=.02). Random forest models demonstrated that text analysis of the direct responses to H11 was effective in identifying individuals with high suicide risk (AUC 0.76-0.89; P<.001) and detecting suicide risk in general, including both low and high suicide risk (AUC 0.83-0.92; P<.001). More importantly, suicide risk can be detected with satisfactory performance even without patients' disclosure of suicidal ideation. Based on the response to the question on hypochondriasis, ML models were trained to identify individuals with high suicide risk (AUC 0.76; P<.001). Conclusions: This study examined the perspective of using NLP and ML to analyze the texts from clinical interviews for suicidality detection, which has the potential to provide more accurate and specific markers for suicidal ideation detection. The findings may pave the way for developing high-performance assessment of suicide risk for automated detection, including online chatbot-based interviews for universal screening.
引用
收藏
页数:13
相关论文
共 41 条
  • [1] Agresti A., 2003, CATEGORICAL DATA ANA
  • [2] [Anonymous], 2021, Suicide worldwide in 2019
  • [3] The benefits and risks of asking research participants about suicide: A meta-analysis of the impact of exposure to suicide-related content
    Blades, Caroline A.
    Stritzke, Werner G. K.
    Page, Andrew C.
    Brown, Julia D.
    [J]. CLINICAL PSYCHOLOGY REVIEW, 2018, 64 : 1 - 12
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] The role of language in the experience and perception of emotion: a neuroimaging meta-analysis
    Brooks, Jeffrey A.
    Shablack, Holly
    Gendron, Maria
    Satpute, Ajay B.
    Parrish, Michael H.
    Lindquist, Kristen A.
    [J]. SOCIAL COGNITIVE AND AFFECTIVE NEUROSCIENCE, 2017, 12 (02) : 169 - 183
  • [6] FINDING PEOPLE WITH EMOTIONAL DISTRESS IN ONLINE SOCIAL MEDIA: A DESIGN COMBINING MACHINE LEARNING AND RULE-BASED CLASSIFICATION
    Chau, Michael
    Li, Tim M. H.
    Wong, Paul W. C.
    Xu, Jennifer J.
    Yip, Paul S. F.
    Chen, Hsinchun
    [J]. MIS QUARTERLY, 2020, 44 (02) : 933 - 955
  • [7] A forgotten sign of depression - the omega sign and its implication
    Chen, Jie
    Li, Chun-Tung
    Li, Tim M. H.
    Chan, Ngan Yin
    Chan, Joey W. Y.
    Liu, Yaping
    Lee, Tatia M. C.
    Wing, Yun-Kwok
    [J]. ASIAN JOURNAL OF PSYCHIATRY, 2023, 80
  • [8] Youths' attitudes toward open discussion of suicide, preferred contexts, and the impact of Internet use: An exploratory sequential mixed-methods study in Hong Kong
    Chen, Sikky Shiqi
    Lam, Tai Pong
    Lam, Kwok Fai
    Lo, Tak Lam
    Chao, David Vai Kiong
    Mak, Ki Yan
    Lam, Edmund Wing Wo
    Tang, Wai Sin
    Chan, Hoi Yan
    Yip, Paul Siu Fai
    [J]. INTERNATIONAL JOURNAL OF SOCIAL PSYCHIATRY, 2023, 69 (03) : 575 - 586
  • [9] Applying text mining methods to suicide research
    Cheng, Qijin
    Lui, Carrie S. M.
    [J]. SUICIDE AND LIFE-THREATENING BEHAVIOR, 2021, 51 (01) : 137 - 147
  • [10] Assessing Suicide Risk and Emotional Distress in Chinese Social Media: A Text Mining and Machine Learning Study
    Cheng, Qijin
    Li, Tim M. H.
    Kwok, Chi-Leung
    Zhu, Tingshao
    Yip, Paul S. F.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2017, 19 (07)