Predicting adolescent suicidal behavior following inpatient discharge using structured and unstructured data

被引:0
|
作者
Carson, Nicholas J. [1 ]
Yang, Xinyu [2 ]
Mullin, Brian [1 ]
Stettenbauer, Elizabeth [5 ]
Waddington, Marin [3 ]
Zhang, Alice [4 ]
Williams, Peyton [1 ]
Perez, Gabriel E. Rios [1 ]
Le Cook, Benjamin [1 ]
机构
[1] Cambridge Hlth Alliance, Hlth Equ Res Lab, 1035 Cambridge St, Cambridge, MA 02139 USA
[2] Parexel, 275 Grove St,Suite 101C, Newton, MA 02466 USA
[3] Brigham & Womens Hosp, Resnek Family Ctr PSC Res, Div Gastroenterol, 75 Francis St, Boston, MA 02115 USA
[4] NYU, Dept Psychol, 6 Washington Pl, New York, NY 10003 USA
[5] Brown Univ, Sch Publ Hlth, Providence, RI 02903 USA
关键词
Suicide; Adolescence; Risk; Patient discharge; Machine learning; Electronic health records; AFTER-DISCHARGE;
D O I
10.1016/j.jad.2023.12.059
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
Background: The objective was to develop and assess performance of an algorithm predicting suicide -related ICD codes within three months of psychiatric discharge. Methods: This prognostic study used a retrospective cohort of EHR data from 2789 youth (12 to 20 years old) hospitalized in a safety net institution in the Northeastern United States. The dataset combined structured data with unstructured data obtained through natural language processing of clinical notes. Machine learning approaches compared gradient boosting to random forest analyses. Results: Area under the ROC and precision -recall curve were 0.88 and 0.17, respectively, for the final Gradient Boosting model. The cutoff point of the model -generated predicted probabilities of suicide that optimally classified the individual as high risk or not was 0.009. When applying the chosen cutoff (0.009) to the hold -out testing set, the model correctly identified 8 positive cases out of 10, and 418 negative cases out 548. The corresponding performance metrics showed 80 % sensitivity, 76 % specificity, 6 % PPV, 99 % NPV, F-1 score of 0.11, and an accuracy of 76 %. Limitations: The data in this study comes from a single health system, possibly introducing bias in the model's algorithm. Thus, the model may have underestimated the incidence of suicidal behavior in the study population. Further research should include multiple system EHRs. Conclusions: These performance metrics suggest a benefit to including both unstructured and structured data in design of predictive algorithms for suicidal behavior, which can be integrated into psychiatric services to help assess risk.
引用
收藏
页码:382 / 387
页数:6
相关论文
共 50 条
  • [21] Predicting risk of suicidal behavior from insurance claims data vs. linked data from insurance claims and electronic health records
    Simon, Gregory E.
    Shortreed, Susan M.
    Johnson, Eric
    Yaseen, Zimri S.
    Stone, Marc
    Mosholder, Andrew D.
    Ahmedani, Brian K.
    Coleman, Karen J.
    Coley, R. Yates
    Penfold, Robert B.
    Toh, Sengwee
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2024, 33 (01)
  • [22] Using hospital Admission, Discharge & Transfer (ADT) data for predicting readmissions
    Saha, Pronojit
    Sircar, Reelina
    Bose, Arnab
    MACHINE LEARNING WITH APPLICATIONS, 2021, 5
  • [23] Development and application of pharmacological statin-associated muscle symptoms phenotyping algorithms using structured and unstructured electronic health records data
    Sun, Boguang
    Yew, Pui Ying
    Chi, Chih-Lin
    Song, Meijia
    Loth, Matt
    Zhang, Rui
    Straka, Robert J.
    JAMIA OPEN, 2023, 6 (04)
  • [24] A tree based approach for multi-class classification of surgical procedures using structured and unstructured data
    Tannaz Khaleghi
    Alper Murat
    Suzan Arslanturk
    BMC Medical Informatics and Decision Making, 21
  • [25] Outcome-specific Charlson Comorbidity Indices for Predicting Poor Inpatient Outcomes Following Noncardiac Surgery Using Hospital Administrative Data
    Moodley, Yoshan
    MEDICAL CARE, 2016, 55 (12) : 1082 - 1088
  • [26] The effect of the SAFE intervention on post-discharge suicidal behavior: a quasi-experimental study using propensity score matching
    Madsen, Trine
    Erlangsen, Annette
    Egilsdottir, Eybjorg
    Andersen, Per Kragh
    Nordentoft, Merete
    SOCIAL PSYCHIATRY AND PSYCHIATRIC EPIDEMIOLOGY, 2024, 59 (06) : 1053 - 1061
  • [27] Clinical notes as prognostic markers of mortality associated with diabetes mellitus following critical care: A retrospective cohort analysis using machine learning and unstructured big data
    De Silva, Kushan
    Mathews, Noel
    Teede, Helena
    Forbes, Andrew
    Jonsson, Daniel
    Demmer, Ryan T.
    Enticott, Joanne
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 132
  • [28] A tree based approach for multi-class classification of surgical procedures using structured and unstructured data
    Khaleghi, Tannaz
    Murat, Alper
    Arslanturk, Suzan
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
  • [29] Clinical Prediction Models for Hospital-Induced Delirium Using Structured and Unstructured Electronic Health Record Data: Protocol for a Development and Validation Study
    Ser, Sarah E.
    Shear, Kristen
    Snigurska, Urszula A.
    Prosperi, Mattia
    Wu, Yonghui
    Magoc, Tanja
    Bjarnadottir, Ragnhildur, I
    Lucero, Robert J.
    JMIR RESEARCH PROTOCOLS, 2023, 12
  • [30] Predicting Adolescent Female Stress with Wearable Device Data Using Machine and Deep Learning
    Jin, Claire
    Osotsi, Ame
    Oravecz, Zita
    2023 IEEE 19TH INTERNATIONAL CONFERENCE ON BODY SENSOR NETWORKS, BSN, 2023,