Predicting adolescent suicidal behavior following inpatient discharge using structured and unstructured data

Authors
Carson, Nicholas J. [1]
Yang, Xinyu [2]
Mullin, Brian [1]
Stettenbauer, Elizabeth [5]
Waddington, Marin [3]
Zhang, Alice [4]
Williams, Peyton [1]
Perez, Gabriel E. Rios [1]
Le Cook, Benjamin [1]
Affiliations
[1] Cambridge Hlth Alliance, Hlth Equ Res Lab, 1035 Cambridge St, Cambridge, MA 02139 USA
[2] Parexel, 275 Grove St,Suite 101C, Newton, MA 02466 USA
[3] Brigham & Womens Hosp, Resnek Family Ctr PSC Res, Div Gastroenterol, 75 Francis St, Boston, MA 02115 USA
[4] NYU, Dept Psychol, 6 Washington Pl, New York, NY 10003 USA
[5] Brown Univ, Sch Publ Hlth, Providence, RI 02903 USA
Keywords
Suicide; Adolescence; Risk; Patient discharge; Machine learning; Electronic health records; After-discharge
DOI
10.1016/j.jad.2023.12.059
Chinese Library Classification number
R74 [Neurology and Psychiatry]
Abstract
Background: The objective was to develop and assess the performance of an algorithm predicting suicide-related ICD codes within three months of psychiatric discharge. Methods: This prognostic study used a retrospective cohort of EHR data from 2789 youth (12 to 20 years old) hospitalized in a safety net institution in the Northeastern United States. The dataset combined structured data with unstructured data obtained through natural language processing of clinical notes. Machine learning approaches compared gradient boosting to random forest analyses. Results: The areas under the ROC and precision-recall curves were 0.88 and 0.17, respectively, for the final gradient boosting model. The cutoff point of the model-generated predicted probabilities of suicide that optimally classified the individual as high risk or not was 0.009. When the chosen cutoff (0.009) was applied to the hold-out testing set, the model correctly identified 8 positive cases out of 10 and 418 negative cases out of 548. The corresponding performance metrics showed 80% sensitivity, 76% specificity, 6% PPV, 99% NPV, an F1 score of 0.11, and an accuracy of 76%. Limitations: The data in this study come from a single health system, possibly introducing bias in the model's algorithm. Thus, the model may have underestimated the incidence of suicidal behavior in the study population. Further research should include EHRs from multiple health systems. Conclusions: These performance metrics suggest a benefit to including both unstructured and structured data in the design of predictive algorithms for suicidal behavior, which can be integrated into psychiatric services to help assess risk.
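The reported hold-out metrics can be checked against the stated counts. The following is a minimal Python sketch, not the authors' code: the class and function names are hypothetical, and the false-negative and false-positive counts (2 and 130) are derived here from the abstract's totals (10 positives, 548 negatives) rather than stated in it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Hold-out confusion-matrix counts (hypothetical container)."""
    tp: int  # true positives
    fn: int  # false negatives
    tn: int  # true negatives
    fp: int  # false positives


def compute_metrics(c: ConfusionCounts) -> dict[str, float]:
    """Standard binary-classification metrics from confusion counts."""
    sensitivity = c.tp / (c.tp + c.fn)   # recall among true positives
    specificity = c.tn / (c.tn + c.fp)   # recall among true negatives
    ppv = c.tp / (c.tp + c.fp)           # precision
    npv = c.tn / (c.tn + c.fn)
    f1 = 2 * ppv * sensitivity / (ppv + sensitivity)
    accuracy = (c.tp + c.tn) / (c.tp + c.fn + c.tn + c.fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": ppv,
        "npv": npv,
        "f1": f1,
        "accuracy": accuracy,
    }


# From the abstract: 8 of 10 positives and 418 of 548 negatives were
# classified correctly. FN and FP are derived (assumption): 10 - 8, 548 - 418.
counts = ConfusionCounts(tp=8, fn=10 - 8, tn=418, fp=548 - 418)
metrics = compute_metrics(counts)

for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Under these derived counts the values agree with the abstract up to rounding; NPV works out to about 0.995, which the abstract reports as 99%. The very low PPV alongside high sensitivity reflects the low base rate of the outcome in the test set (10 of 558 cases).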
Pages: 382-387
Number of pages: 6