Identification of suicidal behavior among psychiatrically hospitalized adolescents using natural language processing and machine learning of electronic health records

被引:66
作者
Carson, Nicholas J. [1 ,2 ]
Mullin, Brian [1 ]
Sanchez, Maria Jose [1 ,3 ]
Lu, Frederick [1 ]
Yang, Kelly [1 ,4 ]
Menezes, Michelle [1 ,5 ]
Le Cook, Benjamin [1 ,2 ]
机构
[1] Cambridge Hlth Alliance, Hlth Equ Res Lab, Cambridge, MA 02139 USA
[2] Harvard Med Sch, Dept Psychiat, Boston, MA 02115 USA
[3] George Washington Univ, Prevent & Community Hlth Dept, Milken Sch Publ Hlth, Washington, DC USA
[4] Albert Einstein Coll Med, Dept Psychiat, Bronx, NY USA
[5] Univ Virginia, Charlottesville, VA USA
基金
美国国家卫生研究院;
关键词
RISK-FACTORS; IDEATION; RESILIENCE; PREDICTION; DIAGNOSES; VALIDITY; SUPPORT; TIME; CARE; AGE;
D O I
10.1371/journal.pone.0211116
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Objective The rapid proliferation of machine learning research using electronic health records to classify healthcare outcomes offers an opportunity to address the pressing public health problem of adolescent suicidal behavior. We describe the development and evaluation of a machine learning algorithm using natural language processing of electronic health records to identify suicidal behavior among psychiatrically hospitalized adolescents. Methods Adolescents hospitalized on a psychiatric inpatient unit in a community health system in the northeastern United States were surveyed for history of suicide attempt in the past 12 months. A total of 73 respondents had electronic health records available prior to the index psychiatric admission. Unstructured clinical notes were downloaded from the year preceding the index inpatient admission. Natural language processing identified phrases from the notes associated with the suicide attempt outcome. We enriched this group of phrases with a clinically focused list of terms representing known risk and protective factors for suicide attempt in adolescents. We then applied the random forest machine learning algorithm to develop a classification model. The model performance was evaluated using sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy. Results The final model had a sensitivity of 0.83, specificity of 0.22, AUC of 0.68, a PPV of 0.42, NPV of 0.67, and an accuracy of 0.47. The terms mostly highly associated with suicide attempt clustered around terms related to suicide, family members, psychiatric disorders, and psychotropic medications. Conclusion This analysis demonstrates modest success of a natural language processing and machine learning approach to identifying suicide attempt among a small sample of hospitalized adolescents in a psychiatric setting.
引用
收藏
页数:14
相关论文
共 65 条
[1]   Monitoring Suicidal Patients in Primary Care Using Electronic Health Records [J].
Anderson, Heather D. ;
Pace, Wilson D. ;
Brandt, Elias ;
Nielsen, Rodney D. ;
Allen, Richard R. ;
Libby, Anne M. ;
West, David R. ;
Valuck, Robert J. .
JOURNAL OF THE AMERICAN BOARD OF FAMILY MEDICINE, 2015, 28 (01) :65-71
[2]  
[Anonymous], 2018, YOUTH RISK BEH SURV
[3]  
[Anonymous], SUIC RISK PROT FACT
[4]  
[Anonymous], 2018, THINKING ITS OWN NHS
[5]  
[Anonymous], DULCANS TXB CHILD AD
[6]  
[Anonymous], INVENIO
[7]  
[Anonymous], TEEN SUIC IS PREV 20
[8]  
[Anonymous], J AM ACAD CHILD ADOL
[9]  
[Anonymous], 2017, MMWR MORBIDITY MORTA
[10]  
[Anonymous], TEEN SUICIDES WHAT A