Using weak supervision and deep learning to classify clinical notes for identification of current suicidal ideation

Cited by: 35
Authors
Cusick, Marika [1, 2]
Adekkanattu, Prakash [1]
Campion, Thomas R., Jr. [1, 2]
Sholle, Evan T. [1]
Myers, Annie [2]
Banerjee, Samprit [2]
Alexopoulos, George [3]
Wang, Yanshan [4, 5]
Pathak, Jyotishman [2, 3]
Affiliations
[1] Weill Cornell Med, Dept Informat & Technol Serv, New York, NY 10065 USA
[2] Weill Cornell Med, Dept Populat Hlth Sci, New York, NY 10065 USA
[3] Weill Cornell Med, Dept Psychiat, New York, NY 10065 USA
[4] Mayo Clin, Div Digital Hlth Sci, Rochester, MN USA
[5] Mayo Clin, Dept Hlth Sci Res, Rochester, MN USA
Keywords
Weak supervision; Deep learning; Natural language processing; Machine learning; Suicidal ideation
DOI
10.1016/j.jpsychires.2021.01.052
Chinese Library Classification number
R749 [Psychiatry]
Discipline classification code
100205
Abstract
Mental health concerns, such as suicidal thoughts, are frequently documented by providers in clinical notes, as opposed to structured coded data. In this study, we evaluated weakly supervised methods for detecting "current" suicidal ideation from unstructured clinical notes in electronic health record (EHR) systems. Weakly supervised machine learning methods leverage imperfect labels for training, alleviating the burden of creating a large manually annotated dataset. After identifying a cohort of 600 patients at risk for suicidal ideation, we used a rule-based natural language processing (NLP) approach to label the training and validation notes (n = 17,978). Using this large corpus of clinical notes, we trained several statistical machine learning models (logistic classifier, support vector machines (SVM), and Naive Bayes classifier) and one deep learning model, namely a text classification convolutional neural network (CNN), to be evaluated on a manually reviewed test set (n = 837). The CNN model outperformed all other methods, achieving an overall accuracy of 94% and an F1-score of 0.82 on documents with "current" suicidal ideation. This algorithm correctly identified an additional 42 encounters and 9 patients indicative of suicidal ideation but missing a structured diagnosis code. When applied to a random subset of 5,000 clinical notes, the algorithm classified 0.46% (n = 23) for "current" suicidal ideation, of which 87% were truly indicative via manual review. Implementation of this approach for large-scale document screening may play an important role in point-of-care clinical information systems for targeted suicide prevention interventions and improve research on the pathways from ideation to attempt.
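The weak supervision workflow summarized in the abstract (rule-based labels used as imperfect training targets, then a statistical classifier scored by F1) can be sketched roughly as follows. This is a minimal illustration only, not the authors' pipeline: the cue lists, the sentence-level exclusion rule, the function names (`weak_label`, `train_naive_bayes`, `predict`, `f1_score`), and the example notes are all hypothetical, and a small Naive Bayes model stands in for the classifiers named in the abstract.

```python
import math
import re
from collections import Counter

# Hypothetical cue lists; the study's actual NLP rules are not given in the abstract.
IDEATION = re.compile(r"\b(suicidal ideation|suicidal thoughts|wants to die)\b", re.I)
EXCLUDE = re.compile(r"\b(denies|denied|no|without|history of|past)\b", re.I)


def weak_label(note):
    """Return 1 if any sentence has an ideation cue and no negation/history cue."""
    for sentence in re.split(r"[.;\n]+", note):
        if IDEATION.search(sentence) and not EXCLUDE.search(sentence):
            return 1
    return 0


def tokenize(text):
    return re.findall(r"[a-z']+", text.lower())


def train_naive_bayes(notes, labels):
    """Fit a bag-of-words Naive Bayes model on (possibly noisy) labels."""
    counts = {0: Counter(), 1: Counter()}
    docs = Counter(labels)
    for note, y in zip(notes, labels):
        counts[y].update(tokenize(note))
    vocab = set(counts[0]) | set(counts[1])
    return counts, docs, vocab


def predict(model, note):
    """Return the class (0 or 1) with the higher Laplace-smoothed log score."""
    counts, docs, vocab = model
    total_docs = sum(docs.values())
    scores = {}
    for y in (0, 1):
        total = sum(counts[y].values())
        score = math.log((docs[y] + 1) / (total_docs + 2))
        for tok in tokenize(note):
            if tok in vocab:
                score += math.log((counts[y][tok] + 1) / (total + len(vocab)))
        scores[y] = score
    return max(scores, key=scores.get)


def f1_score(y_true, y_pred):
    """F1 for the positive class: 2*TP / (2*TP + FP + FN)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    return 2 * tp / (2 * tp + fp + fn) if tp else 0.0


# Made-up notes for illustration; no real patient text.
train_notes = [
    "Patient reports suicidal ideation with a plan.",
    "Patient denies suicidal ideation.",
    "History of suicidal thoughts in 2015. Mood stable today.",
    "States she wants to die and feels hopeless.",
]
weak_labels = [weak_label(n) for n in train_notes]  # imperfect labels, no manual review
model = train_naive_bayes(train_notes, weak_labels)
```

In this sketch the rules only produce training labels; as in the study design, evaluation would use a separate manually reviewed test set, for example `f1_score(manual_labels, [predict(model, n) for n in test_notes])`, where `manual_labels` and `test_notes` are placeholders for that held-out data.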
Pages: 95-102
Page count: 8