Machine learning and natural language processing methods to identify ischemic stroke, acuity and location from radiology reports

被引:57
作者
Ong, Charlene Jennifer [1 ,2 ,3 ,4 ]
Orfanoudaki, Agni [4 ]
Zhang, Rebecca [4 ]
Caprasse, Francois Pierre M. [4 ]
Hutch, Meghan [1 ,2 ]
Ma, Liang [1 ]
Fard, Darian [1 ]
Balogun, Oluwafemi [1 ,2 ]
Miller, Matthew, I [1 ]
Minnig, Margaret [1 ]
Saglam, Hanife [3 ]
Prescott, Brenton [2 ]
Greer, David M. [1 ,2 ]
Smirnakis, Stelios [3 ]
Bertsimas, Dimitris [4 ,5 ]
机构
[1] Boston Univ, Sch Med, Boston, MA 02118 USA
[2] Boston Med Ctr, Boston, MA 02118 USA
[3] Harvard Med Sch, Boston, MA 02115 USA
[4] MIT, Operat Res Ctr, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[5] MIT, Sloan Sch Management, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
ANNOTATION;
D O I
10.1371/journal.pone.0234908
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate, automated extraction of clinical stroke information from unstructured text has several important applications. ICD-9/10 codes can misclassify ischemic stroke events and do not distinguish acuity or location. Expeditious, accurate data extraction could provide considerable improvement in identifying stroke in large datasets, triaging critical clinical reports, and quality improvement efforts. In this study, we developed and report a comprehensive framework studying the performance of simple and complex stroke-specific Natural Language Processing (NLP) and Machine Learning (ML) methods to determine presence, location, and acuity of ischemic stroke from radiographic text. We collected 60,564 Computed Tomography and Magnetic Resonance Imaging Radiology reports from 17,864 patients from two large academic medical centers. We used standard techniques to featurize unstructured text and developed neurovascular specific word GloVe embeddings. We trained various binary classification algorithms to identify stroke presence, location, and acuity using 75% of 1,359 expert-labeled reports. We validated our methods internally on the remaining 25% of reports and externally on 500 radiology reports from an entirely separate academic institution. In our internal population, GloVe word embeddings paired with deep learning (Recurrent Neural Networks) had the best discrimination of all methods for our three tasks (AUCs of 0.96, 0.98, 0.93 respectively). Simpler NLP approaches (Bag of Words) performed best with interpretable algorithms (Logistic Regression) for identifying ischemic stroke (AUC of 0.95), MCA location (AUC 0.96), and acuity (AUC of 0.90). Similarly, GloVe and Recurrent Neural Networks (AUC 0.92, 0.89, 0.93) generalized better in our external test set than BOW and Logistic Regression for stroke presence, location and acuity, respectively (AUC 0.89, 0.86, 0.80). Our study demonstrates a comprehensive assessment of NLP techniques for unstructured radiographic text. Our findings are suggestive that NLP/ML methods can be used to discriminate stroke features from large data cohorts for both clinical and research-related investigations.
引用
收藏
页数:16
相关论文
共 38 条
[1]  
[Anonymous], 2018, BULLWHIP EFFECT SUPP
[2]   Radiology report annotation using intelligent word embeddings: Applied to multi-institutional chest CT cohort [J].
Banerjee, Imon ;
Chen, Matthew C. ;
Lungren, Matthew P. ;
Rubin, Daniel L. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 77 :11-20
[3]  
Bertsimas D., 2019, Machine learning under a modern optimiza-tion lens
[4]   Optimal classification trees [J].
Bertsimas, Dimitris ;
Dunn, Jack .
MACHINE LEARNING, 2017, 106 (07) :1039-1082
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]  
Breiman L., 2017, Trees, DOI [DOI 10.1201/9781315139470, 10.1201/9781315139470-8, DOI 10.1201/9781315139470-8]
[7]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[8]  
Devlin J., 2018, ARXIV
[9]  
Dietterich T, 1996, 1 OR STAT U, V1, P1
[10]  
Fox GN, 2003, J FAM PRACTICE, V52, P706