Detecting Adverse Drug Events with Rapidly Trained Classification Models

被引:48
作者
Chapman, Alec B. [1 ]
Peterson, Kelly S. [2 ,3 ]
Alba, Patrick R. [2 ,3 ]
DuVall, Scott L. [2 ,3 ]
Patterson, Olga V. [2 ,3 ]
机构
[1] Hlth Fidel, San Mateo, CA USA
[2] Univ Utah, VA Salt Lake City Hlth Care Syst, Salt Lake City, UT 84112 USA
[3] Univ Utah, Div Epidemiol, Salt Lake City, UT 84112 USA
关键词
PHARMACOVIGILANCE; CHALLENGES; EXTRACTION;
D O I
10.1007/s40264-018-0763-y
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
IntroductionIdentifying occurrences of medication side effects and adverse drug events (ADEs) is an important and challenging task because they are frequently only mentioned in clinical narrative and are not formally reported.MethodsWe developed a natural language processing (NLP) system that aims to identify mentions of symptoms and drugs in clinical notes and label the relationship between the mentions as indications or ADEs. The system leverages an existing word embeddings model with induced word clusters for dimensionality reduction. It employs a conditional random field (CRF) model for named entity recognition (NER) and a random forest model for relation extraction (RE).ResultsFinal performance of each model was evaluated separately and then combined on a manually annotated evaluation set. The micro-averaged F1 score was 80.9% for NER, 88.1% for RE, and 61.2% for the integrated systems. Outputs from our systems were submitted to the NLP Challenges for Detecting Medication and Adverse Drug Events from Electronic Health Records (MADE 1.0) competition (Yu et al. in http://bio-nlp.org/index.php/projects/39-nlp-challenges, 2018). System performance was evaluated in three tasks (NER, RE, and complete system) with multiple teams submitting output from their systems for each task. Our RE system placed first in Task 2 of the challenge and our integrated system achieved third place in Task 3.ConclusionAdding to the growing number of publications thatutilize NLP to detect occurrences of ADEs, our study illustrates the benefits of employing innovative feature engineering.
引用
收藏
页码:147 / 156
页数:10
相关论文
共 41 条
[1]   Empirical estimation of under-reporting in the US Food and Drug Administration Adverse Event Reporting System (FAERS) [J].
Alatawi, Yasser M. ;
Hansen, Richard A. .
EXPERT OPINION ON DRUG SAFETY, 2017, 16 (07) :761-767
[2]  
[Anonymous], 2018, INT WORKSH MED ADV D
[3]  
[Anonymous], 2009, ICML
[4]   Extraction of Adverse Drug Effects from Clinical Records [J].
Aramaki, Eiji ;
Miura, Yasuhide ;
Tonoike, Masatsugu ;
Ohkuma, Tomoko ;
Masuichi, Hiroshi ;
Waki, Kayo ;
Ohe, Kazuhiko .
MEDINFO 2010, PTS I AND II, 2010, 160 :739-743
[5]  
Bird S., 2009, Natural language processing with Python: analyzing text with the natural language toolkit
[6]   Challenges in adapting existing clinical natural language processing systems to multiple, diverse health care settings [J].
Carrell, David S. ;
Schoen, Robert E. ;
Leffler, Daniel A. ;
Morris, Michele ;
Rose, Sherri ;
Baer, Andrew ;
Crockett, Seth D. ;
Gourevitch, Rebecca A. ;
Dean, Katie M. ;
Mehrotra, Ateev .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (05) :986-991
[7]   Opportunities and obstacles for deep learning in biology and medicine [J].
Ching, Travers ;
Himmelstein, Daniel S. ;
Beaulieu-Jones, Brett K. ;
Kalinin, Alexandr A. ;
Do, Brian T. ;
Way, Gregory P. ;
Ferrero, Enrico ;
Agapow, Paul-Michael ;
Zietz, Michael ;
Hoffman, Michael M. ;
Xie, Wei ;
Rosen, Gail L. ;
Lengerich, Benjamin J. ;
Israeli, Johnny ;
Lanchantin, Jack ;
Woloszynek, Stephen ;
Carpenter, Anne E. ;
Shrikumar, Avanti ;
Xu, Jinbo ;
Cofer, Evan M. ;
Lavender, Christopher A. ;
Turaga, Srinivas C. ;
Alexandari, Amr M. ;
Lu, Zhiyong ;
Harris, David J. ;
DeCaprio, Dave ;
Qi, Yanjun ;
Kundaje, Anshul ;
Peng, Yifan ;
Wiley, Laura K. ;
Segler, Marwin H. S. ;
Boca, Simina M. ;
Swamidass, S. Joshua ;
Huang, Austin ;
Gitter, Anthony ;
Greene, Casey S. .
JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2018, 15 (141)
[8]  
Comeau D.C., 2013, DATABASE, V2013
[9]   A Few Useful Things to Know About Machine Learning [J].
Domingos, Pedro .
COMMUNICATIONS OF THE ACM, 2012, 55 (10) :78-87
[10]  
Guo J, 2014, Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), P110