Evaluating automated approaches to anaphylaxis case classification using unstructured data from the FDA Sentinel System

被引:20
作者
Ball, Robert [1 ]
Toh, Sengwee [2 ,3 ]
Nolan, Jamie [2 ,3 ]
Haynes, Kevin [4 ]
Forshee, Richard [5 ]
Botsis, Taxiarchis [5 ,6 ]
机构
[1] US FDA, Ctr Drug Evaluat & Res, Off Surveillance & Epidemiol, Silver Spring, MD 20993 USA
[2] Harvard Med Sch, Dept Populat Med, Boston, MA USA
[3] Harvard Pilgrim Hlth Care Inst, Boston, MA USA
[4] HealthCore Inc, Translat Res Affordabil & Qual, Wilmington, DE USA
[5] US FDA, Ctr Biol Evaluat & Res, Off Biostat & Epidemiol, Silver Spring, MD USA
[6] Johns Hopkins Univ, Sch Med, Sidney Kimmel Comprehens Canc Ctr, Baltimore, MD USA
关键词
anaphylaxis; case classification; natural language processing; pharmacoepidemiology; sentinel system; validation; IDENTIFYING HEALTH OUTCOMES; EVENT REPORTING SYSTEM; TEXT MINING SYSTEM; CASE-DEFINITION; INFORMATION; SURVEILLANCE; FEATURES; RECORDS;
D O I
10.1002/pds.4645
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Introduction In May 2008, the Food and Drug Administration launched the Sentinel Initiative, a multi-year program for the establishment of a national electronic monitoring system for medical product safety that led, in 2016, to the launch of the full Sentinel System. Under the Mini-Sentinel pilot, several algorithms for identifying health outcomes of interest, including one for anaphylaxis, were developed and evaluated using data available from the Sentinel common data model. PurposeMethodsTo evaluate whether features extracted from unstructured narrative data using natural language processing (NLP) could be used to classify anaphylaxis cases. Using previously developed methods, we extracted features from unstructured narrative data using NLP and applied rule-based and similarity-based algorithms to identify anaphylaxis among 62 potential cases previously classified by human experts as anaphylaxis (N=33), not anaphylaxis (N=27), and unknown (N=2). ResultsConclusionsThe rule-based and similarity-based approaches demonstrated almost equal performance (recall 100% vs 100%, precision 60.3% vs 57.4%, F-measure: 0.753 vs 0.729). Reasons for misclassification included the inability of the algorithms to make the same clinical judgments as human experts about the timing, severity, or presence of alternative explanations; and the identification of terms consistent with anaphylaxis but present in conditions other than anaphylaxis. Although precision needs to be improved before these algorithms could be used without human review, we demonstrated that applying rule-based and similarity-based algorithms to unstructured narrative information from clinical records can be used for classification of anaphylaxis in the Sentinel System. Further development and assessment of these methods in the Sentinel System are warranted.
引用
收藏
页码:1077 / 1084
页数:8
相关论文
共 18 条
[1]   An overview of MetaMap: historical perspective and recent advances [J].
Aronson, Alan R. ;
Lang, Francois-Michel .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (03) :229-236
[2]   The FDA's Sentinel Initiative-A Comprehensive Approach to Medical Product Surveillance [J].
Ball, R. ;
Robb, M. ;
Anderson, S. A. ;
Dal Pan, G. .
CLINICAL PHARMACOLOGY & THERAPEUTICS, 2016, 99 (03) :265-268
[3]   Developing the Sentinel System - A National Resource for Evidence Development [J].
Behrman, Rachel E. ;
Benner, Joshua S. ;
Brown, Jeffrey S. ;
McClellan, Mark ;
Woodcock, Janet ;
Platt, Richard .
NEW ENGLAND JOURNAL OF MEDICINE, 2011, 364 (06) :498-499
[4]   The contribution of the Vaccine adverse event Text Mining system to the classification of possible Guillain-Barre Syndrome reports [J].
Botsis, T. ;
Woo, E. J. ;
Ball, R. .
APPLIED CLINICAL INFORMATICS, 2013, 4 (01) :88-99
[5]   Decision support environment for medical product safety surveillance [J].
Botsis, Taxiarchis ;
Jankosky, Christopher ;
Arya, Deepa ;
Kreimeyer, Kory ;
Foster, Matthew ;
Pandey, Abhishek ;
Wang, Wei ;
Zhang, Guangfan ;
Forshee, Richard ;
Goud, Ravi ;
Menschik, David ;
Walderhaug, Mark ;
Woo, Emily Jane ;
Scott, John .
JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 64 :354-362
[6]   Application of Information Retrieval Approaches to Case Classification in the Vaccine Adverse Event Reporting System [J].
Botsis, Taxiarchis ;
Woo, Emily Jane ;
Ball, Robert .
DRUG SAFETY, 2013, 36 (07) :573-582
[7]   Vaccine adverse event text mining system for extracting features from vaccine safety reports [J].
Botsis, Taxiarchis ;
Buttolph, Thomas ;
Nguyen, Michael D. ;
Winiecki, Scott ;
Woo, Emily Jane ;
Ball, Robert .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2012, 19 (06) :1011-1018
[8]   Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection [J].
Botsis, Taxiarchis ;
Nguyen, Michael D. ;
Woo, Emily Jane ;
Markatou, Marianthi ;
Ball, Robert .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) :631-638
[9]   Use abstracted patient-specific features to assist an information-theoretic measurement to assess similarity between medical cases [J].
Cao, Hui ;
Melton, Genevieve B. ;
Markatou, Marianthi ;
Hripcsak, George .
JOURNAL OF BIOMEDICAL INFORMATICS, 2008, 41 (06) :882-888
[10]   Mini-Sentinel's systematic reviews of validated methods for identifying health outcomes using administrative and claims data: methods and lessons learned [J].
Carnahan, Ryan M. ;
Moores, Kevin G. .
PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2012, 21 :82-89