Application of Information Retrieval Approaches to Case Classification in the Vaccine Adverse Event Reporting System

被引:16
作者
Botsis, Taxiarchis [1 ,2 ]
Woo, Emily Jane [1 ]
Ball, Robert [1 ]
机构
[1] US FDA, Off Biostat & Epidemiol, CBER, Rockville, MD 20852 USA
[2] Univ Tromso, Dept Comp Sci, Tromso, Norway
关键词
SIMILARITY; GUIDELINES; COLLECTION; FEATURES; MEDDRA;
D O I
10.1007/s40264-013-0064-4
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background Automating the classification of adverse event reports is an important step to improve the efficiency of vaccine safety surveillance. Previously we showed it was possible to classify reports using features extracted from the text of the reports. Objective The aim of this study was to use the information encoded in the Medical Dictionary for Regulatory Activities (MedDRA((R))) in the US Vaccine Adverse Event Reporting System (VAERS) to support and evaluate two classification approaches: a multiple information retrieval strategy and a rule-based approach. To evaluate the performance of these approaches, we selected the conditions of anaphylaxis and Guillain-Barr, syndrome (GBS). Methods We used MedDRA((R)) Preferred Terms stored in the VAERS, and two standardized medical terminologies: the Brighton Collaboration (BC) case definitions and Standardized MedDRA((R)) Queries (SMQ) to classify two sets of reports for GBS and anaphylaxis. Two approaches were used: (i) the rule-based instruments that are available by the two terminologies (the Automatic Brighton Classification [ABC] tool and the SMQ algorithms); and (ii) the vector space model. Results We found that the rule-based instruments, particularly the SMQ algorithms, achieved a high degree of specificity; however, there was a cost in terms of sensitivity in all but the narrow GBS SMQ algorithm that outperformed the remaining approaches (sensitivity in the testing set was equal to 99.06 % for this algorithm vs. 93.40 % for the vector space model). In the case of anaphylaxis, the vector space model achieved higher sensitivity compared with the best values of both the ABC tool and the SMQ algorithms in the testing set (86.44 % vs. 64.11 % and 52.54 %, respectively). Conclusions Our results showed the superiority of the vector space model over the existing rule-based approaches irrespective of the standardized medical knowledge represented by either the SMQ or the BC case definition. The vector space model might make automation of case definitions for spontaneous report review more efficient than current rule-based approaches, allowing more time for critical assessment and decision making by pharmacovigilance experts.
引用
收藏
页码:573 / 582
页数:10
相关论文
共 20 条
[1]  
[Anonymous], 2008, Introduction to information retrieval
[2]   The Brighton Collaboration: addressing the need for standardized case definitions of adverse events following immunization (AEFI) [J].
Bonhoeffer, J ;
Kohl, K ;
Chen, R ;
Duclos, P ;
Heijbel, H ;
Heininger, U ;
Jefferson, T ;
Loupi, E .
VACCINE, 2002, 21 (3-4) :298-302
[3]   Vaccine adverse event text mining system for extracting features from vaccine safety reports [J].
Botsis, Taxiarchis ;
Buttolph, Thomas ;
Nguyen, Michael D. ;
Winiecki, Scott ;
Woo, Emily Jane ;
Ball, Robert .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2012, 19 (06) :1011-1018
[4]   Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection [J].
Botsis, Taxiarchis ;
Nguyen, Michael D. ;
Woo, Emily Jane ;
Markatou, Marianthi ;
Ball, Robert .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) :631-638
[5]   The Medical Dictionary for Regulatory Activities (MedDRA) [J].
Brown, EG ;
Wood, L ;
Wood, S .
DRUG SAFETY, 1999, 20 (02) :109-117
[6]   Using MedDRA - Implications for risk management [J].
Brown, EG .
DRUG SAFETY, 2004, 27 (08) :591-602
[7]   Use abstracted patient-specific features to assist an information-theoretic measurement to assess similarity between medical cases [J].
Cao, Hui ;
Melton, Genevieve B. ;
Markatou, Marianthi ;
Hripcsak, George .
JOURNAL OF BIOMEDICAL INFORMATICS, 2008, 41 (06) :882-888
[8]  
Food and Drug Administration, GUID IND GOOD PHARM
[9]  
Humphreys BL, 1998, J AM MED INFORM ASSN, V5, P1
[10]  
Lin D., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P296