A dataset of 200 structured product labels annotated for adverse drug reactions

被引:37
作者
Demner-Fushman, Dina [1 ]
Shooshan, Sonya E. [1 ]
Rodriguez, Laritza [1 ]
Aronson, Alan R. [1 ]
Lang, Francois [1 ]
Rogers, Willie [1 ]
Roberts, Kirk [2 ]
Tonning, Joseph [3 ]
机构
[1] NIH, US Natl Lib Med, 8600 Rockville Pike, Bethesda, MD 20894 USA
[2] UT Hlth Sch Biomed Informat, 7000 Fannin St, Houston, TX 77030 USA
[3] US FDA, Ctr Drug Evaluat & Res, Off New Drugs, 10001 New Hampshire Ave, Silver Spring, MD 20903 USA
基金
美国国家卫生研究院;
关键词
DATABASE;
D O I
10.1038/sdata.2018.1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Adverse drug reactions (ADRs), unintended and sometimes dangerous effects that a drug may have, are one of the leading causes of morbidity and mortality during medical care. To date, there is no structured machine-readable authoritative source of known ADRs. The United States Food and Drug Administration (FDA) partnered with the National Library of Medicine to create a pilot dataset containing standardised information about known adverse reactions for 200 FDA-approved drugs. The Structured Product Labels (SPLs), the documents FDA uses to exchange information about drugs and other products, were manually annotated for adverse reactions at the mention level to facilitate development and evaluation of text mining tools for extraction of ADRs from all SPLs. The ADRs were then normalised to the Unified Medical Language System (UMLS) and to the Medical Dictionary for Regulatory Activities (MedDRA). We present the curation process and the structure of the publicly available database SPL-ADR-200db containing 5,098 distinct ADRs. The database is available at https://bionlp.nlm.nih.gov/tac2017adversereactions/; the code for preparing and validating the data is available at https://github.com/lhncbc/fda-ars.
引用
收藏
页数:8
相关论文
共 11 条
[1]   An overview of MetaMap: historical perspective and recent advances [J].
Aronson, Alan R. ;
Lang, Francois-Michel .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (03) :229-236
[2]   A standard database for drug repositioning [J].
Brown, Adam S. ;
Patel, Chirag J. .
SCIENTIFIC DATA, 2017, 4
[3]   ADReCS: an ontology database for aiding standardization and hierarchical classification of adverse drug reaction terms [J].
Cai, Mei-Chun ;
Xu, Quan ;
Pan, Yan-Jing ;
Pan, Wen ;
Ji, Nan ;
Li, Yin-Bo ;
Jin, Hai-Jing ;
Liu, Ke ;
Ji, Zhi-Liang .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D907-D913
[4]  
Duke Jon D, 2010, AMIA Annu Symp Proc, V2010, P177
[5]  
Fort Karen., 2010, 4 ACL LING ANN WORKS, P56
[6]   Agreement, the F-measure, and reliability in information retrieval [J].
Hripcsak, G ;
Rothschild, AS .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2005, 12 (03) :296-298
[7]   Scaling drug indication curation through crowdsourcing [J].
Khare, Ritu ;
Burger, John D. ;
Aberdeen, John S. ;
Tresner-Kirsch, David W. ;
Corrales, Theodore J. ;
Hirchman, Lynette ;
Lu, Zhiyong .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2015,
[8]   The SIDER database of drugs and side effects [J].
Kuhn, Michael ;
Letunic, Ivica ;
Jensen, Lars Juhl ;
Bork, Peer .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D1075-D1079
[9]  
Ogren P, 2008, P LANG RES EV C LREC, P28
[10]   Evaluating the effects of machine pre-annotation and an interactive annotation interface on manual de-identification of clinical text [J].
South, Brett R. ;
Mowery, Danielle ;
Suo, Ying ;
Leng, Jianwei ;
Ferrandez, Oscar ;
Meystre, Stephane M. ;
Chapman, Wendy W. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 50 :162-172