Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure

被引:42
作者
Tafti, Ahmad P. [1 ]
Badger, Jonathan [1 ]
LaRose, Eric [1 ]
Shirzadi, Ehsan [2 ]
Mahnke, Andrea [1 ]
Mayer, John [1 ]
Ye, Zhan [1 ]
Page, David [3 ]
Peissig, Peggy [1 ]
机构
[1] Marshfield Clin Res Inst, Biomed Informat Res Ctr, 1000 N Oak Ave, Marshfield, WI 54449 USA
[2] Inst Elect & Elect Engn, Dublin, Ireland
[3] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI USA
关键词
adverse drug event; adverse drug reaction; drug side effects; machine learning; text mining; HOSPITALIZED-PATIENTS; TEXT CLASSIFICATION; MEDICATION ERRORS; EXTRACTION; METHODOLOGIES; EMERGENCY; SAFETY;
D O I
10.2196/medinform.9170
中图分类号
R-058 [];
学科分类号
摘要
Background: The study of adverse drug events (ADEs) is a tenured topic in medical literature. In recent years, increasing numbers of scientific articles and health-related social media posts have been generated and shared daily, albeit with very limited use for ADE study and with little known about the content with respect to ADEs. Objective: The aim of this study was to develop a big data analytics strategy that mines the content of scientific articles and health-related Web-based social media to detect and identify ADEs. Methods: We analyzed the following two data sources: (1) biomedical articles and (2) health-related social media blog posts. We developed an intelligent and scalable text mining solution on big data infrastructures composed of Apache Spark, natural language processing, and machine learning. This was combined with an Elasticsearch No-SQL distributed database to explore and visualize ADEs. Results: The accuracy, precision, recall, and area under receiver operating characteristic of the system were 92.7%, 93.6%, 93.0%, and 0.905, respectively, and showed better results in comparison with traditional approaches in the literature. This work not only detected and classified ADE sentences from big data biomedical literature but also scientifically visualized ADE interactions. Conclusions: To the best of our knowledge, this work is the first to investigate a big data machine learning strategy for ADE discovery on massive datasets downloaded from PubMed Central and social media. This contribution illustrates possible capacities in big data biomedical text analysis using advanced computational methods with real-time update from new data published on a daily basis.
引用
收藏
页数:17
相关论文
共 56 条
  • [21] Novel Data-Mining Methodologies for Adverse Drug Event Discovery and Analysis
    Harpaz, R.
    DuMouchel, W.
    Shah, N. H.
    Madigan, D.
    Ryan, P.
    Friedman, C.
    [J]. CLINICAL PHARMACOLOGY & THERAPEUTICS, 2012, 91 (06) : 1010 - 1021
  • [22] Text Mining for Adverse Drug Events: the Promise, Challenges, and State of the Art
    Harpaz, Rave
    Callahan, Alison
    Tamang, Suzanne
    Low, Yen
    Odgers, David
    Finlayson, Sam
    Jung, Kenneth
    LePendu, Paea
    Shah, Nigam H.
    [J]. DRUG SAFETY, 2014, 37 (10) : 777 - 790
  • [23] Identifying adverse drug event information in clinical notes with distributional semantic representations of context
    Henriksson, Aron
    Kvist, Maria
    Dalianis, Hercules
    Duneld, Martin
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 57 : 333 - 349
  • [24] Mining clinical text for signals of adverse drug-drug interactions
    Iyer, Srinivasan V.
    Harpaz, Rave
    LePendu, Paea
    Bauer-Mehren, Anna
    Shah, Nigam H.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (02) : 353 - 362
  • [25] Adverse Drug Events After Hospital Discharge in Older Adults: Types, Severity, and Involvement of Beers Criteria Medications
    Kanaan, Abir O.
    Donovan, Jennifer L.
    Duchin, Nerissa P.
    Field, Terry S.
    Tjia, Jennifer
    Cutrona, Sarah L.
    Gagne, Shawn J.
    Garber, Lawrence
    Preusse, Peggy
    Harrold, Leslie R.
    Gurwitz, Jerry H.
    [J]. JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 2013, 61 (11) : 1894 - 1899
  • [26] Knowledge-based extraction of adverse drug events from biomedical text
    Kang, Ning
    Singh, Bharat
    Bui, Chinh
    Afzal, Zubair
    van Mulligen, Erik M.
    Kors, Jan A.
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [27] Incidence of adverse drug reactions in hospitalized patients - A meta-analysis of prospective studies
    Lazarou, J
    Pomeranz, BH
    Corey, PN
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1998, 279 (15): : 1200 - 1205
  • [28] DL-ADR: a novel deep learning model for classifying genomic variants into adverse drug reactions
    Liang, Zhaohui
    Huang, Jimmy Xiangji
    Zeng, Xing
    Zhang, Gang
    [J]. BMC MEDICAL GENOMICS, 2016, 9
  • [29] Lilleberg J, 2015, PROCEEDINGS OF 2015 IEEE 14TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), P136, DOI 10.1109/ICCI-CC.2015.7259377
  • [30] An ensemble method for extracting adverse drug events from social media
    Liu, Jing
    Zhao, Songzheng
    Zhang, Xiaodi
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2016, 70 : 62 - 76