Knowledge discovery in open data for epidemic disease prediction

被引:4
|
作者
Wu, ChienHsing [1 ]
Kao, Shu-Chen [2 ]
机构
[1] Natl Univ Kaohsiung, Dept Informat Management, 700 Kaohsiung Univ Rd, Kaohsiung 81148, Taiwan
[2] Kun Shan Univ, Dept Informat Management, 195 Kunda Rd, Tainan, Taiwan
关键词
Open data; Knowledge extraction; Dengue; Influenza; Enterovirus; Google trends; Indexing; Knowledge discovery; Epidemic diseases; Health care; DENGUE-FEVER; SPATIOTEMPORAL PATTERNS; DECISION TREE; MODEL; IDENTIFICATION; ASSOCIATION; INFLUENZA; CLIMATE; IMPACT; HUMIDITY;
D O I
10.1016/j.hlpt.2021.01.001
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective: The research reveals the determinants associated with the epidemic diseases (dengue, influenza, and enterovirus) in Taiwan. It demonstrates the value of open data in prediction model development to support policymaking in the domain of public health care. Method: A knowledge discovery technique was employed to extract determinants from open data on epidemic diseases. The open dataset collected and integrated from Taiwan's Center for Disease Control, the Center Weather Bureau, and Google Trends includes 70,915 dengue, 34,062 enterovirus, and 52,908 influenza cases. A prediction model using the classification-oriented extraction mechanism was applied to open epidemic data, climate data, and Google Trends data. Prediction models that either included or did not include Google Trends data were compared. Prediction accuracy and simplicity of the decision rules are presented. Results: Prediction accuracy and simplicity of three diseases is acceptable when Google Trends is excluded but is slightly different when Google Trends is considered. Location (county) holds the main predictor of the three epidemic diseases. Time (month) presents the second-highest determinant for dengue, and age shows remarkable determinant for enterovirus and influenza. Mean temperature exhibits the highest entropy for dengue, time for enterovirus, and humidity for influenza. Conclusions: The number of confirmed cases for all three epidemic diseases cannot be predicted by a single variable. Knowledge extraction using the classification-oriented technique can be successfully applied in prediction model development. Google Trends data reveal a remarkable but inconsistent role in predicting three epidemic diseases with respect to prediction accuracy and simplicity of the generated decision tree. (c) 2021 Fellowship of Postgraduate Medicine. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:126 / 134
页数:9
相关论文
共 50 条
  • [1] Open data mining for Taiwan's dengue epidemic
    Wu, ChienHsing
    Kao, Shu-Chen
    Shih, Chia-Hung
    Kan, Meng-Hsuan
    ACTA TROPICA, 2018, 183 : 1 - 7
  • [2] A Systematic Mapping Study of Data Preparation in Heart Disease Knowledge Discovery
    Benhar, H.
    Idri, A.
    Fernandez-Aleman, J. L.
    JOURNAL OF MEDICAL SYSTEMS, 2019, 43 (01)
  • [3] Knowledge discovery of drug data on the example of adverse reaction prediction
    Pinar Yildirim
    Ljiljana Majnarić
    Ozgur Ilyas Ekmekci
    Andreas Holzinger
    BMC Bioinformatics, 15
  • [4] A Big Data-Driven Intelligent Knowledge Discovery Method for Epidemic Spreading Paths
    Zhang, Yibo
    Zhang, Jierui
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (11)
  • [5] Prediction Model of Sports Results Base on Knowledge Discovery in Data - base
    Zhao, Baojin
    Chen, Lei
    2016 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA 2016), 2016, : 288 - 291
  • [6] Knowledge Discovery, Analysis And Prediction in Healthcare using Data Mining And Analytics
    Raul, Akshay
    Patil, Atharva
    Raheja, Prem
    Sawant, Rupali
    PROCEEDINGS ON 2016 2ND INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), 2016, : 475 - 478
  • [7] Fuzzy Knowledge Discovery from Time Series Data for Events Prediction
    Gholami, Ehsanollah
    Borujerdi, Mohammadreza Matash
    PRICAI 2008: TRENDS IN ARTIFICIAL INTELLIGENCE, 2008, 5351 : 646 - 657
  • [8] Knowledge Discovery in the Prediction of Bankruptcy
    Almeida, R. J.
    Vieira, S.
    Milea, V.
    Kaymak, U.
    Sousa, J. M. C.
    PROCEEDINGS OF THE JOINT 2009 INTERNATIONAL FUZZY SYSTEMS ASSOCIATION WORLD CONGRESS AND 2009 EUROPEAN SOCIETY OF FUZZY LOGIC AND TECHNOLOGY CONFERENCE, 2009, : 1785 - 1790
  • [9] Fiscal Knowledge Discovery in Municipalities of Athens and Thessaloniki via Linked Open Data
    Koupidis, Kleanthis
    Bratsas, Charalampos
    Karampatakis, Sotiris
    Martzopoulou, Anastasia
    Antoniou, Ioannis
    2016 11TH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION AND PERSONALIZATION (SMAP), 2016, : 171 - 176
  • [10] Knowledge Discovery for A Temporal Prediction Rules
    TIAN Yuan1
    2.College of Business and Administration
    电脑知识与技术, 2007, (10) : 1067 - 1067