Knowledge discovery in open data for epidemic disease prediction

被引:4
|
作者
Wu, ChienHsing [1 ]
Kao, Shu-Chen [2 ]
机构
[1] Natl Univ Kaohsiung, Dept Informat Management, 700 Kaohsiung Univ Rd, Kaohsiung 81148, Taiwan
[2] Kun Shan Univ, Dept Informat Management, 195 Kunda Rd, Tainan, Taiwan
关键词
Open data; Knowledge extraction; Dengue; Influenza; Enterovirus; Google trends; Indexing; Knowledge discovery; Epidemic diseases; Health care; DENGUE-FEVER; SPATIOTEMPORAL PATTERNS; DECISION TREE; MODEL; IDENTIFICATION; ASSOCIATION; INFLUENZA; CLIMATE; IMPACT; HUMIDITY;
D O I
10.1016/j.hlpt.2021.01.001
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective: The research reveals the determinants associated with the epidemic diseases (dengue, influenza, and enterovirus) in Taiwan. It demonstrates the value of open data in prediction model development to support policymaking in the domain of public health care. Method: A knowledge discovery technique was employed to extract determinants from open data on epidemic diseases. The open dataset collected and integrated from Taiwan's Center for Disease Control, the Center Weather Bureau, and Google Trends includes 70,915 dengue, 34,062 enterovirus, and 52,908 influenza cases. A prediction model using the classification-oriented extraction mechanism was applied to open epidemic data, climate data, and Google Trends data. Prediction models that either included or did not include Google Trends data were compared. Prediction accuracy and simplicity of the decision rules are presented. Results: Prediction accuracy and simplicity of three diseases is acceptable when Google Trends is excluded but is slightly different when Google Trends is considered. Location (county) holds the main predictor of the three epidemic diseases. Time (month) presents the second-highest determinant for dengue, and age shows remarkable determinant for enterovirus and influenza. Mean temperature exhibits the highest entropy for dengue, time for enterovirus, and humidity for influenza. Conclusions: The number of confirmed cases for all three epidemic diseases cannot be predicted by a single variable. Knowledge extraction using the classification-oriented technique can be successfully applied in prediction model development. Google Trends data reveal a remarkable but inconsistent role in predicting three epidemic diseases with respect to prediction accuracy and simplicity of the generated decision tree. (c) 2021 Fellowship of Postgraduate Medicine. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:126 / 134
页数:9
相关论文
共 50 条
  • [41] Grouting knowledge discovery based on data mining
    Liu, Qian
    Xiao, Fei
    Zhao, Zhiye
    TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2020, 95
  • [42] A New Survey On knowledge Discovery And Data Mining
    Mhamdi, Faouzi
    Elloumi, Mourad
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN INFORMATION SCIENCE: RCIS 2008, 2007, : 427 - +
  • [43] Data mining and knowledge discovery in databases - An overview
    MacKinnon, MJ
    Glick, N
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 1999, 41 (03) : 255 - 275
  • [45] Knowledge discovery in data sets with graded attributes
    Glodeanu, Cynthia Vera
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2016, 45 (02) : 232 - 249
  • [46] Knowledge Discovery Processing and Data Mining in Karyometry
    Bartels, Peter H.
    Montironi, Rodolfo
    Scarpelli, Marina
    Bartels, Hubert G.
    Alberts, David S.
    ANALYTICAL AND QUANTITATIVE CYTOLOGY AND HISTOLOGY, 2009, 31 (03): : 125 - 136
  • [47] Sampling and Evaluating the Big Data for Knowledge Discovery
    Sung, Andrew H.
    Ribeiro, Bernardete
    Liu, Qingzhong
    IOTBD: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND BIG DATA, 2016, : 378 - 382
  • [48] A knowledge discovery technique for heterogeneous data sources
    Shi, BS
    Shen, XJ
    Liu, ZT
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 293 - 298
  • [49] Knowledge discovery based on multidisciplinary simulation data
    Hu, Jie
    Yin, Ji-Long
    Peng, Ying-Hong
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1108 - +
  • [50] Application of Ontologies for Knowledge discovery in data mining
    Branquinho, Lucelia Pinto
    Abrantes Baracho, Renata Maria
    Almeida, Mauricio Barcellos
    ABAKOS, 2015, 4 (01): : 20 - 33