Respiratory Diseases, Malaria and Leishmaniasis: Temporal and Spatial Association with Fire Occurrences from Knowledge Discovery and Data Mining

被引:5
|
作者
Schroeder, Lucas [1 ]
Veronez, Mauricio Roberto [1 ]
de Souza, Eniuce Menezes [2 ]
Brum, Diego [1 ]
Gonzaga, Luiz, Jr. [1 ]
Rofatto, Vinicius Francisco [3 ]
机构
[1] Vale Rio Sinos Univ, X Real & Geoinformat Lab, BR-93022750 Sao Leopoldo, Brazil
[2] Univ Estadual Maringa, Dept Stat, BR-87020900 Maringa, Parana, Brazil
[3] Univ Fed Uberlandia, Dept Geog, BR-38408100 Uberlandia, MG, Brazil
关键词
health; fire; big data; Data Mining; Knowledge Discovery from Databases; machine learning; PARTICULATE MATTER; HEALTH IMPACTS; PUBLIC-HEALTH; DEFORESTATION; FOREST; AGREEMENT; AREAS;
D O I
10.3390/ijerph17103718
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The relationship between the fires occurrences and diseases is an essential issue for making public health policy and environment protecting strategy. Thanks to the Internet, today, we have a huge amount of health data and fire occurrence reports at our disposal. The challenge, therefore, is how to deal with 4 Vs (volume, variety, velocity and veracity) associated with these data. To overcome this problem, in this paper, we propose a method that combines techniques based on Data Mining and Knowledge Discovery from Databases (KDD) to discover spatial and temporal association between diseases and the fire occurrences. Here, the case study was addressed to Malaria, Leishmaniasis and respiratory diseases in Brazil. Instead of losing a lot of time verifying the consistency of the database, the proposed method uses Decision Tree, a machine learning-based supervised classification, to perform a fast management and extract only relevant and strategic information, with the knowledge of how reliable the database is. Namely, States, Biomes and period of the year (months) with the highest rate of fires could be identified with great success rates and in few seconds. Then, the K-means, an unsupervised learning algorithms that solves the well-known clustering problem, is employed to identify the groups of cities where the fire occurrences is more expressive. Finally, the steps associated with KDD is perfomed to extract useful information from mined data. In that case, Spearman's rank correlation coefficient, a nonparametric measure of rank correlation, is computed to infer the statistical dependence between fire occurrences and those diseases. Moreover, maps are also generated to represent the distribution of the mined data. From the results, it was possible to identify that each region showed a susceptible behaviour to some disease as well as some degree of correlation with fire outbreak, mainly in the drought period.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] A study on artificial neural network based spatial data mining and knowledge discovery
    Guo, H
    Shen, XJ
    ICEMI 2005: CONFERENCE PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL 4, 2005, : 556 - 561
  • [22] A meteorological conceptual modeling approach based on spatial data mining and knowledge discovery
    Yang, YB
    Lin, H
    Guo, ZY
    Jiang, JX
    INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE, 2005, 3533 : 490 - 499
  • [23] Difference between data mining and knowledge discovery - A view to discovery from knowledge-processing
    Ohsuga, S
    2005 IEEE International Conference on Granular Computing, Vols 1 and 2, 2005, : 7 - 12
  • [24] From data collection to knowledge data discovery: A medical application of data mining
    Duhamel, A
    Picavet, M
    Devos, P
    Beuscart, R
    MEDINFO 2001: PROCEEDINGS OF THE 10TH WORLD CONGRESS ON MEDICAL INFORMATICS, PTS 1 AND 2, 2001, 84 : 1329 - 1333
  • [25] From data collection to knowledge data discovery: A medical application of data mining
    CERIM, Faculté de Médecine, 1 Place de Verdun, 59045 Lille cedex, France
    不详
    Stud. Health Technol. Informatics, 1600, (1329-1333):
  • [26] Knowledge Discovery: Methods from data mining and machine learning
    Shu, Xiaoling
    Ye, Yiwan
    SOCIAL SCIENCE RESEARCH, 2023, 110
  • [27] Open Knowledge Discovery and Data Mining from Patient Forums
    Dirkson, Anne
    Verberne, Suzan
    van Oortmerssen, Gerard
    Gelderblom, Hans
    Kraaij, Wessel
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE 2018), 2018, : 397 - 398
  • [28] Personalized Knowledge Discovery: Mining Novel Association Rules from Text
    Chen, Xin
    Wu, Yi-Fang
    PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 589 - 593
  • [29] Editorial: Data Mining and Statistical Methods for Knowledge Discovery in Diseases Based on Multimodal Omics
    Wang, Tao
    Renteria, Miguel E.
    Peng, Jiajie
    FRONTIERS IN GENETICS, 2022, 13
  • [30] METHODOLOGIES OF KNOWLEDGE DISCOVERY FROM DATA AND DATA MINING METHODS IN MECHANICAL ENGINEERING
    Rogalewicz, Michal
    Sika, Robert
    MANAGEMENT AND PRODUCTION ENGINEERING REVIEW, 2016, 7 (04) : 97 - 108