Respiratory Diseases, Malaria and Leishmaniasis: Temporal and Spatial Association with Fire Occurrences from Knowledge Discovery and Data Mining

被引:5
|
作者
Schroeder, Lucas [1 ]
Veronez, Mauricio Roberto [1 ]
de Souza, Eniuce Menezes [2 ]
Brum, Diego [1 ]
Gonzaga, Luiz, Jr. [1 ]
Rofatto, Vinicius Francisco [3 ]
机构
[1] Vale Rio Sinos Univ, X Real & Geoinformat Lab, BR-93022750 Sao Leopoldo, Brazil
[2] Univ Estadual Maringa, Dept Stat, BR-87020900 Maringa, Parana, Brazil
[3] Univ Fed Uberlandia, Dept Geog, BR-38408100 Uberlandia, MG, Brazil
关键词
health; fire; big data; Data Mining; Knowledge Discovery from Databases; machine learning; PARTICULATE MATTER; HEALTH IMPACTS; PUBLIC-HEALTH; DEFORESTATION; FOREST; AGREEMENT; AREAS;
D O I
10.3390/ijerph17103718
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The relationship between the fires occurrences and diseases is an essential issue for making public health policy and environment protecting strategy. Thanks to the Internet, today, we have a huge amount of health data and fire occurrence reports at our disposal. The challenge, therefore, is how to deal with 4 Vs (volume, variety, velocity and veracity) associated with these data. To overcome this problem, in this paper, we propose a method that combines techniques based on Data Mining and Knowledge Discovery from Databases (KDD) to discover spatial and temporal association between diseases and the fire occurrences. Here, the case study was addressed to Malaria, Leishmaniasis and respiratory diseases in Brazil. Instead of losing a lot of time verifying the consistency of the database, the proposed method uses Decision Tree, a machine learning-based supervised classification, to perform a fast management and extract only relevant and strategic information, with the knowledge of how reliable the database is. Namely, States, Biomes and period of the year (months) with the highest rate of fires could be identified with great success rates and in few seconds. Then, the K-means, an unsupervised learning algorithms that solves the well-known clustering problem, is employed to identify the groups of cities where the fire occurrences is more expressive. Finally, the steps associated with KDD is perfomed to extract useful information from mined data. In that case, Spearman's rank correlation coefficient, a nonparametric measure of rank correlation, is computed to infer the statistical dependence between fire occurrences and those diseases. Moreover, maps are also generated to represent the distribution of the mined data. From the results, it was possible to identify that each region showed a susceptible behaviour to some disease as well as some degree of correlation with fire outbreak, mainly in the drought period.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Data Mining Technique for Knowledge Discovery from Engineering Materials Data Sets
    Doreswamy
    Hemanth, K. S.
    Vastrad, Channabasayya M.
    Nagaraju, S.
    ADVANCES IN COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, PT I, 2011, 131 : 512 - +
  • [32] From Patterns in Data to Knowledge Discovery: What Data Mining Can Do
    Gullo, Francesco
    3RD INTERNATIONAL CONFERENCE FRONTIERS IN DIAGNOSTIC TECHNOLOGIES, ICFDT3 2013, 2015, 62 : 18 - 22
  • [33] Mining time-series association rules from Western Pacific spatial-temporal data
    Ma, Weixuan
    Xue, Cunjin
    Zhou, Junqi
    35TH INTERNATIONAL SYMPOSIUM ON REMOTE SENSING OF ENVIRONMENT (ISRSE35), 2014, 17
  • [34] Mining Local Association Rules from Temporal Data Set
    Mazarbhuiya, Fokrul Alom
    Abulaish, Muhammad
    Mahanta, Anjana Kakoti
    Ahmad, Tanvir
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 255 - +
  • [35] Knowledge Discovery in Web Usage Patterns Using Pageviews and Data Mining Association Rule
    Vijaiprabhu, G.
    Arivazhagan, B.
    Shunmuganathan, N.
    UBIQUITOUS INTELLIGENT SYSTEMS, 2022, 302 : 233 - 247
  • [36] Toxicological knowledge discovery by mining emerging patterns from toxicity data
    Richard Sherhod
    Valerie J Gillet
    Thierry Hanser
    Philip N Judson
    Jonathan D Vessey
    Journal of Cheminformatics, 5 (Suppl 1)
  • [37] A data mining approach to knowledge discovery from multidimensional cube structures
    Usman, Muhammad
    Pears, Russel
    Fong, A. C. M.
    KNOWLEDGE-BASED SYSTEMS, 2013, 40 : 36 - 49
  • [38] On the discovery of interesting patterns from evolving spatial data mining environments
    Al-Hegami, Ahmed Sultan
    Kaur, Harleen
    IMECS 2006: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, 2006, : 342 - +
  • [39] Spatial data mining and geographic knowledge discovery - An introduction (vol 33, pg 403, 2009)
    Guo, Diansheng
    Mennis, Jeremy
    COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2010, 34 (02) : 175 - 175
  • [40] Knowledge discovery of Middle East dust sources using Apriori spatial data mining algorithm
    Papi, Ramin
    Attarchi, Sara
    Boloorani, Ali Darvishi
    Samany, Najmeh Neysani
    ECOLOGICAL INFORMATICS, 2022, 72