Data mining techniques on satellite images for discovery of risk areas

被引:33
作者
Traore, Boukaye Boubacar [1 ,2 ]
Kamsu-Foguem, Bernard [1 ]
Tangara, Fana [2 ]
机构
[1] Univ Toulouse, ENIT, LGP, EA 1905, 47 Ave Azereix,BP 1629, F-65016 Tarbes, France
[2] Univ Sci Tech & Technol Bamako, Fac Sci & Tech, Ctr Calcul Modelisat & Simulat, Ancien Lycee Badala BP 28 11 FAST 223, Bamako, Mali
关键词
Data mining; Discretization; Remote sensing; Risk identification; Knowledge; TIME-SERIES; KNOWLEDGE DISCOVERY; REMOTE; CLASSIFICATION; EPIDEMICS; ALGORITHM; CHOLERA; BIOMASS; ASSOCIATION; EXPERIENCE;
D O I
10.1016/j.eswa.2016.10.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The high rates of cholera epidemic mortality in less developed countries is a challenge for health facilities to which it is necessary to equip itself with the epidemiological surveillance. To strengthen the capacity of epidemiological surveillance, this paper focuses on remote sensing satellite data processing using data mining methods to discover risk areas of the epidemic disease by connecting the environment, climate and health. These satellite data are combined with field data collected during the same set of periods in order to explain and deduct the causes of the epidemic evolution from one period to another in relation to the environment. The existing technical (algorithms) for processing satellite images are mature and efficient, so the challenge today is to provide the most suitable means allowing the best interpretation of obtained results. For that, we focus on supervised classification algorithm to process a set of satellite images from the same area but on different periods. A novel research methodology (describing pre-treatment, data mining, and post-treatment) is proposed to ensure suitable means for transforming data, generating information and extracting knowledge. This methodology consists of six phases: (1.A) Acquisition of information from the field about epidemic, (1.B) Satellite data acquisition, (2) Selection and transformation of data (Data derived from images), (3) Remote sensing measurements, (4) Discretization of data, (5) Data treatment, and (6) Interpretation of results. The main contributions of the paper are: to establish the nature of links between the environment and the epidemic, and to highlight those risky environments when the public awareness of the problem and the prevention policies are absolutely necessary for mitigation of the propagation and emergence of the epidemic. This will allow national governments, local authorities and the public health officials to effective management according to risk areas. The case study concerns the knowledge discovery in databases related to risk areas of the cholera epidemic in Mopti region, Mali (West Africa). The results generate from data mining association rules indicate that the level of the Niger River in the wintering periods and some societal factors have an impact on the variation of cholera epidemic rate in Mopti town. More the river level is high, at 66% the rate of contamination is high. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:443 / 456
页数:14
相关论文
共 78 条
[1]  
Abkar A.A., 2000, INT J APPL EARTH OBS, V2, P104
[2]  
Ahmad A., 2012, Applied Mathematical Sciences, V6, DOI [10.1016/S0034-4257(01)00311-X, DOI 10.12988/AMS.2013.34214]
[3]   A knowledge discovery process for spatiotemporal data: Application to river water quality monitoring [J].
Alatrista-Salas, H. ;
Aze, J. ;
Bringay, S. ;
Cernesson, F. ;
Selmaoui-Folcher, N. ;
Teisseire, M. .
ECOLOGICAL INFORMATICS, 2015, 26 :127-139
[4]   Land use/cover classification of arid and semi-arid Mediterranean landscapes using Landsat ETM [J].
Alrababah, M. A. ;
Alhamad, M. N. .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2006, 27 (13) :2703-2718
[5]   EVALUATING LANDSAT THEMATIC MAPPER DERIVED VEGETATION INDEXES FOR ESTIMATING ABOVEGROUND BIOMASS ON SEMIARID RANGELANDS [J].
ANDERSON, GL ;
HANSON, JD ;
HAAS, RH .
REMOTE SENSING OF ENVIRONMENT, 1993, 45 (02) :165-175
[6]  
[Anonymous], 1994, P INT C VERY LARGE D
[7]  
Arentze T.A., 2009, International Encyclopedia of Human Geography, P325, DOI DOI 10.1016/B978008044910-4.00524-1
[8]   Leveraging "big data" to enhance the effectiveness of "one health" in an era of health informatics [J].
Asokan G.V. ;
Asokan V. .
Journal of Epidemiology and Global Health, 2015, 5 (4) :311-314
[9]   Modeling the dynamics of mosquito breeding sites vs rainfall in Barkedji area, Senegal [J].
Bicout, Dominique J. ;
Vautrin, Mathias ;
Vignolles, Cecile ;
Sabatier, Philippe .
ECOLOGICAL MODELLING, 2015, 317 :41-49
[10]   Comparing prediction power and stability of broadband and hyperspectral vegetation indices for estimation of green leaf area index and canopy chlorophyll density [J].
Broge, NH ;
Leblanc, E .
REMOTE SENSING OF ENVIRONMENT, 2001, 76 (02) :156-172