Mining spatial association rules with geostatistics

被引:0
作者
Chen, Jiangping [1 ]
Tan, Xiaojin [1 ]
机构
[1] Wuhan Univ, Sch Remote Sensing Informat Engn, Wuhan 430079, Hubei, Peoples R China
来源
PROCEEDINGS OF THE 8TH INTERNATIONAL SYMPOSIUM ON SPATIAL ACCURACY ASSESSMENT IN NATURAL RESOURCES AND ENVIRONMENTAL SCIENCES, VOL I: SPATIAL UNCERTAINTY | 2008年
关键词
geographical information science; statistical analysis; spatial autocorrelation; geostatistics; spatial association rule;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In 1962, G. Matheron introduced the term geostatistics to describe a scientific approach to evaluate problems in geology and mining, from ore reserve estimation to grade control. Geostatistics provides statistical methods used to describe spatial relationships among sample data and to apply this analysis to the prediction of spatial and temporal phenomena. They are used to explain spatial patterns and to interpolate values at unsampled locations. Geostatistics have traditionally been used in the sphere of geosciences: meteorology, mining, soil science, forestry, fisheries, remote sensing, and cartography. It later were successfully applied to economics, health, and other disciplines. Currently, it's a trend to integrate powerful methods of geostaitsitcs into a geographic information system (GIS). This paper put forward a new algorithm of mining association rules with geostatistics in analyzing the epidemic problem. A key feature of epidemic data is their location in a space-time continuum. Geostatistics is independent of mean variance relationship and therefore can be used to verify more traditional methods of evaluation inner spatial structure. During structural analysis, spatial autocorrelation can be analyzed using covariance and semivariogram. With structural analysis predictions at unsampled locations can be made using geostatistic method such as kriging (i.e. multiple linear regression in a spatial context). Geostatistical analysis can interpret statistical distributions of data and also examine spatial relationships. It is capable of revealing how cohesion values vary over distance, and of predicting areas of high and low cohesion values. The geostatistics software provides tools for capturing maximum information on a phenomenon from sparse, often biased, and often under-sampled data. It is a good method for spatial data mining by taking account of the autocorrelation between the spatial data. In this paper, the first step is to use the geostatistics methods such as kriging, Spatial Autoregressive Model (SAR) to analyse and estimate the correlation of the land use/cover change and hay fever incidence. Then build a spatial autocorrelation model and then use the model to mining the spatial association rules. We can get the spatial frequency items from the autocorrelation Model. This replaces the repeated scanning of the spatial database by the measure of conventional spatial association rules mining. From the result of the example, the method is more quick and efficient than the traditional data mining algorithm Apriori.
引用
收藏
页码:369 / 376
页数:8
相关论文
共 20 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
BOULICAUT JF, 2000, 200019 INSA I NAT SC
[3]  
BRIN S, 1997, P ACM SIGMOD INT C M, P265
[4]  
Chaudhuri S., 1997, SIGMOD Record, V26, P65, DOI 10.1145/248603.248616
[5]  
CHEN JP, 2006, GEOINFORMATICS
[6]  
CHEUNG D, 1996, P 1996 INT C DAT ENG
[7]  
Chiles J.-P., 2009, GEOSTATISTICS MODELI, V497
[8]  
DAO S, 1995, P 1 INT C KNOWL DISC, P63
[9]  
Ester M., 1996, P 2 INT C KNOWL DISC, P226, DOI DOI 10.5555/3001460.3001507
[10]   The KDD process for extracting useful knowledge from volumes of data [J].
Fayyad, U ;
PiatetskyShapiro, G ;
Smyth, P .
COMMUNICATIONS OF THE ACM, 1996, 39 (11) :27-34