A framework for regional association rule mining and scoping in spatial datasets

Cited by: 28
Authors
Ding, Wei [1]
Eick, Christoph F. [2]
Yuan, Xiaojing [3]
Wang, Jing [2]
Nicot, Jean-Philippe [4]
Affiliations
[1] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA
[2] Univ Houston, Dept Comp Sci, Houston, TX 77004 USA
[3] Univ Houston, Engn Technol Dept, Houston, TX 77004 USA
[4] Univ Texas Austin, Bur Econ Geol, John A & Katherine G Jackson Sch Geosci, Austin, TX USA
Keywords
Association rule mining and scoping; Region discovery; Clustering; Spatial data mining; KNOWLEDGE;
DOI
10.1007/s10707-010-0111-6
Chinese Library Classification number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
Regional association rule mining and scoping is motivated by two facts: global statistics seldom provide useful insight, and most relationships in spatial datasets are geographically regional rather than global. Furthermore, traditional association rule mining frequently fails to discover regional patterns because their global confidence and/or support is insufficient. In this paper, we systematically study this problem and address the unique challenges of regional association rule mining and scoping: (1) region discovery: how to identify interesting regions from which novel and useful regional association rules can be extracted; (2) regional association rule scoping: how to determine the scope of regional association rules. We investigate the duality between regional association rules and the regions where the associations are valid: interesting regions are identified to seek novel regional patterns, and a regional pattern has a scope, namely the set of regions in which the pattern is valid. In particular, we present a reward-based region discovery framework that employs divisive grid-based supervised clustering for region discovery. We evaluate our approach in a real-world case study that identifies spatial risk patterns from arsenic in the Texas water supply. Our experimental results confirm existing research results in the study of arsenic contamination, and our work yields novel findings to be further explored by domain scientists.
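The point about insufficient global support and confidence can be illustrated with a small sketch. It is not the paper's framework: the region labels, the items "A" and "B", and the helper names `support_confidence` and `by_region` are all hypothetical, and the toy records are invented purely to show how a rule that looks weak over the whole dataset can be strong inside one region.

```python
from collections import defaultdict

def support_confidence(transactions, antecedent, consequent):
    """Return (support, confidence) of the rule antecedent -> consequent."""
    antecedent, consequent = set(antecedent), set(consequent)
    n = len(transactions)
    n_ante = sum(1 for t in transactions if antecedent <= t)
    n_both = sum(1 for t in transactions if (antecedent | consequent) <= t)
    support = n_both / n if n else 0.0
    confidence = n_both / n_ante if n_ante else 0.0
    return support, confidence

def by_region(records):
    """Group (region, itemset) pairs into {region: [itemset, ...]}."""
    groups = defaultdict(list)
    for region, items in records:
        groups[region].append(items)
    return groups

# Hypothetical toy data: each record is (region label, set of items).
records = (
    [("north", {"A", "B"})] * 3
    + [("north", {"A"})]
    + [("south", {"A"})] * 4
    + [("south", {"C"})] * 2
)

# Rule A -> B evaluated over the whole dataset, then per region.
global_stats = support_confidence([items for _, items in records], {"A"}, {"B"})
regional_stats = {
    region: support_confidence(items, {"A"}, {"B"})
    for region, items in by_region(records).items()
}

print("global:", global_stats)
for region, stats in regional_stats.items():
    print(region, stats)
```

With a minimum confidence threshold of, say, 0.6, the rule would be rejected globally in this toy data yet accepted in the "north" region, which is the kind of regional pattern the abstract says global mining tends to miss.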
Pages: 1-28
Number of pages: 28