Mining regional co-location patterns with kNNG

被引:56
作者
Qian, Feng [1 ]
Chiew, Kevin [2 ]
He, Qinming [3 ]
Huang, Hao [4 ]
机构
[1] NetEase Inc, Hangzhou R&D Ctr, Hangzhou, Zhejiang, Peoples R China
[2] Tan Tao Univ, Sch Engn, Duc Hoa Dist, Long An Provinc, Vietnam
[3] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310003, Zhejiang, Peoples R China
[4] Natl Univ Singapore, Sch Comp, Singapore 117548, Singapore
基金
中国国家自然科学基金;
关键词
Regional co-location pattern mining; kNNG; Variation coefficient; DATA SETS; ALGORITHMS; DISCOVERY;
D O I
10.1007/s10844-013-0280-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatial co-location pattern mining discovers the subsets of features of which the events are frequently located together in geographic space. The current research on this topic adopts a distance threshold that has limitations in spatial data sets with various magnitudes of neighborhood distances, especially for mining of regional co-location patterns. In this paper, we propose a hierarchical co-location mining framework accounting for both variety of neighborhood distances and spatial heterogeneity. By adopting k-nearest neighbor graph (kNNG) instead of distance threshold, we propose "distance variation coefficient" as a new measure to drive the mining operations and determine an individual neighborhood relationship graph for each region. The proposed mining algorithm outputs a set of regions with each of them an individual set of regional co-location patterns. The experimental results on both synthetic and real world data sets show that our framework is effective to discover these regional co-location patterns.
引用
收藏
页码:485 / 505
页数:21
相关论文
共 37 条
[1]  
Agrawal R., P 20 INT C VERY LARG
[2]  
[Anonymous], 2011, NIPS
[3]  
Arge L., 1998, Proceedings of the Twenty-Fourth International Conference on Very-Large Databases, P570
[4]   Scalable clustering algorithms with balancing constraints [J].
Banerjee, Arindam ;
Ghosh, Joydeep .
DATA MINING AND KNOWLEDGE DISCOVERY, 2006, 13 (03) :365-395
[5]   Connectivity of the mutual k-nearest-neighbor graph in clustering and outlier detection [J].
Brito, MR ;
Chavez, EL ;
Quiroz, AJ ;
Yukich, JE .
STATISTICS & PROBABILITY LETTERS, 1997, 35 (01) :33-42
[6]   Zonal co-location pattern discovery with dynamic parameters [J].
Celik, Mete ;
Kang, James M. ;
Shekhar, Shashi .
ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, :433-438
[7]  
Celik M, 2006, IEEE DATA MINING, P119
[8]  
Ding C., 2004, SAC '04: Proceedings of the 2004 ACM symposium on Applied computing, P584, DOI [DOI 10.1145/967900.968021, 10.1145/967900.968021]
[9]  
Eick ChristophF., 2008, GIS'08: Proc. of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems, P1
[10]  
Friedman J.H., 1977, ACM T MATH SOFTWARE, V3, P290