Redundancy Reduction for Prevalent Co-Location Patterns

被引:70
作者
Wang, Lizhen [1 ]
Bao, Xuguang [1 ]
Zhou, Lihua [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Dept Comp Sci & Engn, Kunming 650221, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Spatial co-location pattern; redundancy; semantic distance; delta-covered;
D O I
10.1109/TKDE.2017.2759110
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatial co-location pattern mining is an interesting and important task in spatial data mining which discovers the subsets of spatial features frequently observed together in nearby geographic space. However, the traditional framework of mining prevalent co-location patterns produces numerous redundant co-location patterns, which makes it hard for users to understand or apply. To address this issue, in this paper, we study the problem of reducing redundancy in a collection of prevalent co-location patterns by utilizing the spatial distribution information of co-location instances. We first introduce the concept of semantic distance between a co-location pattern and its super-patterns, and then define redundant co-locations by introducing the concept of delta-covered, where delta (0 <= d <= 1) is a coverage measure. We develop two algorithms RRclosed and RRnull to perform the redundancy reduction for prevalent co-location patterns. The former adopts the post-mining framework that is commonly used by existing redundancy reduction techniques, while the latter employs the mine-and-reduce framework that pushes redundancy reduction into the co-location mining process. Our performance studies on the synthetic and real-world data sets demonstrate that our method effectively reduces the size of the original collection of closed co-location patterns by about 50 percent. Furthermore, the RRnull method runs much faster than the related closed co-location pattern mining algorithm.
引用
收藏
页码:142 / 155
页数:14
相关论文
共 26 条
[1]   A generic regional spatio-temporal co-occurrence pattern mining model: a case study for air pollution [J].
Akbari, Mohammad ;
Samadzadegan, Farhad ;
Weibel, Robert .
JOURNAL OF GEOGRAPHICAL SYSTEMS, 2015, 17 (03) :249-274
[2]  
Al-Naymat G., 2007, P 7 IEEE INT C DAT M, P679
[3]   Mining Statistically Significant Co-location and Segregation Patterns [J].
Barua, Sajib ;
Sander, Joerg .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (05) :1185-1199
[4]   Zonal co-location pattern discovery with dynamic parameters [J].
Celik, Mete ;
Kang, James M. ;
Shekhar, Shashi .
ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, :433-438
[5]   Discovering colocation patterns from spatial data sets: A general approach [J].
Huang, Y ;
Shekhar, S ;
Xiong, H .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (12) :1472-1485
[6]  
Jin Soung Yoo, 2011, Proceedings of the 2011 IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services (ICSDM 2011), P100, DOI 10.1109/ICSDM.2011.5969013
[7]   On discovering co-location patterns in datasets: a case study of pollutants and child cancers [J].
Li, Jundong ;
Adilmagambetov, Aibek ;
Jabbar, Mohomed Shazan Mohomed ;
Zaiane, Osmar R. ;
Osornio-Vargas, Alvaro ;
Wine, Osnat .
GEOINFORMATICA, 2016, 20 (04) :651-692
[8]   Mining Competitive Pairs Hidden in Co-location Patterns from Dynamic Spatial Databases [J].
Lu, Junli ;
Wang, Lizhen ;
Fang, Yuan ;
Li, Momo .
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT II, 2017, 10235 :467-480
[9]  
Mielikäinen T, 2003, LECT NOTES ARTIF INT, V2838, P327
[10]  
Mohan Pradeep., 2011, Proceedings of the 19th ACM SIGSPATIAL international conference on advances in geographic information systems, P122