A Novel Algorithm for Efficiently Mining Spatial Multi-Level Co-Location Patterns

被引:2
作者
Li, Junyi [1 ]
Wang, Lizhen [2 ,3 ]
Yang, Peizhong [1 ]
Zhou, Lihua [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650091, Peoples R China
[2] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650091, Peoples R China
[3] Yunnan Univ, Dianchi Coll, Kunming 650091, Peoples R China
基金
中国国家自然科学基金;
关键词
Data mining; Spatial databases; Graphical models; Distribution functions; Particle measurements; Atmospheric measurements; Clustering algorithms; Spatial data mining; multi-level co-location pattern; spatial distribution form; relative distribution coefficient; DISCOVERY;
D O I
10.1109/TKDE.2024.3381178
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The spatial co-location pattern is a collection of spatial features in which instances of features prevalently appear in neighboring spatial regions. Due to the heterogeneity of spatial data distribution, the instances of some patterns appear prevalently in the global region (i.e., Global Prevalent Co-location Patterns, GPCPs), while some patterns are not prevalent globally, and their instances are clustered only in some local regions (i.e., Local Prevalent Co-location Patterns, LPCPs). Multi-level co-location pattern mining aims to mine these two types of patterns simultaneously, but existing methods cannot accurately judge the spatial distribution of patterns in a certain region, leading to unsuitable judgment of both GPCPs and LPCPs. To overcome this problem, this paper firstly proposes the relative distribution coefficient to identify the spatial distribution form of patterns, and provides a more refined way for discovering both GPCPs and LPCPs. Secondly, a novel multi-level co-location pattern mining algorithm is proposed by using the relative distribution coefficient as the interest metrics, and some pruning strategies are suggested to improve the mining efficiency. Finally, extensive experiments are conducted on both real and synthetic datasets to verify the effectiveness and efficiency of the proposed method.
引用
收藏
页码:4361 / 4374
页数:14
相关论文
共 30 条
  • [1] Agrawal Rakesh, 1994, P 20 INT C VER LARG, P487, DOI DOI 10.5555/645920.672836
  • [2] A clique-based approach for co-location pattern mining
    Bao, Xuguang
    Wang, Lizhen
    [J]. INFORMATION SCIENCES, 2019, 490 : 244 - 264
  • [3] Mining Statistically Significant Co-location and Segregation Patterns
    Barua, Sajib
    Sander, Joerg
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (05) : 1185 - 1199
  • [4] Adaptive detection of statistically significant regional spatial co-location patterns
    Cai, Jiannan
    Liu, Qiliang
    Deng, Min
    Tang, Jianbo
    He, Zhanjun
    [J]. COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2018, 68 : 53 - 63
  • [5] Zonal co-location pattern discovery with dynamic parameters
    Celik, Mete
    Kang, James M.
    Shekhar, Shashi
    [J]. ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 433 - 438
  • [6] Multi-level method for discovery of regional co-location patterns
    Deng, Min
    Cai, Jiannan
    Liu, Qiliang
    He, Zhanjun
    Tang, Jianbo
    [J]. INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2017, 31 (09) : 1846 - 1870
  • [7] Eick Christoph F., 2008, GIS'08: Proc. of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems, P1
  • [8] Computing Co-Location Patterns in Spatial Data with Extended Objects: A Scalable Buffer-Based Approach
    Ge, Yong
    Yao, Zijun
    Li, Huayu
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (02) : 401 - 414
  • [9] Han Y., 2021, 2021 IEEE 4 INT C EL, P420
  • [10] Discovering colocation patterns from spatial data sets: A general approach
    Huang, Y
    Shekhar, S
    Xiong, H
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (12) : 1472 - 1485