Using grid for accelerating density-based clustering

被引:35
|
作者
Mahran, Shaaban [1 ]
Mahar, Khaled [1 ]
机构
[1] Arab Acad Sci & Technol, Alexandria, Egypt
关键词
D O I
10.1109/CIT.2008.4594646
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering analysis is a primary method for data mining. The ever increasing volumes of data in different applications forces clustering algorithms to cope with it. DBSCAN is a well-known algorithm for density-based clustering. It is both effective so it can detect arbitrary shaped clusters of dense regions and efficient especially in existence of spatial indexes to perform the neighborhood queries efficiently. In this paper we introduce a new algorithm GriDBSCAN to enhance the performance of DBSCAN using grid partitioning and merging, yielding a high performance with the advantage of high degree of parallelism. We verified the correctness of the algorithm theoretically and experimentally, studied the performance theoretically and using experiments on both real and synthetic data It proved to run much faster than original DBSCAN. We compared the algorithm with a similar algorithm, Enhanced DBSCAN, which is also an enhancement to DBSCAN using partitioning. Experiments showed the new algorithm's superiority in performance and degree of parallelism.
引用
收藏
页码:35 / 40
页数:6
相关论文
共 50 条
  • [1] Incremental grid density-based clustering algorithm
    Chen, Ning
    Chen, An
    Zhou, Long-Xiang
    Ruan Jian Xue Bao/Journal of Software, 2002, 13 (01): : 1 - 7
  • [2] Detecting crash hotspots using grid and density-based spatial clustering
    Khosrowshahi, Amin Ganjali
    Aghayan, Iman
    Kunt, Mehmet Metin
    Choupani, Abdoul-Ahad
    PROCEEDINGS OF THE INSTITUTION OF CIVIL ENGINEERS-TRANSPORT, 2021, 176 (04) : 200 - 212
  • [3] Combination of Density-Based Spatial Clustering With Grid Search Using Nash Equilibrium
    Kazemi, Uranus
    Soleimani, Seyfollah
    ENGINEERING REPORTS, 2025, 7 (03)
  • [4] Accelerating Density-Based Subspace Clustering in High-Dimensional Data
    Prinzbach, Juergen
    Lauer, Tobias
    Kiefer, Nicolas
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 474 - 481
  • [5] A Grid and Density-based Clustering Algorithm for Processing Data Stream
    Jia, Chen
    Tan, ChengYu
    Yong, Ai
    SECOND INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING: WGEC 2008, PROCEEDINGS, 2008, : 517 - +
  • [6] GRIDBSCAN: GRId density-based spatial clustering of applications with noise
    Uncu, Ozge
    Gruver, William A.
    Kotak, Dilip B.
    Sabaz, Dorian
    Alibhai, Zafeer
    Ng, Colin
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 2976 - +
  • [7] Research on application of grid-based and density-based clustering algorithm
    Shen, LX
    Yan, C
    PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS I AND II, 2003, : 684 - 689
  • [8] Density-based clustering
    Campello, Ricardo J. G. B.
    Kroeger, Peer
    Sander, Jorg
    Zimek, Arthur
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 10 (02)
  • [9] Density-based clustering
    Kriegel, Hans-Peter
    Kroeger, Peer
    Sander, Joerg
    Zimek, Arthur
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (03) : 231 - 240
  • [10] Hierarchical Density-Based Clustering Using MapReduce
    dos Santos, Joelson Antonio
    Syed, Talat Iqbal
    Naldi, Murilo C.
    Campello, Ricardo J. G. B.
    Sander, Joerg
    IEEE TRANSACTIONS ON BIG DATA, 2021, 7 (01) : 102 - 114