A method for efficient clustering of spatial data in network space

被引:7
|
作者
Nguyen, Trang T. D. [1 ]
Nguyen, Loan T. T. [2 ,3 ]
Anh Nguyen [4 ]
Yun, Unil [5 ]
Bay Vo [6 ]
机构
[1] Nha Trang Univ, Fac Informat Technol, Nha Trang, Vietnam
[2] Int Univ, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam
[3] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[4] Wroclaw Univ Sci & Technol, Dept Appl Informat, Wroclaw, Poland
[5] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[6] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
关键词
Spatial data mining; spatial data clustering; NS-DBSCAN; network spatial analysis; FAST SEARCH; ALGORITHM; DBSCAN; FIND;
D O I
10.3233/JIFS-202806
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatial clustering is one of the main techniques for spatial data mining and spatial data analysis. However, existing spatial clustering methods primarily focus on points distributed in planar space with the Euclidean distance measurement. Recently, NS-DBSCAN has been developed to perform clustering of spatial point events in Network Space based on a well-known clustering algorithm, named Density-Based Spatial Clustering of Applications with Noise (DBSCAN). The NS-DBSCAN algorithm has efficiently solved the problem of clustering network constrained spatial points. When compared to the NC_DT (Network-Constraint Delaunay Triangulation) clustering algorithm, the NS-DBSCAN algorithm efficiently solves the problem of clustering network constrained spatial points by visualizing the intrinsic clustering structure of spatial data by constructing density ordering charts. However, the main drawback of this algorithm is when the data are processed, objects that are not specifically categorized into types of clusters cannot be removed, which is undeniably a waste of time, particularly when the dataset is large. In an attempt to have this algorithm work with great efficiency, we thus recommend removing edges that are longer than the threshold and eliminating low-density points from the density ordering table when forming clusters and also take other effective techniques into consideration. In this paper, we develop a theorem to determine the maximum length of an edge in a road segment. Based on this theorem, an algorithm is proposed to greatly improve the performance of the density-based clustering algorithm in network space (NS-DBSCAN). Experiments using our proposed algorithm carried out in collaboration with Ho Chi Minh City, Vietnam yield the same results but shows an advantage of it over NS-DBSCAN in execution time.
引用
收藏
页码:11653 / 11670
页数:18
相关论文
共 50 条
  • [1] An efficient topological-based clustering method on spatial data in network space
    Nguyen, Trang T. D.
    Nguyen, Loan T. T.
    Bui, Quang-Thinh
    Yun, Unil
    Vo, Bay
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 215
  • [2] NS-IDBSCAN: An efficient incremental clustering method for geospatial data in network space
    Nguyen, Trang T. D.
    Nguyen, Loan T. T.
    Bui, Quang-Thinh
    Duy, Le Nhat
    Vo, Bay
    INFORMATION SCIENCES, 2025, 690
  • [3] Efficient strategies for spatial data clustering using topological relations
    Nguyen, Trang T. D.
    Nguyen, Loan T. T.
    Bui, Quang-Thinh
    Duy, Le Nhat
    Pedrycz, Witold
    Vo, Bay
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [4] CLARANS: A method for clustering objects for spatial data mining
    Ng, RT
    Han, JW
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (05) : 1003 - 1016
  • [5] Clustering Algorithms for Spatial Big Data
    Schoier, Gabriella
    Gregorio, Caterina
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2017, PT IV, 2017, 10407 : 571 - 583
  • [6] Parallel and distributed clustering framework for big spatial data mining
    Bendechache, Malika
    Tari, A-Kamel
    Kechadi, M-Tahar
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2019, 34 (06) : 671 - 689
  • [7] Data Clustering Method Using Efficient Fuzzifier Values Derivation
    Cho, Jaehyuk
    Joo, Wonhee
    IEEE ACCESS, 2020, 8 : 124624 - 124632
  • [8] STiMR k-Means: An Efficient Clustering Method for Big Data
    Ben HajKacem, Mohamed Aymen
    Ben N'cir, Chiheb-Eddine
    Essoussi, Nadia
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (08)
  • [9] HYBRIDIZATION OF MAGNETIC CHARGE SYSTEM SEARCH METHOD FOR EFFICIENT DATA CLUSTERING
    Kumar, Yugal
    Sahoo, G.
    MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2018, 31 (02) : 108 - 129
  • [10] Efficient Large Scale Clustering based on Data Partitioning
    Bendechache, Malika
    Le-Khac, Nhien-An
    Kechadi, M-Tahar
    PROCEEDINGS OF 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, (DSAA 2016), 2016, : 612 - 621