A method for efficient clustering of spatial data in network space

被引：7

作者：

Nguyen, Trang T. D. ^{[1
]}

Nguyen, Loan T. T. ^{[2
,3
]}

Anh Nguyen ^{[4
]}

Yun, Unil ^{[5
]}

Bay Vo ^{[6
]}

机构：

[1] Nha Trang Univ, Fac Informat Technol, Nha Trang, Vietnam

[2] Int Univ, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam

[3] Vietnam Natl Univ, Ho Chi Minh City, Vietnam

[4] Wroclaw Univ Sci & Technol, Dept Appl Informat, Wroclaw, Poland

[5] Sejong Univ, Dept Comp Engn, Seoul, South Korea

[6] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam

来源：

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2021年 / 40卷 / 06期

关键词：

Spatial data mining; spatial data clustering; NS-DBSCAN; network spatial analysis; FAST SEARCH; ALGORITHM; DBSCAN; FIND;

D O I：

10.3233/JIFS-202806

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Spatial clustering is one of the main techniques for spatial data mining and spatial data analysis. However, existing spatial clustering methods primarily focus on points distributed in planar space with the Euclidean distance measurement. Recently, NS-DBSCAN has been developed to perform clustering of spatial point events in Network Space based on a well-known clustering algorithm, named Density-Based Spatial Clustering of Applications with Noise (DBSCAN). The NS-DBSCAN algorithm has efficiently solved the problem of clustering network constrained spatial points. When compared to the NC_DT (Network-Constraint Delaunay Triangulation) clustering algorithm, the NS-DBSCAN algorithm efficiently solves the problem of clustering network constrained spatial points by visualizing the intrinsic clustering structure of spatial data by constructing density ordering charts. However, the main drawback of this algorithm is when the data are processed, objects that are not specifically categorized into types of clusters cannot be removed, which is undeniably a waste of time, particularly when the dataset is large. In an attempt to have this algorithm work with great efficiency, we thus recommend removing edges that are longer than the threshold and eliminating low-density points from the density ordering table when forming clusters and also take other effective techniques into consideration. In this paper, we develop a theorem to determine the maximum length of an edge in a road segment. Based on this theorem, an algorithm is proposed to greatly improve the performance of the density-based clustering algorithm in network space (NS-DBSCAN). Experiments using our proposed algorithm carried out in collaboration with Ho Chi Minh City, Vietnam yield the same results but shows an advantage of it over NS-DBSCAN in execution time.

引用

页码：11653 / 11670

页数：18

共 50 条

[1] An efficient topological-based clustering method on spatial data in network space
Nguyen, Trang T. D.
Nguyen, Loan T. T.
Bui, Quang-Thinh
Yun, Unil
Vo, Bay
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 215
[2] NS-IDBSCAN: An efficient incremental clustering method for geospatial data in network space
Nguyen, Trang T. D.
Nguyen, Loan T. T.
Bui, Quang-Thinh
Duy, Le Nhat
Vo, Bay
INFORMATION SCIENCES, 2025, 690
[3] Efficient strategies for spatial data clustering using topological relations
Nguyen, Trang T. D.
Nguyen, Loan T. T.
Bui, Quang-Thinh
Duy, Le Nhat
Pedrycz, Witold
Vo, Bay
APPLIED INTELLIGENCE, 2025, 55 (02)
[4] CLARANS: A method for clustering objects for spatial data mining
Ng, RT
Han, JW
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (05) : 1003 - 1016
[5] Clustering Algorithms for Spatial Big Data
Schoier, Gabriella
Gregorio, Caterina
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2017, PT IV, 2017, 10407 : 571 - 583
[6] Parallel and distributed clustering framework for big spatial data mining
Bendechache, Malika
Tari, A-Kamel
Kechadi, M-Tahar
INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2019, 34 (06) : 671 - 689
[7] Data Clustering Method Using Efficient Fuzzifier Values Derivation
Cho, Jaehyuk
Joo, Wonhee
IEEE ACCESS, 2020, 8 : 124624 - 124632
[8] STiMR k-Means: An Efficient Clustering Method for Big Data
Ben HajKacem, Mohamed Aymen
Ben N'cir, Chiheb-Eddine
Essoussi, Nadia
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (08)
[9] HYBRIDIZATION OF MAGNETIC CHARGE SYSTEM SEARCH METHOD FOR EFFICIENT DATA CLUSTERING
Kumar, Yugal
Sahoo, G.
MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2018, 31 (02) : 108 - 129
[10] Efficient Large Scale Clustering based on Data Partitioning
Bendechache, Malika
Le-Khac, Nhien-An
Kechadi, M-Tahar
PROCEEDINGS OF 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, (DSAA 2016), 2016, : 612 - 621

← 1 2 3 4 5 →