DACA: Distributed adaptive grid decision graph based clustering algorithm

被引:1
|
作者
He, Jing [1 ]
Zhou, Jun [1 ]
Wang, Haoyu [1 ]
Cai, Li [1 ]
机构
[1] Yunnan Univ, Natl Pilot Sch Software, Kunming, Yunnan, Peoples R China
来源
SOFTWARE-PRACTICE & EXPERIENCE | 2022年 / 52卷 / 05期
关键词
adaptive grid division; clustering algorithms; decision graphs; distributed; KD-tree; BIG DATA;
D O I
10.1002/spe.3060
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Clustering algorithms play a very important role in machine learning. With the development of big-data artificial intelligence, distributed parallel algorithms have become an important research field. To reduce the computational complexity and running time of large-scale datasets in the clustering process, this study proposes a distributed clustering algorithm DACA (distributed adaptive grid decision graph based clustering algorithm). In a distributed environment, DACA uses relative entropy to adaptively mesh the data to form an obvious sparse grid and dense grid. Then, the decision graph is used to determine the cluster center mesh object. Finally, the KD-tree is used to accelerate the determination of the cluster center of sparse points to complete clustering. The algorithm is implemented using the popular Apache Spark computing framework, compared with other distributed clustering algorithms, DACA can adaptively divide the grid according to the data distribution to obtain better clustering effect. At the same time, KD tree algorithm is used to speed up the decision-making of clustering center. Numerous experiments show that the DACA algorithm has excellent performance and accuracy on six standard datasets and real GPS trajectory datasets.
引用
收藏
页码:1199 / 1215
页数:17
相关论文
共 50 条
  • [1] A robust spectral clustering algorithm based on grid-partition and decision-graph
    Wang, Lijuan
    Ding, Shifei
    Wang, Yanru
    Ding, Ling
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (05) : 1243 - 1254
  • [2] A robust spectral clustering algorithm based on grid-partition and decision-graph
    Lijuan Wang
    Shifei Ding
    Yanru Wang
    Ling Ding
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 1243 - 1254
  • [3] A distributed genetic algorithm for graph-based clustering
    Buza K.
    Buza A.
    Kis P.B.
    Advances in Intelligent and Soft Computing, 2011, 103 : 323 - 331
  • [4] AN ADAPTIVE RANDOM WALK BASED DISTRIBUTED CLUSTERING ALGORITHM
    Bui, Alain
    Kudireti, Abdurusul
    Sohier, Devan
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2012, 23 (04) : 803 - 830
  • [5] Adaptive grid-based forest-like clustering algorithm
    Cheng, Mingchang
    Ma, Tiefeng
    Ma, Lin
    Yuan, Jian
    Yan, Qijing
    NEUROCOMPUTING, 2022, 481 : 168 - 181
  • [6] A Grid Based Clustering Algorithm
    Zhang, Qiang
    2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010,
  • [7] D-SAG: A Distributed Sort-Based Algorithm for Graph Clustering
    Saeed, Yaman A.
    Ismail, Raed A.
    Al-Haj Baddar, Sherenaz W.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2021, 46 (09) : 8665 - 8676
  • [8] D-SAG: A Distributed Sort-Based Algorithm for Graph Clustering
    Yaman A. Saeed
    Raed A. Ismail
    Sherenaz W. Al-Haj Baddar
    Arabian Journal for Science and Engineering, 2021, 46 : 8665 - 8676
  • [9] Distributed Clustering Algorithm for Adaptive Pandemic Control
    Insausti, Xabier
    Zarraga-Rodriguez, Marta
    Nolasco-Ferencikova, Carolina
    Gutierrez-Gutierrez, Jesus
    IEEE ACCESS, 2021, 9 : 160688 - 160696
  • [10] A task partition algorithm based on grid and graph partition for distributed crowd simulation
    Zhou, Wenping
    Tang, Haoxuan
    Ji, Zhenzhou
    2014 FOURTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC), 2014, : 522 - 526