Segmentation Region Density Clustering Algorithm Based on Minimum Spanning Tree

被引:0
|
作者
Li J. [1 ]
Liang Y. [1 ,2 ]
机构
[1] College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao
[2] Provincial Key Laboratory for Information Technology of Wisdom Mining of Shandong Province, Qingdao
关键词
Density clustering; Imbalanced data set; Minimum spanning tree;
D O I
10.3724/SP.J.1089.2019.17716
中图分类号
学科分类号
摘要
To solve the problem that the traditional density clustering algorithms have poor adaptability to imbalanced data sets due to the use of global variables, a density clustering algorithm based on minimum spanning tree is proposed. Firstly, a data set density peaks calculation is used to estimate global density. Secondly, density clustering aims to separate the high-density clusters and low-density area. Thirdly, the minimum spanning tree is constructed and segmented to mining the associations within low density areas, and construct interconnection between high density areas and low density areas. Finally, compute all clusters' density as feature of merging the clusters, and obtain the result. This algorithm combines the knowledge of graph theory, processing the data set by segmentation and combination according to density feature, so that overcomes the limitations of traditional density clustering algorithms. By selecting multiple imbalanced artificial data sets and UCI data sets for test, we verify the effectiveness and robustness of this algorithm. © 2019, Beijing China Science Journal Publishing Co. Ltd. All right reserved.
引用
收藏
页码:1628 / 1635
页数:7
相关论文
共 22 条
  • [1] Sun J., Liu J., Zhao L., Clustering algorithms research, Journal of Software, 19, 1, pp. 48-61, (2008)
  • [2] Agarwal S., Data mining: data mining concepts and techniques, Proceedings of International Conference on Machine Intelligence and Research Advancement, pp. 203-207, (2013)
  • [3] Zhou T., Lu H., Clustering algorithm research advances on data mining, Computer Engineering and Applications, 48, 12, pp. 100-111, (2012)
  • [4] Bai X.Y., Yang P.L., Shi X.H., An overlapping community detection algorithm based on density peaks, Neurocomputing, 226, pp. 7-15, (2017)
  • [5] Chu Y., Luo X., Qu K., Et al., DBSCAN and K-means hybrid clustering based automatic dental feature detection, Journal of Computer-Aided Design & Computer Graphics, 30, 7, pp. 102-109, (2018)
  • [6] Hu M., Liu H., Zhang D., DBSCAN algorithm optimization and its application in GSM-R tower monitoring system, Computer Application and Software, 35, 5, pp. 308-312, (2018)
  • [7] Ji G., Zhao B., Research progress in mining for big spatio-temporal trajectories, Journal of Data Acquisition and Processing, 30, 1, pp. 47-58, (2015)
  • [8] Huang G., Wu C., Lyu N., A study of laser radar object detection based on improved DBSCAN algorithm, Journal of Transportation Information and Safety, 33, 3, pp. 23-28, (2015)
  • [9] Zhang H., Tan H., Qian Y., Et al., Chinese text deception detection based on ensemble learning, Journal of Computer Research and Development, 52, 5, pp. 1005-1013, (2015)
  • [10] Ester M., Kriegel H.P., Sander J., Et al., A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise, Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining, pp. 226-231, (1996)