Efficient and Intelligent Density and Delta-Distance Clustering Algorithm

被引:0
作者
Xuejuan Liu
Jiabin Yuan
Hanchi Zhao
机构
[1] Nanjing University of Aeronautics and Astronautics,College of Computer Science and Technology
[2] Nanjing Tech University,Pujiang Institute
来源
Arabian Journal for Science and Engineering | 2018年 / 43卷
关键词
LSH; Outlier detection; Density; Delta distance; Clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Density and delta-distance clustering (DDC) is an ideal clustering method that computes the density and delta distance of data. When data derived from the two indicators are large, these areas can be defined as cluster centers. DDC has good clustering performance compared with some other clustering algorithms. However, DDC has a high time complexity and requires manual identification of cluster centers. To fill these gaps, an efficient and intelligent DDC (EIDDC) algorithm is proposed in this study. EIDDC begins from using a sampling method based on locality-sensitive hashing (LSH) to obtain a small-scale dataset. The density and delta distance of each data point are calculated from this dataset to reduce time complexity. Cluster centers are intelligently recognized by utilizing density-based spatial clustering of applications with noise-based outlier detection technology. Experiment results show that LSH can obtain good representatives of the original dataset and that the proposed outlier detection method can recognize the cluster centers of a given dataset. The results also reveal the efficiency of EIDDC.
引用
收藏
页码:7177 / 7187
页数:10
相关论文
共 49 条
[1]  
Gracia CD(2017)Adaptive clustering of embedded multiple web objects for efficient group prefetching Arab. J. Sci. Eng. 42 715-724
[2]  
Sudha S(2013)A segment-based approach to clustering multi-topic documents Knowl. Inf. Syst. 34 563-595
[3]  
Tagarelli A(2017)Fuzzy soft subspace clustering method for gene co-expression network analysis Int. J. Mach. Learn. Cybern. 8 1157-1165
[4]  
Karypis G(2008)Top 10 algorithms in data mining Knowl. Inf. Syst. 14 1-37
[5]  
Wang Q(2002)A clustering me tho d using hierarchical self-organizing maps J. VLSI Signal Process. Syst. Signal Image Video Technol. 32 105-118
[6]  
Chen G(2015)A comprehensive survey of clustering algorithms Ann. Data Sci. 2 165-193
[7]  
Wu X(2014)Clustering by fast search and find of density peaks Science 344 1492-1496
[8]  
Kumar V(2016)A novel ranking-based clustering approach for hyperspectral band selection IEEE Trans. Geosci. Remote Sens. 54 88-102
[9]  
Quinlan JR(2016)Community detection in hypernetwork via density-ordered tree partition Appl. Math. Comput. 276 384-393
[10]  
Masahiro E(2016)A new method to estimate ages of facial image for large database Multimed. Tools Appl. 75 2877-2895