Granular K-means Clustering Algorithm

被引:0
作者
Zhou, Chenglong [1 ]
Chen, Yuming [1 ]
Zhu, Yidong [1 ]
机构
[1] College of Computer and Information Engineering, Xiamen University of Technology, Fujian, Xiamen
关键词
granular clustering; granular computing; granular distance; K-means clustering; unsupervised learning;
D O I
10.3778/j.issn.1002-8331.2211-0072
中图分类号
学科分类号
摘要
Abstrac:K-means clustering belongs to unsupervised learning, which has the advantages of simple application, strong interpretability and a good clustering effect. However, it has slow convergence speed, parameters are difficult to determine, and it is easy to fall into a local solution. In order to overcome the inherent defects of K-means clustering, combined with granular computing theory, this paper proposes a new clustering model: granular K-means clustering algorithm. Samples are granulated into granules on a single feature, and the granules on multi-dimensional features forms a granular vector. Several granular distances are further defined to measure the distance between granules. Then, a granular K-means clustering method based on the granular distance is proposed, and its clustering algorithm is also designed. The granulation aims to compare the similarity in all sample spaces, which reflects the global characteristics of samples and makes the clustering converge easily with fewer iterations. Finally, experiments are carried out on several biological datasets to compare the convergence speed, K-value influence and clustering effect. The results show that the proposed granular Kmeans clustering method has the advantages of fast convergence speed and a good clustering effect. © 2023 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:317 / 324
页数:7
相关论文
共 25 条
[1]  
ZADEH L A., Fuzzy sets and information granularity[C], Advances in Fuzzy Set Theory and Applications, pp. 3-18, (1979)
[2]  
LIN T Y., Granular computing on binary relations I:data mining and neighborhood systems[J], Rough Sets in Knowledge Discovery, 2, pp. 165-166, (1998)
[3]  
YAO Y Y., Granular computing using neighborhood systems[C], Advances in Soft Computing, pp. 539-553, (1999)
[4]  
LIU Q, HUANG Y H., G-logic ans its reasoning[J], Chinese Journal of Computers, 27, 7, pp. 865-872, (2004)
[5]  
MIAO D Q, FAN S D., The calculation of knowledge granulation and its application[J], System Engineering Theory and Practice, 22, 1, pp. 48-56, (2002)
[6]  
ZHANG Y Q, FRASER M D, GACLIANO R A., Grarular neural networks for numericat linguistic data fusion and know ledge discovery[J], IEEE Transactions on Neural Networks, 11, pp. 658-667, (2000)
[7]  
LIANG J Y, FENG C J, SONG P., Overview of big data correlation analysis[J], Chinese Journal of Computers, 39, 1, pp. 1-18, (2016)
[8]  
ZHU F, WANG Y Q., Research on user information clustering and prediction based on k-means and neural network machine learning algorithm[J], Information Science, 39, 7, pp. 83-90, (2021)
[9]  
CHANG S Y, BAI X Z, XU J., A two- dimensional shock wave pattern recognition algorithm based on cluster analysis[J], Acta Aeronautica ET Astronautica Sinica, 41, 8, pp. 162-175, (2020)
[10]  
WANG H L,, LIU L, LIN M,, Et al., Music personalized recommendation algorithm based on k-means clustering algorithm[J], Journal of Jilin University(Engineering and Technology Edition), 51, 5, pp. 1845-1850, (2021)