Entropy K-Means Clustering With Feature Reduction Under Unknown Number of Clusters

被引:39
作者
Sinaga, Kristina P. [1 ]
Hussain, Ishtiaq [2 ]
Yang, Miin-Shen [2 ]
机构
[1] BINUS Univ, BINUS Grad Program, Dept Master Informat Syst Management, Jakarta 10279, Indonesia
[2] Chung Yuan Christian Univ, Dept Appl Math, Taoyuan 32023, Taiwan
关键词
Clustering algorithms; Entropy; Linear programming; Indexes; Optimization; Partitioning algorithms; Licenses; Clustering; k-means; entropy; feature weights; feature reduction; number of clusters; entropy-k-means; KRILL HERD; OPTIMIZATION; MODEL;
D O I
10.1109/ACCESS.2021.3077622
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The k-means algorithm with its extensions is the most used clustering method in the literature. But, the k-means and its various extensions are generally affected by initializations with a given number of clusters. On the other hand, most of k-means always treat data points with equal importance for feature components. There are several feature-weighted k-means proposed in literature, but, these feature-weighted k-means do not give a feature reduction behavior. In this paper, based on several entropy-regularized terms we can construct a novel k-means clustering algorithm, called Entropy-k-means, such that it can be free of initializations without a given number of clusters, and also has a feature reduction behavior. That is, the proposed Entropy-k-means algorithm can eliminate irrelevant features with feature reduction under free of initializations with automatically finding an optimal number of clusters. Comparisons between the proposed Entropy-k-means and other methods are made. Experimental results and comparisons actually demonstrate these good aspects of the proposed Entropy-k-means with its effectiveness and usefulness in practice.
引用
收藏
页码:67736 / 67751
页数:16
相关论文
共 42 条
[1]   Fast density clustering strategies based on the k-means algorithm [J].
Bai, Liang ;
Cheng, Xueqi ;
Liang, Jiye ;
Shen, Huawei ;
Guo, Yike .
PATTERN RECOGNITION, 2017, 71 :375-386
[2]  
Blake C. L., 1998, UCI REPOSITORY MACHI
[4]  
Caliski T., 1974, COMMUN STAT, V3, P1, DOI [10.1080/03610927408827101, DOI 10.1080/03610927408827101]
[5]  
Cover T. M., 1991, ELEMENTS INFORM THEO, DOI 10.1002/0471200611
[6]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227
[7]  
Dubes RC., 1988, ALGORITHMS CLUSTERIN
[8]  
Dunn J. C., 1973, Journal of Cybernetics, V3, P32, DOI 10.1080/01969727308546046
[9]   IONIZATION YIELD OF RADIATIONS .2. THE FLUCTUATIONS OF THE NUMBER OF IONS [J].
FANO, U .
PHYSICAL REVIEW, 1947, 72 (01) :26-29
[10]   Monarch butterfly optimization: A comprehensive review [J].
Feng, Yanhong ;
Deb, Suash ;
Wang, Gai-Ge ;
Alavi, Amir H. .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168