Model Selection Using K-Means Clustering Algorithm for the Symmetrical Segmentation of Remote Sensing Datasets

被引:16
|
作者
Ali, Ishfaq [1 ]
Rehman, Atiq Ur [2 ]
Khan, Dost Muhammad [1 ]
Khan, Zardad [1 ]
Shafiq, Muhammad [3 ]
Choi, Jin-Ghoo [3 ]
机构
[1] Abdul Wali Khan Univ, Dept Stat, Mardan 23200, Pakistan
[2] Int Islam Univ, Fac Basic & Appl Sci, Dept Math & Stat, Islamabad 44000, Pakistan
[3] Yeungnam Univ, Dept Informat & Commun Engn, Gyongsan 38541, South Korea
来源
SYMMETRY-BASEL | 2022年 / 14卷 / 06期
基金
新加坡国家研究基金会;
关键词
unsupervised clustering; k-means; balanced optimal number of clusters; symmetry; clustering validity indices; remote sensing; root mean square error; satellite images; BIG DATA; DATA SET; NUMBER;
D O I
10.3390/sym14061149
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The importance of unsupervised clustering methods is well established in the statistics and machine learning literature. Many sophisticated unsupervised classification techniques have been made available to deal with a growing number of datasets. Due to its simplicity and efficiency in clustering a large dataset, the k-means clustering algorithm is still popular and widely used in the machine learning community. However, as with other clustering methods, it requires one to choose the balanced number of clusters in advance. This paper's primary emphasis is to develop a novel method for finding the optimum number of clusters, k, using a data-driven approach. Taking into account the cluster symmetry property, the k-means algorithm is applied multiple times to a range of k values within which the balanced optimum k value is expected. This is based on the uniqueness and symmetrical nature among the centroid values for the clusters produced, and we chose the final k value as the one for which symmetry is observed. We evaluated the proposed algorithm's performance on different simulated datasets with controlled parameters and also on real datasets taken from the UCI machine learning repository. We also evaluated the performance of the proposed method with the aim of remote sensing, such as in deforestation and urbanization, using satellite images of the Islamabad region in Pakistan, taken from the Sentinel-2B satellite of the United States Geological Survey. From the experimental results and real data analysis, it is concluded that the proposed algorithm has better accuracy and minimum root mean square error than the existing methods.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] IMPROVEMENT IN K-MEANS CLUSTERING ALGORITHM FOR DATA CLUSTERING
    Rajeswari, K.
    Acharya, Omkar
    Sharma, Mayur
    Kopnar, Mahesh
    Karandikar, Kiran
    1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 367 - 369
  • [42] Dynamic particle swarm optimization and K-means clustering algorithm for image segmentation
    Li, Haiyang
    He, Hongzhou
    Wen, Yongge
    OPTIK, 2015, 126 (24): : 4817 - 4822
  • [43] Brain Image Segmentation Based on Firefly Algorithm Combined with K-means Clustering
    Capor Hrosik, Romana
    Tuba, Eva
    Dolicanin, Edin
    Jovanovic, Raka
    Tuba, Milan
    STUDIES IN INFORMATICS AND CONTROL, 2019, 28 (02): : 167 - 176
  • [44] Development of a Corruption Detection Algorithm using K-means Clustering
    Islam, Md. Tawheedul
    Abu Yousuf, Mohammad
    2018 INTERNATIONAL CONFERENCE ON ADVANCEMENT IN ELECTRICAL AND ELECTRONIC ENGINEERING (ICAEEE), 2018,
  • [45] Underground Electrical Profile Clustering Using K-MEANS Algorithm
    Kutbay, Ugurhan
    Ural, Ali Berkan
    Hardalac, Firat
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 561 - 564
  • [46] An Approach for Document Clustering using PSO and K-means Algorithm
    Chouhan, Rashmi
    Purohit, Anuradha
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INVENTIVE SYSTEMS AND CONTROL (ICISC 2018), 2018, : 1380 - 1384
  • [47] DETERMINISTIC INITIALIZATION OF THE K-MEANS ALGORITHM USING HIERARCHICAL CLUSTERING
    Celebi, M. Emre
    Kingravi, Hassan A.
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)
  • [48] Wheat ear counting using K-means clustering segmentation and convolutional neural network
    Xu, Xin
    Li, Haiyang
    Yin, Fei
    Xi, Lei
    Qiao, Hongbo
    Ma, Zhaowu
    Shen, Shuaijie
    Jiang, Binchao
    Ma, Xinming
    PLANT METHODS, 2020, 16 (01)
  • [49] Clustering Centroid Selection using a K-means and Rapid Density Peak Search Fusion Algorithm
    Zhang, Chenyang
    Wang, Jiamei
    Li, Xinyun
    Fu, Fei
    Wang, Weiquan
    PROCEEDINGS OF 2020 IEEE 11TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2020), 2020, : 201 - 207
  • [50] K-means Clustering Using R A Case Study of Market Segmentation
    Phan Duy Hung
    Nguyen Duc Ngoc
    Tran Duc Hanh
    PROCEEDINGS OF THE 2019 5TH INTERNATIONAL CONFERENCE ON E-BUSINESS AND APPLICATIONS (ICEBA 2019), 2019, : 100 - 104