A New K-means-Based Algorithm for Automatic Clustering and Outlier Discovery

被引:3
|
作者
Jambudi, Trushali [1 ]
Gandhi, Savita [2 ]
机构
[1] Ahmedabad Univ, Sch Comp Studies, Ahmadabad, Gujarat, India
[2] Gujarat Univ, Dept Comp Sci, Ahmadabad, Gujarat, India
关键词
Data clustering; Outlier mining; K-means clustering; Data mining; Number of clusters; Merging clusters;
D O I
10.1007/978-981-13-1747-7_44
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
K-means is one of the most popular partition-based clustering algorithms that partition data objects based on attributes/features into K number of groups or clusters. In this paper, we address the major issues affecting the performance of k-means clustering algorithm. We have proposed as well as implemented a new k-means-based clustering algorithm which forms clusters by detecting and removing both global and local outliers and automatically converging into optimal clusters which are formed by a two-part process of splitting the initial clusters into subclusters based on criterion at local level and, in the second part, merging the clusters that satisfy the nearness criterion. Experiments show that our algorithm is able to automatically generate optimal number of clusters of different sizes and shapes which are free from global and local outliers.
引用
收藏
页码:457 / 467
页数:11
相关论文
共 50 条
  • [1] A K-means-based algorithm for projective clustering
    Bouguessa, Mohamed
    Wang, Shengrui
    Jiang, Qingshan
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2006, : 888 - +
  • [2] Parallelizing K-means-based Clustering on Spark
    Wang, Bowen
    Yin, Jun
    Hua, Qi
    Wu, Zhiang
    Cao, Jie
    2016 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2016), 2016, : 31 - 36
  • [3] Algorithm 1038: KCC: A MATLAB Package for k-Means-based Consensus Clustering
    Lin, Hao
    Liu, Hongfu
    Wu, Junjie
    Li, Hong
    Guennemann, Stephan
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2023, 49 (04):
  • [4] A study of K-Means-based algorithms for constrained clustering
    Covoes, Thiago F.
    Hruschka, Eduardo R.
    Ghosh, Joydeep
    INTELLIGENT DATA ANALYSIS, 2013, 17 (03) : 485 - 505
  • [5] Greedy Optimization for K-Means-Based Consensus Clustering
    Li, Xue
    Liu, Hongfu
    TSINGHUA SCIENCE AND TECHNOLOGY, 2018, 23 (02) : 184 - 194
  • [6] Greedy Optimization for K-Means-Based Consensus Clustering
    Xue Li
    Hongfu Liu
    Tsinghua Science and Technology, 2018, 23 (02) : 184 - 194
  • [7] Greedy Optimization for K-Means-Based Consensus Clustering
    Xue Li
    Hongfu Liu
    Tsinghua Science and Technology, 2018, (02) : 184 - 194
  • [8] K-Means-Based Consensus Clustering: A Unified View
    Wu, Junjie
    Liu, Hongfu
    Xiong, Hui
    Cao, Jie
    Chen, Jian
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (01) : 155 - 169
  • [9] Clustering by Hybrid K-Means-Based Rider Sunflower Optimization Algorithm for Medical Data
    Rani, A. Jaya Mabel
    Pravin, A.
    ADVANCES IN FUZZY SYSTEMS, 2022, 2022
  • [10] Weight-Improved K-Means-Based Consensus Clustering
    Wang, Yanhua
    Xiang, Laisheng
    Liu, Xiyu
    HUMAN CENTERED COMPUTING, HCC 2017, 2018, 10745 : 46 - 52