A New K-means-Based Algorithm for Automatic Clustering and Outlier Discovery

Cited by: 3
Authors
Jambudi, Trushali [1]
Gandhi, Savita [2]
Affiliations
[1] Ahmedabad Univ, Sch Comp Studies, Ahmadabad, Gujarat, India
[2] Gujarat Univ, Dept Comp Sci, Ahmadabad, Gujarat, India
Keywords
Data clustering; Outlier mining; K-means clustering; Data mining; Number of clusters; Merging clusters
DOI
10.1007/978-981-13-1747-7_44
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
K-means is one of the most popular partition-based clustering algorithms; it partitions data objects into K groups, or clusters, based on their attributes/features. In this paper, we address the major issues affecting the performance of the k-means clustering algorithm. We propose and implement a new k-means-based clustering algorithm that forms clusters by detecting and removing both global and local outliers and that converges automatically to optimal clusters. The clusters are formed by a two-part process: first, the initial clusters are split into subclusters based on a criterion at the local level; second, the clusters that satisfy a nearness criterion are merged. Experiments show that our algorithm automatically generates an optimal number of clusters of different sizes and shapes that are free from global and local outliers.
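The abstract does not state the paper's splitting, nearness, or outlier criteria, so the following is only a rough sketch of the general pipeline it describes, not the authors' algorithm. Everything specific here is an assumption made for illustration: the function names, the "mean + z * std" distance rule used for both global and local outliers, the use of deliberate over-segmentation (`k_init`) as a stand-in for the split step, and the centroid-distance threshold `merge_dist` used as the nearness test.

```python
import numpy as np

def kmeans(X, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means; returns (labels, centers)."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)].astype(float)
    for _ in range(n_iter):
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        new_centers = centers.copy()
        for j in range(k):
            if np.any(labels == j):  # an empty cluster keeps its old center
                new_centers[j] = X[labels == j].mean(axis=0)
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    # Final assignment so labels always match the returned centers.
    dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    return dist.argmin(axis=1), centers

def far_points(dist, z):
    """Hypothetical outlier rule: distance above mean + z * std."""
    return dist > dist.mean() + z * dist.std()

def merge_near(centers, labels, merge_dist):
    """Merge subclusters whose centroids are closer than merge_dist."""
    k = len(centers)
    parent = list(range(k))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(k):
        for j in range(i + 1, k):
            if np.linalg.norm(centers[i] - centers[j]) < merge_dist:
                parent[find(i)] = find(j)
    roots = np.array([find(i) for i in range(k)])
    # Renumber the merged groups that actually contain points as 0..m-1.
    _, merged = np.unique(roots[labels], return_inverse=True)
    return merged

def cluster_with_outliers(X, k_init=6, z=3.0, merge_dist=4.0, seed=0):
    """Return one label per row of X; -1 marks a removed outlier."""
    X = np.asarray(X, dtype=float)
    result = np.full(len(X), -1)
    # 1) Global outliers: points far from the overall mean (assumed rule).
    keep = ~far_points(np.linalg.norm(X - X.mean(axis=0), axis=1), z)
    idx = np.flatnonzero(keep)
    # 2) Over-segment into k_init subclusters (stand-in for the split step).
    labels, centers = kmeans(X[idx], k_init, seed=seed)
    # 3) Local outliers: points far from their own subcluster centroid.
    d_own = np.linalg.norm(X[idx] - centers[labels], axis=1)
    local_out = np.zeros(len(idx), dtype=bool)
    for j in range(k_init):
        members = labels == j
        if members.sum() >= 3:  # skip tiny subclusters
            local_out[members] = far_points(d_own[members], z)
    # 4) Merge subclusters that satisfy the (assumed) nearness test.
    merged = merge_near(centers, labels, merge_dist)
    result[idx[~local_out]] = merged[~local_out]
    return result
```

Calling `cluster_with_outliers(X)` on an (n, d) array returns an integer label per point, with -1 for points dropped as outliers. The number of final clusters is not passed in; it falls out of how many over-segmented subclusters remain separate after merging, which is the sense in which such a scheme picks K automatically. The thresholds `z` and `merge_dist` would need tuning per dataset.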
Pages: 457-467
Number of pages: 11
Related papers
50 in total
  • [21] A dynamic K-means-based clustering algorithm using fuzzy logic for CH selection and data transmission based on machine learning
    Anupam Choudhary
    Abhishek Badholia
    Anurag Sharma
    Brijesh Patel
    Sapna Jain
    Soft Computing, 2023, 27 : 6135 - 6149
  • [22] Sensitivity analysis of an outlier-aware k-means clustering algorithm
    Olukanmi, Peter O.
    Twala, Bhekisipho
    2017 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS (PRASA-ROBMECH), 2017 : 68 - 73
  • [23] k*-means: A new generalized k-means clustering algorithm
    Cheung, YM
    PATTERN RECOGNITION LETTERS, 2003, 24 (15) : 2883 - 2893
  • [24] An improved k-means clustering algorithm for the community discovery
    JiangYan, Sun
    Journal of Software Engineering, 2015, 9 (02) : 242 - 253
  • [25] A K-Means-Based Interpolation Algorithm With Lp-Norm and Feature Weighting
    Miao, Yipeng
    Xu, Yenan
    IEEE ACCESS, 2024, 12 : 96179 - 96192
  • [26] k-means clustering with outlier removal
    Gan, Guojun
    Ng, Michael Kwok-Po
    PATTERN RECOGNITION LETTERS, 2017, 90 : 8 - 14
  • [27] K-Means-Based Nature-Inspired Metaheuristic Algorithms for Automatic Data Clustering Problems: Recent Advances and Future Directions
    Ikotun, Abiodun M.
    Almutari, Mubarak S.
    Ezugwu, Absalom E.
    APPLIED SCIENCES-BASEL, 2021, 11 (23):
  • [28] K-means-based fuzzy classifier design
    Wong, CC
    Chen, CC
    Yeh, SL
    NINTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2000), VOLS 1 AND 2, 2000 : 48 - 52
  • [29] Automatic PAM clustering algorithm for outlier detection
    Zhu, Q. (qszhu@cqu.edu.cn), 1600, Academy Publisher (07):
  • [30] Designing framework to secure data using K Means clustering based outlier Detection (KCOD) algorithm
    Nithinsha, S.
    Anusuya, S.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (01) : 1057 - 1068