An optimization approach to partitional data clustering

被引:9
|
作者
Kim, J. [2 ]
Yang, J. [1 ]
Olafsson, S. [3 ]
机构
[1] Chonbuk Natl Univ, Dept Ind & Informat Syst Engn, Jeonju 561756, Jeonbuck, South Korea
[2] KOSBI, Seoul, South Korea
[3] Iowa State Univ, Ames, IA USA
关键词
optimization-based partitional clustering; scalability; partitioning; K-MEANS; ALGORITHMS;
D O I
10.1057/jors.2008.195
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Scalability of clustering algorithms is a critical issue facing the data mining community. One method to handle this issue is to use only a subset of all instances. This paper develops an optimization-based approach to the partitional clustering problem using an algorithm specifically designed for noisy performance, which is a problem that arises when using a subset of instances. Numerical results show that computation time can be dramatically reduced by using a partial set of instances without sacrificing solution quality. In addition, these results are more persuasive as the size of the problem is larger. Journal of the Operational Research Society (2009) 60, 1069-1084. doi:10.1057/jors.2008.195 Published online 8 April 2009
引用
收藏
页码:1069 / 1084
页数:16
相关论文
共 50 条
  • [41] Modified Teacher Learning Based Optimization Method for Data Clustering
    Sahoo, Anoop J.
    Kumar, Yugal
    ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2014, 264 : 429 - 437
  • [42] Local neighbour spider monkey optimization algorithm for data clustering
    Patel, Vaishali P.
    Rawat, Manoj Kumar
    Patel, Amit S.
    EVOLUTIONARY INTELLIGENCE, 2023, 16 (01) : 133 - 151
  • [43] Local Best Particle Swarm Optimization for Partitioning Data Clustering
    Azab, Shahira Shaaban
    Hady, Mohamed Farouk Abdel
    Hefny, Hesham Ahmed
    ICENCO 2016 - 2016 12TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO) - BOUNDLESS SMART SOCIETIES, 2016, : 41 - 46
  • [44] Clustering Algorithm Optimization Applied to Metagenomics Using Big Data
    Vanegas, Julian
    Bonet, Isis
    INFORMATION AND COMMUNICATION TECHNOLOGIES OF ECUADOR (TIC.EC), 2019, 884 : 182 - 192
  • [45] A robust fuzzy approach for gene expression data clustering
    Meskat Jahan
    Mahmudul Hasan
    Soft Computing, 2021, 25 : 14583 - 14596
  • [46] A robust fuzzy approach for gene expression data clustering
    Jahan, Meskat
    Hasan, Mahmudul
    SOFT COMPUTING, 2021, 25 (23) : 14583 - 14596
  • [47] Hybrid Reptile Search Algorithm and Remora Optimization Algorithm for Optimization Tasks and Data Clustering
    Almotairi, Khaled H.
    Abualigah, Laith
    SYMMETRY-BASEL, 2022, 14 (03):
  • [48] Data Clustering using Enhanced Biogeography-based Optimization
    Pal, Raju
    Saraswat, Mukesh
    2017 TENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2017, : 122 - 127
  • [49] A Behavioral Study of Some Widely Employed Partitional and Model-Based Clustering Algorithms and Their Hybridizations
    Kishor, D. Raja
    Venkateswarlu, N. B.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2, 2017, 469 : 587 - 601
  • [50] Multi-view fuzzy clustering with minimax optimization for effective clustering of data from multiple sources
    Wang, Yangtao
    Chen, Lihui
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 72 : 457 - 466