An optimization approach to partitional data clustering

被引:9
|
作者
Kim, J. [2 ]
Yang, J. [1 ]
Olafsson, S. [3 ]
机构
[1] Chonbuk Natl Univ, Dept Ind & Informat Syst Engn, Jeonju 561756, Jeonbuck, South Korea
[2] KOSBI, Seoul, South Korea
[3] Iowa State Univ, Ames, IA USA
关键词
optimization-based partitional clustering; scalability; partitioning; K-MEANS; ALGORITHMS;
D O I
10.1057/jors.2008.195
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Scalability of clustering algorithms is a critical issue facing the data mining community. One method to handle this issue is to use only a subset of all instances. This paper develops an optimization-based approach to the partitional clustering problem using an algorithm specifically designed for noisy performance, which is a problem that arises when using a subset of instances. Numerical results show that computation time can be dramatically reduced by using a partial set of instances without sacrificing solution quality. In addition, these results are more persuasive as the size of the problem is larger. Journal of the Operational Research Society (2009) 60, 1069-1084. doi:10.1057/jors.2008.195 Published online 8 April 2009
引用
收藏
页码:1069 / 1084
页数:16
相关论文
共 50 条
  • [1] A Kalman filtering induced heuristic optimization based partitional data clustering
    Pakrashi, Arjun
    Chaudhuri, Bidyut B.
    INFORMATION SCIENCES, 2016, 369 : 704 - 717
  • [2] Maxmin Data Range Heuristic-Based Initial Centroid Method of Partitional Clustering for Big Data Mining
    Pandey, Kamlesh Kumar
    Shukla, Diwakar
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2022, 12 (01)
  • [3] An Effective Partitional Crisp Clustering Method Using Gradient Descent Approach
    Shalileh, Soroosh
    MATHEMATICS, 2023, 11 (12)
  • [4] Genetic Algorithms in Partitional Clustering: A Comparison
    Paterlini, Sandra
    Minerva, Tommaso
    RECENT ADVANCES IN NEURAL NETWORKS, FUZZY SYSTEMS & EVOLUTIONARY COMPUTING, 2010, : 28 - +
  • [5] Improved fast partitional clustering algorithm for text clustering
    Bejos, Sebastian
    Feliciano-Avelino, Ivan
    Martinez-Trinidad, J. Fco.
    Carrasco-Ochoa, J. A.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (02) : 2137 - 2145
  • [6] NDPD: an improved initial centroid method of partitional clustering for big data mining
    Pandey, Kamlesh Kumar
    Shukla, Diwakar
    JOURNAL OF ADVANCES IN MANAGEMENT RESEARCH, 2023, 20 (01) : 1 - 34
  • [7] Maxmin distance sort heuristic-based initial centroid method of partitional clustering for big data mining
    Pandey, Kamlesh Kumar
    Shukla, Diwakar
    PATTERN ANALYSIS AND APPLICATIONS, 2022, 25 (01) : 139 - 156
  • [8] Multivariate Analysis of LTE Radio-Layer Parameters based on a Partitional Clustering Approach
    Pasquino, Nicola
    Ventre, Giorgio
    Zinno, Stefania
    Petrocelli, Sofia
    2019 IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR INDUSTRY 4.0 AND INTERNET OF THINGS (METROIND4.0&IOT), 2019, : 22 - 27
  • [9] Black hole: A new heuristic optimization approach for data clustering
    Hatamlou, Abdolreza
    INFORMATION SCIENCES, 2013, 222 : 175 - 184
  • [10] An improved approach of particle swarm optimization and application in data clustering
    Tran, Dang Cong
    Wu, Zhijian
    Deng, Changshou
    INTELLIGENT DATA ANALYSIS, 2015, 19 (05) : 1049 - 1070