Robust and compact maximum margin clustering for high-dimensional data

被引:1
作者
Cevikalp, Hakan [1 ]
Chome, Edward [2 ]
机构
[1] Eskisehir Osmangazi Univ, Machine Learning & Comp Vis Lab, Elect & Elect Engn Dept, TR-26480 Meselik, Eskisehir, Turkiye
[2] Eskisehir Tech Univ, Comp Engn Dept, Eskisehir, Turkiye
关键词
Maximum margin clustering; Subspace clustering; Hyperplane fitting; Large margin; Robust clustering; ALGORITHM;
D O I
10.1007/s00521-023-09388-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of machine learning, clustering has become an increasingly popular research topic due to its critical importance. Many clustering algorithms have been proposed utilizing a variety of approaches. This study focuses on clustering of high-dimensional data using the maximum margin clustering approach. In this paper, two methods are introduced: The first method employs the classical maximum margin clustering approach, which separates data into two clusters with the greatest margin between them. The second method takes cluster compactness into account and searches for two parallel hyperplanes that best fit to the cluster samples while also being as far apart from each other as possible. Additionally, robust variants of these clustering methods are introduced to handle outliers and noise within the data samples. The stochastic gradient algorithm is used to solve the resulting optimization problems, enabling all proposed clustering methods to scale well with large-scale data. Experimental results demonstrate that the proposed methods are more effective than existing maximum margin clustering methods, particularly in high-dimensional clustering problems, highlighting the efficacy of the proposed methods.
引用
收藏
页码:5981 / 6003
页数:23
相关论文
共 73 条
  • [1] Aeberhard S., 1991, UCI Mach Learn Repos, V10, pC5PC7J
  • [2] Alpaydin E, 1998, UCI Machine Learning Repository, DOI DOI 10.24432/C50P49
  • [3] [Anonymous], 2004, P 23 ACM SIGMOD SIGA
  • [4] [Anonymous], 1993, UCI Machine Learning Repository, DOI DOI 10.24432/C55887
  • [5] [Anonymous], 2011, Advances in Neural Information Processing Systems, DOI DOI 10.1109/TPAMI.2013.57
  • [6] Beyer K, 1999, LECT NOTES COMPUT SC, V1540, P217
  • [7] Bilenko M., 2004, P INT C MACH LEARN, P81
  • [8] High-dimensional data clustering by using local affine/convex hulls
    Cevikalp, Hakan
    [J]. PATTERN RECOGNITION LETTERS, 2019, 128 : 427 - 432
  • [9] Large-scale robust transductive support vector machines
    Cevikalp, Hakan
    Franc, Vojtech
    [J]. NEUROCOMPUTING, 2017, 235 : 199 - 209
  • [10] Large margin classifiers based on affine hulls
    Cevikalp, Hakan
    Triggs, Bill
    Yavuz, Hasan Serhan
    Kucuk, Yalcin
    Kucuk, Mahide
    Barkana, Atalay
    [J]. NEUROCOMPUTING, 2010, 73 (16-18) : 3160 - 3168