Minimum Distribution Support Vector Clustering

被引:2
作者
Wang, Yan [1 ,2 ]
Chen, Jiali [1 ]
Xie, Xuping [1 ]
Yang, Sen [1 ]
Pang, Wei [3 ]
Huang, Lan [1 ]
Zhang, Shuangquan [1 ]
Zhao, Shishun [4 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China
[2] Jilin Univ, Sch Artificial Intelligence, Changchun 130012, Peoples R China
[3] Heriot Watt Univ, Sch Math & Comp Sci, Edinburgh EH14 4AS, Midlothian, Scotland
[4] Jilin Univ, Coll Math, Changchun 130012, Peoples R China
基金
中国国家自然科学基金;
关键词
support vector clustering; margin theory; mean; variance; dual coordinate descent; DISTRIBUTION MACHINE;
D O I
10.3390/e23111473
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Support vector clustering (SVC) is a boundary-based algorithm, which has several advantages over other clustering methods, including identifying clusters of arbitrary shapes and numbers. Leveraged by the high generalization ability of the large margin distribution machine (LDM) and the optimal margin distribution clustering (ODMC), we propose a new clustering method: minimum distribution for support vector clustering (MDSVC), for improving the robustness of boundary point recognition, which characterizes the optimal hypersphere by the first-order and second-order statistics and tries to minimize the mean and variance simultaneously. In addition, we further prove, theoretically, that our algorithm can obtain better generalization performance. Some instructive insights for adjusting the number of support vector points are gained. For the optimization problem of MDSVC, we propose a double coordinate descent algorithm for small and medium samples. The experimental results on both artificial and real datasets indicate that our MDSVC has a significant improvement in generalization performance compared to SVC.
引用
收藏
页数:20
相关论文
共 25 条
  • [1] Ben-Hur A, 2001, ADV NEUR IN, V13, P367
  • [2] Berkhin P, 2006, GROUPING MULTIDIMENSIONAL DATA: RECENT ADVANCES IN CLUSTERING, P25
  • [3] An Efficient Method for subjectively choosing parameter 'k' automatically in VDBSCAN (Varied Density Based Spatial Clustering of Applications with Noise) Algorithm
    Chowdhury, A. K. M. Rasheduzzaman
    Mollah, Md. Elias
    Rahman, Md. Asikur
    [J]. 2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 1, 2010, : 38 - 41
  • [4] On the doubt about margin explanation of boosting
    Gao, Wei
    Zhou, Zhi-Hua
    [J]. ARTIFICIAL INTELLIGENCE, 2013, 203 : 1 - 18
  • [5] Recent Advances in Large Margin Learning
    Guo, Yiwen
    Zhang, Changshui
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 7167 - 7174
  • [6] Jennath H. S., 2022, International Conference on Innovative Computing and Communications: Proceedings of ICICC 2021. Advances in Intelligent Systems and Computing (1388), P285, DOI 10.1007/978-981-16-2597-8_24
  • [7] Hyper-parameter optimization for support vector machines using stochastic gradient descent and dual coordinate descent
    Jiang, Wei
    Siddiqui, Sauleh
    [J]. EURO JOURNAL ON COMPUTATIONAL OPTIMIZATION, 2020, 8 (01) : 85 - 101
  • [8] An efficient k-means clustering filtering algorithm using density based initial cluster centers
    Kumar, K. Mahesh
    Reddy, A. Rama Mohan
    [J]. INFORMATION SCIENCES, 2017, 418 : 286 - 301
  • [9] Lee S.-H., 2005, ADV BIOINFORMATICS I, P151
  • [10] Lee SH, 2006, SIAM PROC S, P484