Proximity multi-sphere support vector clustering

Cited by: 5
Authors
Trung Le [1]
Dat Tran [1]
Phuoc Nguyen [1]
Ma, Wanli [1]
Sharma, Dharmendra [1]
Affiliation
[1] Univ Canberra, Fac Informat Sci & Engn, Canberra, ACT 2601, Australia
Keywords
Clustering; Support vector data description; Multi-sphere support vector data description; Support vector clustering; Multi-sphere support vector clustering; Proximity graph
DOI
10.1007/s00521-012-1001-7
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Support vector data description (SVDD) constructs an optimal hypersphere in feature space as a description of a data set. When mapped back to input space, this hypersphere becomes a set of contours, and support vector clustering (SVC) uses these contours as cluster boundaries to detect clusters in the data set. However, real-world data sets may contain several distinct distributions, so a single hypersphere is not always the best description. As a result, the set of contours in input space does not always detect all clusters in the data set. Another issue is that SVC cannot always preserve the notion of proximity, which is crucial for cluster analysis: two data points that are close to each other can be assigned to different clusters by the cluster labelling method of SVC. To overcome these drawbacks, we propose Proximity Multi-sphere Support Vector Clustering, which employs a set of hyperspheres to better describe data sets with distinct distributions, and a proximity graph to preserve the notion of proximity. Experimental results on different data sets are presented to evaluate the proposed clustering technique and compare it with SVC and other clustering techniques.
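The single-sphere SVC baseline that the abstract describes can be sketched as follows. This is a minimal illustration, not the authors' multi-sphere method or their implementation. It makes several assumptions that are not in the record: a Gaussian kernel with width `q`, a hard-margin sphere (no outliers), a plain Frank-Wolfe loop as a stand-in solver for the SVDD dual, and the usual segment-sampling rule for cluster labelling. All function names and the toy data are hypothetical.

```python
import math


def gaussian_kernel(a, b, q):
    """Gaussian kernel k(a, b) = exp(-q * ||a - b||^2)."""
    return math.exp(-q * sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def fit_svdd(points, q, iterations=500):
    """Approximate the hard-margin SVDD dual with Frank-Wolfe (assumed solver).

    For a Gaussian kernel k(x, x) = 1, so the dual reduces to minimising
    beta^T K beta over the probability simplex.
    """
    n = len(points)
    K = [[gaussian_kernel(p, r, q) for r in points] for p in points]
    beta = [1.0 / n] * n
    for t in range(1, iterations + 1):
        grad = [2.0 * sum(K[i][j] * beta[j] for j in range(n)) for i in range(n)]
        best = min(range(n), key=grad.__getitem__)  # simplex vertex to move towards
        step = 2.0 / (t + 2.0)
        beta = [(1.0 - step) * b for b in beta]
        beta[best] += step
    const = sum(beta[i] * K[i][j] * beta[j] for i in range(n) for j in range(n))
    return beta, const


def squared_distance(x, points, beta, const, q):
    """Squared feature-space distance from the image of x to the sphere centre."""
    cross = sum(b * gaussian_kernel(p, x, q) for b, p in zip(beta, points))
    return 1.0 - 2.0 * cross + const


def svc_labels(points, q, samples=10, tol=1e-9):
    """Label points by connected components of the 'segment stays inside' graph."""
    n = len(points)
    beta, const = fit_svdd(points, q)
    # Hard-margin assumption: the sphere just encloses every training point.
    radius2 = max(squared_distance(p, points, beta, const, q) for p in points)

    def connected(a, b):
        # Two points are linked if every sampled point on the segment
        # between them maps inside the sphere.
        for k in range(1, samples + 1):
            t = k / (samples + 1)
            y = [ai + t * (bi - ai) for ai, bi in zip(a, b)]
            if squared_distance(y, points, beta, const, q) > radius2 + tol:
                return False
        return True

    labels = [-1] * n
    current = 0
    for i in range(n):
        if labels[i] != -1:
            continue
        labels[i] = current
        stack = [i]
        while stack:  # depth-first search over the implicit graph
            u = stack.pop()
            for v in range(n):
                if labels[v] == -1 and connected(points[u], points[v]):
                    labels[v] = current
                    stack.append(v)
        current += 1
    return labels


# Toy data (hypothetical): two tight, well-separated groups in the plane.
POINTS = [(0.0, 0.0), (0.2, 0.0), (0.0, 0.2), (0.2, 0.2),
          (5.0, 5.0), (5.2, 5.0), (5.0, 5.2), (5.2, 5.2)]
LABELS = svc_labels(POINTS, q=1.0)
print(LABELS)
```

On this toy data the segment test separates the two groups, because points sampled between the groups fall outside the sphere. The labelling depends only on whether sampled segment points stay inside one sphere, which is the step where, according to the abstract, nearby points can end up in different clusters.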
Pages: 1309-1319
Page count: 11