Farthest Centroid Divisive Clustering

被引:3
作者
Fang, Haw-ren [1 ]
Saad, Yousef [2 ]
机构
[1] Argonne Natl Lab, Math & Comp Sci Div, 9700 S Cass Ave, Argonne, IL 60439 USA
[2] Univ Minnesota, Comp Sci & Eng Dept, Minneapolis, MN 55455 USA
来源
SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS | 2008年
关键词
D O I
10.1109/ICMLA.2008.141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A method is presented to partition a given set of data entries embedded in Euclidean space by recursively bisecting clusters into smaller ones. The initial set is subdivided into two subsets whose centroids are farthest from each other, and the process is repeated recursively on each subset. An approximate algorithm is proposed to solve the original integer programming problem which is NP-hard. Experimental evidence shows that the clustering method often outperforms a standard spectral clustering method, albeit at a slightly higher computational cost. The paper also discusses improvements of the standard K-means algorithm. Specifically, the clustering quality resulting from the K-means technique can be significantly enhanced by using the proposed algorithm for its initialization.
引用
收藏
页码:232 / +
页数:2
相关论文
共 16 条
[1]  
BOLEY DL, 1998, DATA MIN KNOWL DISC, V2, P324
[2]  
Bottou L., 1995, Advances in Neural Information Processing Systems 7, P585
[3]  
Cormen T. H., 2001, Introduction to Algorithms, V2nd
[4]  
FANG H, 2008, UMSI20085
[5]  
JUHASZ F, 1980, COLLOQUIA MATH SOC J, V22, P405
[6]  
Karypis G, 1995, SUPERCOMP PROC, P658
[7]  
Kernighan B. W., 1970, Bell System Technical Journal, V49, P291
[8]   COMPUTER SOLUTIONS OF TRAVELING SALESMAN PROBLEM [J].
LIN, S .
BELL SYSTEM TECHNICAL JOURNAL, 1965, 44 (10) :2245-+
[9]   Spectral clustering of protein sequences [J].
Paccanaro, A ;
Casbon, JA ;
Saqi, MAS .
NUCLEIC ACIDS RESEARCH, 2006, 34 (05) :1571-1580
[10]   PARTITIONING SPARSE MATRICES WITH EIGENVECTORS OF GRAPHS [J].
POTHEN, A ;
SIMON, HD ;
LIOU, KP .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1990, 11 (03) :430-452