A clustering method based on boosting

被引:57
作者
Frossyniotis, D
Likas, A
Stafylopatis, A
机构
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Athens 15773, Greece
[2] Univ Ioannina, Dept Comp Sci, Ioannina 45110, Greece
关键词
ensemble clustering; unsupervised learning; partitions schemes;
D O I
10.1016/j.patrec.2003.12.018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is widely recognized that the boosting methodology provides superior results for classification problems. In this paper, we propose the boost-clustering algorithm which constitutes a novel clustering methodology that exploits the general principles of boosting in order to provide a consistent partitioning of a dataset. The boost-clustering algorithm is a multi-clustering method. At each boosting iteration, a new training set is created using weighted random sampling from the original dataset and a simple clustering algorithm (e.g. k-means) is applied to provide a new data partitioning. The final clustering solution is produced by aggregating the multiple clustering results through weighted voting. Experiments on both artificial and real-world data sets indicate that boost-clustering provides solutions of improved quality. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:641 / 654
页数:14
相关论文
共 21 条
[1]  
[Anonymous], LNCS
[2]  
[Anonymous], Pattern Recognition With Fuzzy Objective Function Algorithms
[3]  
Bezdek JC., 1992, FUZZY MODELS PATTERN
[4]  
BOUNDAILLIER E, 1998, INTELL DATA ANAL, V2
[5]  
BREIMAN L, 1994, U CAL DEP STAT BERK
[6]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[7]  
*ELENA, 1995, ESPRIT BAS RES PROJ
[8]  
Fisher D. H., 1987, Machine Learning, V2, P139, DOI 10.1007/BF00114265
[9]  
Freund Y, 1996, ICML
[10]  
Frossyniotis D, 2002, LECT NOTES ARTIF INT, V2308, P225