Adaptive Data Clustering Ensemble Algorithm Based on Stability Feature Selection and Spectral Clustering

被引:0
作者
Li, Zuhong [1 ]
Ma, Zhixin [1 ]
Ma, Zhicheng [2 ]
Yang, Shibo [2 ]
机构
[1] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou, Peoples R China
[2] Gansu Shining Sci & Technol Co Ltd, Lanzhou, Peoples R China
来源
2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2019) | 2019年
关键词
clustering ensemble; feature selection; matrix eigengap; spectral clustering; similarity matrix;
D O I
10.1109/icaibd.2019.8836974
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data Clustering algorithms are essential techniques to pattern recognition. While many clustering algorithms have been proposed, none of them could gain a desirable result on various pattern recognition problems. In this paper, an adaptive clustering ensemble algorithm based on stability feature selection and spectral clustering for data clustering is proposed. The paper chooses strong features by stability feature selection, aiming to improve the quality of clustering. LIMBO is then applied on dataset to engender ensemble instances via calculating the similarity of ensemble members, a proximity matrix could be formed. The cluster ensemble problem is then formalized as a combinatorial spectral clustering problem. Empirical studies on several benchmarks from UC Irvine Machine Learning Repository shows the favorites of the proposed algorithm compares to its competitors, and the promising results yields new inspiration of cluster ensemble and spectral clustering.
引用
收藏
页码:277 / 281
页数:5
相关论文
共 21 条
[1]  
Al-Razgan M S, 2006, SIAM INT C DATA MINI
[2]  
Andritsos P, 2004, LECT NOTES COMPUT SC, V2992, P123
[3]   Speaker Diarization Exploiting the Eigengap Criterion and Cluster Ensembles [J].
Bassiou, Nikoletta ;
Moschou, Vassiliki ;
Kotropoulos, Constantine .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (08) :2134-2144
[4]  
Ben Ayed A, 2014, 2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), P331, DOI 10.1109/SOCPAR.2014.7008028
[5]  
Fern X Z, 2003, 20 INT C INT C MACH
[6]  
Fred A., 2001, MULTIPLE CLASSIFIER
[7]   Combining multiple clusterings using evidence accumulation [J].
Fred, ALN ;
Jain, AK .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (06) :835-850
[8]  
Fred ALN, 2002, INT C PATT RECOG, P276, DOI 10.1109/ICPR.2002.1047450
[9]   Data clustering: A review [J].
Jain, AK ;
Murty, MN ;
Flynn, PJ .
ACM COMPUTING SURVEYS, 1999, 31 (03) :264-323
[10]   Stability of feature selection algorithms: a study on high-dimensional spaces [J].
Kalousis, Alexandros ;
Prados, Julien ;
Hilario, Melanie .
KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 12 (01) :95-116