Clustering High-Dimensional Data via Random Sampling and Consensus

被引:0
作者
Traganitis, Panagiotis A. [1 ]
Slavakis, Konstantinos
Giannakis, Georgios B.
机构
[1] Univ Minnesota, Dept ECE, Minneapolis, MN 55455 USA
来源
2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP) | 2014年
关键词
Clustering; high-dimensional data; feature selection; random sampling and consensus; K-means; FEATURE-SELECTION; ALGORITHMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In response to the urgent need for learning tools tuned to big data analytics, the present paper introduces a feature selection approach to efficient clustering of high-dimensional vectors. The resultant method leverages random sampling and consensus (RANSAC) arguments, originally developed for robust regression tasks in computer vision, to yield novel dimensionality reduction schemes. The advocated random sampling and consensus K-means (RSC-Kmeans) algorithm can operate in either batch or sequential modes, with the latter being able to afford lower computational footprint than the former. Extensive numerical tests on synthetic and real datasets highlight the potential of the proposed algorithms, and demonstrate their competitive performance relative to state-of-the-art random projection alternatives.
引用
收藏
页码:307 / 311
页数:5
相关论文
共 28 条
[1]  
Achlioptas D, 2001, P 20 ACM SIGMOD SIGA, DOI [DOI 10.1145/375551.375608, 10.1145/375551.375608]
[2]  
[Anonymous], 2001, Pattern Classification
[3]  
[Anonymous], 2013, MATLAB VERS 8 2 0 70
[4]  
Bengtsson Thomas, 2008, PROBABILITY STAT ESS, P316, DOI DOI 10.1214/193940307000000518
[5]  
Boutsidis Christos., 2011, CoRR
[6]   On self-organizing algorithms and networks for class-separability features [J].
Chatterjee, C ;
Roychowdhury, VP .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (03) :663-678
[7]   Optimal Randomized RANSAC [J].
Chum, Ondrej ;
Matas, Jiri .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (08) :1472-1482
[8]  
Clarkson KL, 2013, STOC'13: PROCEEDINGS OF THE 2013 ACM SYMPOSIUM ON THEORY OF COMPUTING, P81
[9]  
Cukier K., 2010, Economist Newspaper
[10]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137