Clustering High-Dimensional Data via Random Sampling and Consensus

被引:0
|
作者
Traganitis, Panagiotis A. [1 ]
Slavakis, Konstantinos
Giannakis, Georgios B.
机构
[1] Univ Minnesota, Dept ECE, Minneapolis, MN 55455 USA
关键词
Clustering; high-dimensional data; feature selection; random sampling and consensus; K-means; FEATURE-SELECTION; ALGORITHMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In response to the urgent need for learning tools tuned to big data analytics, the present paper introduces a feature selection approach to efficient clustering of high-dimensional vectors. The resultant method leverages random sampling and consensus (RANSAC) arguments, originally developed for robust regression tasks in computer vision, to yield novel dimensionality reduction schemes. The advocated random sampling and consensus K-means (RSC-Kmeans) algorithm can operate in either batch or sequential modes, with the latter being able to afford lower computational footprint than the former. Extensive numerical tests on synthetic and real datasets highlight the potential of the proposed algorithms, and demonstrate their competitive performance relative to state-of-the-art random projection alternatives.
引用
收藏
页码:307 / 311
页数:5
相关论文
共 50 条
  • [1] High-Dimensional Clustering via Random Projections
    Laura Anderlucci
    Francesca Fortunato
    Angela Montanari
    Journal of Classification, 2022, 39 : 191 - 216
  • [2] High-Dimensional Clustering via Random Projections
    Anderlucci, Laura
    Fortunato, Francesca
    Montanari, Angela
    JOURNAL OF CLASSIFICATION, 2022, 39 (01) : 191 - 216
  • [3] Iterative random projections for high-dimensional data clustering
    Cardoso, Angelo
    Wichert, Andreas
    PATTERN RECOGNITION LETTERS, 2012, 33 (13) : 1749 - 1755
  • [4] Subspace-Weighted Consensus Clustering for High-Dimensional Data
    Cai, Xiaosha
    Huang, Dong
    ADVANCED DATA MINING AND APPLICATIONS, 2020, 12447 : 3 - 16
  • [5] Random Projections and Sampling Algorithms for Clustering of High-Dimensional Polygonal Curves
    Meintrup, Stefan
    Munteanu, Alexander
    Rohde, Dennis
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [6] Clustering high-dimensional data via feature selection
    Liu, Tianqi
    Lu, Yu
    Zhu, Biqing
    Zhao, Hongyu
    BIOMETRICS, 2023, 79 (02) : 940 - 950
  • [7] High-dimensional data clustering
    Bouveyron, C.
    Girard, S.
    Schmid, C.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 502 - 519
  • [8] Clustering High-Dimensional Data
    Masulli, Francesco
    Rovetta, Stefano
    CLUSTERING HIGH-DIMENSIONAL DATA, CHDD 2012, 2015, 7627 : 1 - 13
  • [9] RETRACTED: An Ensemble Clustering Approach (Consensus Clustering) for High-Dimensional Data (Retracted Article)
    Yan, Jingdong
    Liu, Wuwei
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [10] Clustering of High-Dimensional Data via Finite Mixture Models
    McLachlan, Geoff J.
    Baek, Jangsun
    ADVANCES IN DATA ANALYSIS, DATA HANDLING AND BUSINESS INTELLIGENCE, 2010, : 33 - +