Clustering High-Dimensional Data via Random Sampling and Consensus

Cited by: 0

Authors:

Traganitis, Panagiotis A. [1]

Slavakis, Konstantinos

Giannakis, Georgios B.

Affiliation:

[1] Univ Minnesota, Dept ECE, Minneapolis, MN 55455 USA

Source:

2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP) | 2014

Keywords:

Clustering; high-dimensional data; feature selection; random sampling and consensus; K-means; FEATURE-SELECTION; ALGORITHMS;

DOI:

Not available

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

In response to the urgent need for learning tools tuned to big data analytics, the present paper introduces a feature selection approach to efficient clustering of high-dimensional vectors. The resultant method leverages random sampling and consensus (RANSAC) arguments, originally developed for robust regression tasks in computer vision, to yield novel dimensionality reduction schemes. The advocated random sampling and consensus K-means (RSC-Kmeans) algorithm can operate in either batch or sequential modes, with the latter being able to afford lower computational footprint than the former. Extensive numerical tests on synthetic and real datasets highlight the potential of the proposed algorithms, and demonstrate their competitive performance relative to state-of-the-art random projection alternatives.


Pages: 307-311

Number of pages: 5

