NGPCA: Clustering of high-dimensional and non-stationary data streams

被引:0
|
作者
Migenda, Nico [1 ]
Moeller, Ralf [2 ]
Schenck, Wolfram [1 ]
机构
[1] Bielefeld Univ Appl Sci, Fac Engn & Math, Ctr Appl Data Sci Gutersloh, Bielefeld, Germany
[2] Univ Bielefeld, Fac Technol, Comp Engn Grp, Bielefeld, Germany
关键词
NGPCA; Clustering; Data stream clustering; Local PCA; MATLAB;
D O I
10.1016/j.simpa.2024.100635
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Neural Gas Principal Component Analysis (NGPCA) is an online clustering algorithm. An NGPCA model is a mixture of local PCA units and combines dimensionality reduction with vector quantization. Recently, NGPCA has been extended with an adaptive learning rate and an adaptive potential function for accurate and efficient clustering of high-dimensional and non-stationary data streams. The algorithm achieved highly competitive results on clustering benchmark datasets compared to the state of the art. Our implementation of the algorithm was developed in MATLAB and is available as open source. This code can be easily applied to the clustering of stationary and non-stationary data.
引用
收藏
页数:3
相关论文
共 50 条
  • [1] Online density estimation over high-dimensional stationary and non-stationary data streams
    Majdara, Aref
    Nooshabadi, Saeid
    DATA & KNOWLEDGE ENGINEERING, 2019, 123
  • [2] Graded Possibilistic Clustering of Non-stationary Data Streams
    Abdullatif, A.
    Masulli, F.
    Rovetta, S.
    Cabri, A.
    FUZZY LOGIC AND SOFT COMPUTING APPLICATIONS, WILF 2016, 2017, 10147 : 139 - 150
  • [3] A Fuzzy Clustering Approach to Non-stationary Data Streams Learning
    Abdullatif, A.
    Masulli, F.
    Rovetta, S.
    Cabri, A.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 768 - 769
  • [4] Generalized projected clustering in high-dimensional data streams
    Wang, T
    FRONTIERS OF WWW RESEARCH AND DEVELOPMENT - APWEB 2006, PROCEEDINGS, 2006, 3841 : 772 - 778
  • [5] Parallel Clustering of High-Dimensional Social Media Data Streams
    Gao, Xiaoming
    Ferrara, Emilio
    Qiu, Judy
    2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 323 - 332
  • [6] Rank Aggregation for Non-stationary Data Streams
    Irurozki, Ekhine
    Perez, Aritz
    Lobo, Jesus
    Del Ser, Javier
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III, 2021, 12977 : 297 - 313
  • [7] Outlier Detection in Non-stationary Data Streams
    Tran, Luan
    Fan, Liyue
    Shahabi, Cyrus
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2019), 2019, : 25 - 36
  • [8] High-dimensional regimes of non-stationary Gaussian correlated Wishart matrices
    Bourguin, Solesne
    Thanh Dang
    RANDOM MATRICES-THEORY AND APPLICATIONS, 2022, 11 (01)
  • [9] Causal structure learning for high-dimensional non-stationary time series
    Chen, Siya
    Wu, HaoTian
    Jin, Guang
    KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [10] Concept-evolution detection in non-stationary data streams: a fuzzy clustering approach
    Poorya ZareMoodi
    Sajjad Kamali Siahroudi
    Hamid Beigy
    Knowledge and Information Systems, 2019, 60 : 1329 - 1352