Adaptive local Principal Component Analysis improves the clustering of high-dimensional data

被引:5
作者
Migenda, Nico [1 ,3 ]
Moeller, Ralf [2 ]
Schenck, Wolfram [1 ]
机构
[1] Bielefeld Univ Appl Sci & Arts, Ctr Appl Data Sci CfADS, Bielefeld, Germany
[2] Bielefeld Univ, Fac Technol, Comp Engn Grp, Bielefeld, Germany
[3] Schulstr 10, D-33330 Gutersloh, Germany
关键词
High-dimensional clustering; Potential function; Adaptive learning rate; Ranking criteria; Neural network-based PCA; Mixture PCA; Local PCA; LEARNING ALGORITHM; DECOMPOSITION; CONVERGENCE;
D O I
10.1016/j.patcog.2023.110030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In local Principal Component Analysis (PCA), a distribution is approximated by multiple units, each repre-senting a local region by a hyper-ellipsoid obtained through PCA. We present an extension for local PCA which adaptively adjusts both the learning rate of each unit and the potential function which guides the competition between the local units. Our local PCA method is an online neural network method where unit centers and shapes are modified after the presentation of each data point. For several benchmark distributions, we demonstrate that our method improves the overall quality of clustering, especially for high-dimensional distributions where many conventional methods do not perform satisfactorily. Our online method is also well suited for the processing of streaming data: The two adaptive mechanisms lead to a quick reorganization of the clustering when the underlying distribution changes.
引用
收藏
页数:16
相关论文
共 38 条
  • [1] Bortz, 2010, STAT HUMAN SOZIALWIS
  • [2] Brand M, 2002, LECT NOTES COMPUT SC, V2350, P707
  • [3] Online Principal Component Analysis in High Dimension: Which Algorithm to Choose?
    Cardot, Herve
    Degras, David
    [J]. INTERNATIONAL STATISTICAL REVIEW, 2018, 86 (01) : 29 - 50
  • [4] AN ADAPTIVE LEARNING ALGORITHM FOR PRINCIPAL COMPONENT ANALYSIS
    CHEN, LH
    CHANG, SY
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (05): : 1255 - 1263
  • [5] Nonparametric pairwise multiple comparisons in independent groups using Dunn's test
    Dinno, Alexis
    [J]. STATA JOURNAL, 2015, 15 (01) : 292 - 300
  • [6] Du Ke-Lin, 2013, Neural Networks and Statistical Learning
  • [7] A comprehensive survey of clustering algorithms: State-of-the-art machine learning applications, taxonomy, challenges, and future research prospects
    Ezugwu, Absalom E.
    Ikotun, Abiodun M.
    Oyelade, Olaide O.
    Abualigah, Laith
    Agushaka, Jeffery O.
    Eke, Christopher I.
    Akinyelu, Andronicus A.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 110
  • [8] K-means properties on six clustering benchmark datasets
    Franti, Pasi
    Sieranoja, Sami
    [J]. APPLIED INTELLIGENCE, 2018, 48 (12) : 4743 - 4759
  • [9] Efficiency of random swap clustering
    Fränti P.
    [J]. Journal of Big Data, 5 (1)
  • [10] Centroid index: Cluster level similarity measure
    Franti, Pasi
    Rezaei, Mohammad
    Zhao, Qinpei
    [J]. PATTERN RECOGNITION, 2014, 47 (09) : 3034 - 3045