Cluster-scaled principal component analysis

被引:6
作者
Sato-Ilic, Mika [1 ]
机构
[1] Univ Tsukuba, Fac Engn Informat & Syst, Tsukuba, Ibaraki, Japan
基金
日本学术振兴会;
关键词
cluster scale; dimension reduction; fuzzy clustering; high-dimension low-sample size data; principal component analysis; GEOMETRIC REPRESENTATION; HIGH-DIMENSION; FUZZY; VALIDITY;
D O I
10.1002/wics.1572
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Cluster-scaled analysis means exploiting the cluster-based scaling to conventional data analysis to obtain more accurate results or results that we cannot obtain by using ordinary analysis. Our target data is complex and large amounts of data. For this type of data, it is well known that ordinary statistical methods do not always work well, or theoretically, we know that we cannot obtain a correct result. As a tool of this implementation, we utilize fuzzy clustering, which is well known as a robust clustering to a complex and large amount of data. That is, we use the fuzzy clustering result as a scale of data and apply the rescaled data by the cluster-scale to another target analysis. Our target analysis in this article is principal component analysis, which is a well-known dimensional reduction method. A numerical example shows a better performance of the cluster-scaled principal component analysis. This article is categorized under: Statistical Learning and Exploratory Methods of the Data Sciences > Knowledge Discovery Statistical Learning and Exploratory Methods of the Data Sciences > Modeling Methods Statistical Learning and Exploratory Methods of the Data Sciences > Clustering and Classification
引用
收藏
页数:13
相关论文
共 66 条
[1]   The high-dimension, low-sample-size geometric representation holds under mild conditions [J].
Ahn, Jeongyoun ;
Marron, J. S. ;
Muller, Keith M. ;
Chi, Yueh-Yun .
BIOMETRIKA, 2007, 94 (03) :760-766
[2]  
Ai-Ping L., 2007, J CONVERG INF TECHNO, V2, P83
[3]  
[Anonymous], 2012, 2012 IEEE WORLD C CO, P981
[4]  
[Anonymous], 1995, INT J INTELLIGENT AU
[5]   MAPCLUS - A MATHEMATICAL-PROGRAMMING APPROACH TO FITTING THE ADCLUS MODEL [J].
ARABIE, P ;
CARROLL, JD .
PSYCHOMETRIKA, 1980, 45 (02) :211-235
[6]   Nonlinear subspace clustering using curvature constrained distances [J].
Babaeian, Amir ;
Babaee, Mohammadreaza ;
Bayestehtashk, Alireza ;
Bandarabadi, Mojtaba .
PATTERN RECOGNITION LETTERS, 2015, 68 :118-125
[7]   A CLUSTERING PERFORMANCE-MEASURE BASED ON FUZZY SET DECOMPOSITION [J].
BACKER, E ;
JAIN, AK .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1981, 3 (01) :66-75
[8]   Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices [J].
Baik, J ;
Ben Arous, G ;
Péché, S .
ANNALS OF PROBABILITY, 2005, 33 (05) :1643-1697
[9]  
Bezdek J. C., 1981, Pattern recognition with fuzzy objective function algorithms
[10]  
Bezdek J. C., 1978, Fuzzy Sets and Systems, V1, P111, DOI 10.1016/0165-0114(78)90012-X