Performance Analysis of Data Mining Algorithms Based on PCA

Cited by: 0
Authors
Bai, Ruifeng [1]
Wang, Jie [2]
Yang, Lin [2]
Pan, Jingchang [2]
Affiliations
[1] Shandong Univ, Coll Business, Weihai 264209, Peoples R China
[2] Shandong Univ, School Mech Elect & Informat Engn, Weihai 264209, Peoples R China
Source
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING | 2015 / Vol. 8
Keywords
PCA; Classification; Clustering; Spectrum; Cataclysmic Variable Star; Digital Sky Survey; Cataclysmic Variables
DOI
Not available
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
Data mining algorithms behave differently in different application contexts, so characterizing the relevant algorithms is an important topic. This paper studies PCA-based dimension reduction and the performance of four data mining algorithms (ANN, Bayes, KNN, and K-means) under different dimension reduction rates when searching for cataclysmic variable stars (CVs) in a mixed dataset of celestial spectra. The dataset consists of 1417 spectra selected from the SDSS (Sloan Digital Sky Survey); 15 of them are CVs, and the rest are other types of celestial bodies. A series of experiments analyzes in detail the classification accuracy and time cost of the four algorithms at different PCA dimensions. The results help clarify the inherent characteristics of the four algorithms and support better choices in future data mining applications.
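The kind of experiment the abstract describes (reduce the data with PCA, then measure a classifier's accuracy and run time at several retained dimensions) can be sketched as follows. This is a minimal illustration and not the authors' code: the data are synthetic and balanced rather than SDSS spectra, only KNN is shown, PCA is computed by power iteration with deflation, and every function name, parameter, and value below is an assumption made for the example.

```python
import random
import time


def mean_center(rows):
    """Return the mean-centered rows and the per-feature means."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows], means


def covariance(centered):
    """Sample covariance matrix of already centered rows."""
    n, d = len(centered), len(centered[0])
    return [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(d)]
            for i in range(d)]


def matvec(m, v):
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def top_components(cov, k, iters=300):
    """Approximate the k leading eigenvectors (power iteration + deflation)."""
    d = len(cov)
    cov = [row[:] for row in cov]  # work on a copy
    rng = random.Random(0)
    comps = []
    for _ in range(k):
        v = [rng.gauss(0, 1) for _ in range(d)]
        norm = sum(x * x for x in v) ** 0.5
        v = [x / norm for x in v]
        for _ in range(iters):
            w = matvec(cov, v)
            norm = sum(x * x for x in w) ** 0.5
            if norm == 0.0:
                break
            v = [x / norm for x in w]
        lam = sum(a * b for a, b in zip(v, matvec(cov, v)))  # Rayleigh quotient
        comps.append(v)
        for i in range(d):  # deflate: remove the component just found
            for j in range(d):
                cov[i][j] -= lam * v[i] * v[j]
    return comps


def project(rows, means, comps):
    """Project rows onto the given principal components."""
    return [[sum((r[j] - means[j]) * c[j] for j in range(len(c))) for c in comps]
            for r in rows]


def knn_predict(train_x, train_y, x, k=3):
    """Majority vote among the k nearest training points (squared Euclidean)."""
    dists = sorted((sum((a - b) ** 2 for a, b in zip(t, x)), y)
                   for t, y in zip(train_x, train_y))
    votes = [y for _, y in dists[:k]]
    return max(set(votes), key=votes.count)


def make_data(n_per_class=100, seed=1):
    """Hypothetical two-class data: the classes differ in the first two features."""
    rng = random.Random(seed)
    stds = [1.0, 1.0, 0.8, 0.6, 0.4, 0.2]
    rows, labels = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            row = [rng.gauss(0, s) for s in stds]
            row[0] += 4.0 * label
            row[1] += 4.0 * label
            rows.append(row)
            labels.append(label)
    order = list(range(len(rows)))
    rng.shuffle(order)
    return [rows[i] for i in order], [labels[i] for i in order]


def evaluate(dims=(1, 2, 4, 6)):
    """Return {n_components: (accuracy, seconds)} for KNN on PCA-reduced data."""
    x, y = make_data()
    split = int(0.7 * len(x))
    train_x, test_x, train_y, test_y = x[:split], x[split:], y[:split], y[split:]
    centered, means = mean_center(train_x)
    comps = top_components(covariance(centered), max(dims))
    results = {}
    for k in dims:
        start = time.perf_counter()  # times projection plus prediction
        tr = project(train_x, means, comps[:k])
        te = project(test_x, means, comps[:k])
        preds = [knn_predict(tr, train_y, p) for p in te]
        elapsed = time.perf_counter() - start
        acc = sum(p == t for p, t in zip(preds, test_y)) / len(test_y)
        results[k] = (acc, elapsed)
    return results


if __name__ == "__main__":
    for k, (acc, secs) in evaluate().items():
        print(f"{k} components: accuracy={acc:.3f}, time={secs:.4f}s")
```

The principal components are fitted on the training split only, so the test spectra do not influence the projection. A dataset like the one in the abstract (15 CVs among 1417 spectra) is heavily imbalanced, so overall accuracy alone would likely be a weak measure there; per-class recall would be a natural addition to this sketch.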
Pages: 1506-1509
Number of pages: 4
Related papers
9 items in total
  • [1] ARTHUR D, 2006, P 2006 S COMP GEOM S
  • [2] BIAN Zhao-qi, 2000, Pattern recognition
  • [3] Deng Shibing, 1994, PROGR ASTRONOMY, V12, P229
  • [4] Deng Shibing, 1994, PROGR ASTRONOMY, V12, P42
  • [5] Qin DM, 2003, SPECTROSC SPECT ANAL, V23, P182
  • [6] Rennie J.D., 2003, P 20 INT C MACHINE L
  • [7] Cataclysmic variables from Sloan Digital Sky Survey. V. The fifth year (2004)
    Szkody, P
    Henden, A
    Agüeros, M
    Anderson, SF
    Bochanski, JJ
    Knapp, GR
    Mannikko, L
    Mukadam, A
    Silvestri, NM
    Schmidt, GD
    Stephanik, B
    Watson, TK
    West, AA
    Winget, D
    Wolfe, MA
    Barentine, JC
    Brinkmann, J
    Brewington, HJ
    Downes, RA
    Harvanek, M
    Kleinman, SJ
    Krzesinski, J
    Long, D
    Neilsen, EH
    Nitta, A
    Schneider, DP
    Snedden, SA
    Voges, W
    [J]. ASTRONOMICAL JOURNAL, 2006, 131 (02) : 973 - 983
  • [8] Cataclysmic variables from Sloan Digital Sky Survey. IV. The fourth year (2003)
    Szkody, P
    Henden, A
    Fraser, OJ
    Silvestri, NM
    Schmidt, GD
    Bochanski, JJ
    Wolfe, MA
    Agüeros, M
    Anderson, SF
    Mannikko, L
    Downes, RA
    Schneider, DP
    Brinkmann, J
    [J]. ASTRONOMICAL JOURNAL, 2005, 129 (05) : 2386 - 2399
  • [9] Xue J Q, 1999, THESIS CHINESE ACAD