The effect of principal component analysis on machine learning accuracy with high dimensional spectral data

被引:70
|
作者
Howley, T [1 ]
Madden, MG [1 ]
O'Connell, ML [1 ]
Ryder, AG [1 ]
机构
[1] Natl Univ Ireland Univ Coll Galway, Galway, Ireland
关键词
D O I
10.1007/1-84628-224-1_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the results of an investigation into the use of machine learning methods for the identification of narcotics from Raman spectra. The classification of spectral data and other high dimensional data, such as images, gene-expression data and spectral data, poses an interesting challenge to machine learning, as the presence of high numbers of redundant or highly correlated attributes can seriously degrade classification accuracy. This paper investigates the use of Principal Component Analysis (PCA) to reduce high dimensional spectral data and to improve the predictive performance of some well known machine learning methods. Experiments are carried out on a high dimensional spectral dataset. These experiments employ the NIPALS (Non-Linear Iterative Partial Least Squares) PCA method, a method that has been used in the field of chemometrics for spectral classification, and is a more efficient alternative than the widely used eigenvector decomposition approach. The experiments show that the use of this PCA method can improve the performance of machine learning in the classification of high dimensionsal data.
引用
收藏
页码:209 / +
页数:3
相关论文
共 50 条
  • [31] The learning-based principal component analysis technique in low resolution and high resolution spectral images
    Bochko, Vladimir
    Miyake, Yoichi
    Parkkinen, Jussi
    JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2008, 52 (03) : 0305041 - 0305048
  • [32] High-dimensional principal component analysis with heterogeneous missingness
    Zhu, Ziwei
    Wang, Tengyao
    Samworth, Richard J.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2022, 84 (05) : 2000 - 2031
  • [33] Tensor Principal Component Analysis in High Dimensional CP Models
    Han, Yuefeng
    Zhang, Cun-Hui
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (02) : 1147 - 1167
  • [34] High Dimensional Bayesian Optimization with Kernel Principal Component Analysis
    Antonov, Kirill
    Raponi, Elena
    Wang, Hao
    Doerr, Carola
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XVII, PPSN 2022, PT I, 2022, 13398 : 118 - 131
  • [35] Support vector machine and principal component analysis for microarray data classification
    Astuti, Widi
    Adiwijaya
    INTERNATIONAL CONFERENCE ON DATA AND INFORMATION SCIENCE (ICODIS), 2018, 971
  • [36] Face Recognition via Multi linear Principal Component Analysis and Two-Dimensional Extreme Learning Machine
    Zhang, Fan
    Qi, Lin
    Chen, Enqing
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2015, 12 (07) : 1138 - 1143
  • [37] Least angle sparse principal component analysis for ultrahigh dimensional data
    Xie, Yifan
    Wang, Tianhui
    Kim, Junyoung
    Lee, Kyungsik
    Jeong, Myong K.
    ANNALS OF OPERATIONS RESEARCH, 2024,
  • [38] Dimensional Reduction and Feature Selection: Principal Component Analysis for Data Mining
    Singh, Tulika
    Ghosh, Adarsh
    Khandelwal, Niranjan
    RADIOLOGY, 2017, 285 (03) : 1055 - 1055
  • [39] Enforcement of the principal component analysis–extreme learning machine algorithm by linear discriminant analysis
    A. Castaño
    F. Fernández-Navarro
    Annalisa Riccardi
    C. Hervás-Martínez
    Neural Computing and Applications, 2016, 27 : 1749 - 1760
  • [40] A Report on Uncorrelated Multi linear Principal Component Analysis Plus Extreme Learning Machine to Deal With Tensorial Data
    Zhang, Fan
    Fan, Yao-Ling
    Xu, Li
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2015, 12 (07) : 1258 - 1262