Dimensionality reduction of near infrared spectral data using global and local implementations of principal component analysis for neural network calibrations

被引:9
作者
Kovalenko, Igor V. [1 ]
Rippke, Glen R. [1 ]
Hurburgh, Charles R. [1 ]
机构
[1] Iowa State Univ, Dept Agr & Biosyst Engn, Ames, IA 50010 USA
关键词
near infrared; artificial neural networks; principal component analysis; cluster analysis; data compression; dimensionality reduction;
D O I
10.1255/jnirs.711
中图分类号
O69 [应用化学];
学科分类号
081704 ;
摘要
The artificial neural network (ANN) learning algorithm is a strong alternative to traditional linear calibration methods used in near infrared (NIR) spectroscopy. The generalisation capacity of ANN can be effectively employed only when the ratio of available training samples to a number of neuron interconnection weights and biases (unknown regression parameters) is large. Traditionally, this ratio is increased by compressing X data to fewer variables using principal component analysis (PCA). An alternative method, local PCA compression, has been described in the literature. This approach overcomes the global linearity of PCA by performing dimensionality reduction in two steps: division of the data space into clusters and local compression of each cluster using PCA. This study compared global and local implementations of PCA compression in NIR calibration problems solved with ANN regression. Three data sets were used for development of control,(based on PCA) and experimental (based on local PCA) ANN calibrations. The results demonstrated that local PCA could outperform traditional global PCA compression. However, the best dimensionality reduction method was case-dependent. Performance of local PCA-based calibrations degraded rapidly as compression rate increased, while global PCA allowed higher compression at minimal cost of prediction accuracy.
引用
收藏
页码:21 / 28
页数:8
相关论文
共 23 条
[1]  
*AACC, 2000, APPR MATH AM ASS CER
[2]  
[Anonymous], Pattern Recognition With Fuzzy Objective Function Algorithms
[3]  
AOCS, 1997, OFF METH REC PRACT A
[4]  
Archer C, 2000, NEURAL NETWORKS FOR SIGNAL PROCESSING X, VOLS 1 AND 2, PROCEEDINGS, P308, DOI 10.1109/NNSP.2000.889422
[5]   OPTIMAL MINIMAL NEURAL INTERPRETATION OF SPECTRA [J].
BORGGAARD, C ;
THODBERG, HH .
ANALYTICAL CHEMISTRY, 1992, 64 (05) :545-551
[6]  
BORGGAARD C, 2001, NEAR INFRARED TECHNO, P101
[7]   Performance of European artificial neural network (ANN) calibrations for moisture and protein in cereals using the Danish near-infrared transmission (NIT) network [J].
Büchmann, NB ;
Josefsson, H ;
Cowe, IA .
CEREAL CHEMISTRY, 2001, 78 (05) :572-577
[8]   Prediction of drug content and hardness of intact tablets using artificial neural network and near-infrared spectroscopy [J].
Chen, YX ;
Thosar, SS ;
Forbess, RA ;
Kemper, MS ;
Rubinovitz, RL ;
Shukla, AJ .
DRUG DEVELOPMENT AND INDUSTRIAL PHARMACY, 2001, 27 (07) :623-631
[9]   Improved class prediction in DNA microarray gene expression data by unsupervised reduction of the dimensionality followed by supervised learning with a perceptron [J].
Conde, L ;
Mateos, A ;
Herrero, J ;
Dopazo, J .
JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2003, 35 (03) :245-253
[10]  
Dunn J. C., 1973, Journal of Cybernetics, V3, P32, DOI 10.1080/01969727308546046