Aggregated functional data model for near-infrared spectroscopy calibration and prediction

被引:9
作者
Dias, Ronaldo [1 ]
Garcia, Nancy L. [1 ]
Ludwig, Guilherme [1 ]
Saraiva, Marley A. [2 ]
机构
[1] Univ Estadual Campinas, Campinas, SP, Brazil
[2] Univ Fed Goias, Inst Matemat & Estat, BR-74000197 Goiania, Go, Brazil
关键词
B-splines; square error of prediction; leave-one-out jackknife; REGRESSION;
D O I
10.1080/02664763.2014.938224
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Calibration and prediction for NIR spectroscopy data are performed based on a functional interpretation of the Beer-Lambert formula. Considering that, for each chemical sample, the resulting spectrum is a continuous curve obtained as the summation of overlapped absorption spectra from each analyte plus a Gaussian error, we assume that each individual spectrum can be expanded as a linear combination of B-splines basis. Calibration is then performed using two procedures for estimating the individual analytes' curves: basis smoothing and smoothing splines. Prediction is done by minimizing the square error of prediction. To assess the variance of the predicted values, we use a leave-one-out jackknife technique. Departures from the standard error models are discussed through a simulation study, in particular, how correlated errors impact on the calibration step and consequently on the analytes' concentration prediction. Finally, the performance of our methodology is demonstrated through the analysis of two publicly available datasets.
引用
收藏
页码:127 / 143
页数:17
相关论文
共 27 条
[1]   REPRESENTATION OF SPECTRA BY CONTINUOUS-FUNCTIONS [J].
ALSBERG, BK .
JOURNAL OF CHEMOMETRICS, 1993, 7 (03) :177-193
[2]   Semi-functional partial linear regression [J].
Aneiros-Perez, German ;
Vieu, Philippe .
STATISTICS & PROBABILITY LETTERS, 2006, 76 (11) :1102-1110
[3]  
[Anonymous], 1999, WILEY SER PROB STAT
[4]   OPTIMAL MINIMAL NEURAL INTERPRETATION OF SPECTRA [J].
BORGGAARD, C ;
THODBERG, HH .
ANALYTICAL CHEMISTRY, 1992, 64 (05) :545-551
[5]   Statistical modeling: The two cultures [J].
Breiman, L .
STATISTICAL SCIENCE, 2001, 16 (03) :199-215
[6]  
Brereton R.G., 2003, Chemometrics: Data Analysis for the Laboratory and Chemical Plant
[7]  
Burns D.A., 2007, Handbook of near-infrared analysis
[8]  
Chen T, 2007, CHEMOMETR INTELL LAB, V87, P59, DOI 10.1016/j.chemolab.2006.09.004
[9]  
De Boor C., 2001, Applied Mathematical Sciences, DOI DOI 10.1007/978-1-4612-6333-3
[10]   Sequential adaptive nonparametric regression via H-splines [J].
Dias, R .
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1999, 28 (02) :501-515