A practical approach for near infrared spectral quantitative analysis of complex samples using partial least squares modeling

被引:7
作者
Liu ZhiChao [1 ]
Ma Xiang [2 ]
Wen YaDong [2 ]
Wang Yi [2 ]
Cai WenSheng [1 ]
Shao XueGuang [1 ]
机构
[1] Nankai Univ, Coll Chem, Res Ctr Analyt Sci, Tianjin 300071, Peoples R China
[2] Hongta Grp, R&D Ctr, Yuxi 653100, Peoples R China
来源
SCIENCE IN CHINA SERIES B-CHEMISTRY | 2009年 / 52卷 / 07期
基金
中国国家自然科学基金;
关键词
number of latent variables; partial least squares (PLS) regression; near-infrared (NIR) spectroscopy; tobacco lamina; cross validation (CV); VARIABLE-SELECTION; MULTIVARIATE CALIBRATION; CROSS-VALIDATION; ERROR RATE; REGRESSION; COMPONENTS; NUMBER;
D O I
10.1007/s11426-009-0110-3
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The number of latent variables (LVs) or the factor number is a key parameter in PLS modeling to obtain a correct prediction. Although lots of work have been done on this issue, it is still a difficult task to determine a suitable LV number in practical uses. A method named independent factor diagnostics (IFD) is proposed for investigation of the contribution of each LV to the predicted results on the basis of discussion about the determination of LV number in PLS modeling for near infrared (NIR) spectra of complex samples. The NIR spectra of three data sets of complex samples, including a public data set and two tobacco lamina ones, are investigated. It is shown that several high order LVs constitute main contributions to the predicted results, albeit the contribution of the low order LVs should not be neglected in the PLS models. Therefore, in practical uses of PLS for analysis of complex samples, it may be better to use a slightly large LV number for NIR spectral analysis of complex samples.
引用
收藏
页码:1021 / 1027
页数:7
相关论文
共 35 条
  • [1] RELATIONSHIP BETWEEN VARIABLE SELECTION AND DATA AUGMENTATION AND A METHOD FOR PREDICTION
    ALLEN, DM
    [J]. TECHNOMETRICS, 1974, 16 (01) : 125 - 127
  • [2] [Anonymous], 1988, J. Chemom, DOI [10.1002/cem.1180020106, DOI 10.1002/CEM.1180020106]
  • [3] [Anonymous], 1966, Research Papers in Statistics: Festschrift for J. Neyman
  • [4] Cross-validation as the objective function for variable-selection techniques
    Baumann, K
    [J]. TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2003, 22 (06) : 395 - 406
  • [5] A systematic evaluation of the benefits and hazards of variable selection in latent variable regression. Part I. Search algorithm, theory and simulations
    Baumann, K
    Albert, H
    von Korff, M
    [J]. JOURNAL OF CHEMOMETRICS, 2002, 16 (07) : 339 - 350
  • [6] Breiman L., 1984, BIOMETRICS, V40, P874, DOI 10.1201/9781315139470
  • [7] BRERETON RG, 2003, CHEMOMETRICS DATA AN, P421
  • [8] *CAMO AS, 1996, UNSCR 6 0 US GUID, P75
  • [9] Multivariate calibration of analytical signals by WILMA (wavelet interface to linear modelling analysis)
    Cocchi, M
    Seeber, R
    Ulrici, A
    [J]. JOURNAL OF CHEMOMETRICS, 2003, 17 (8-9) : 512 - 527
  • [10] A strategy for selecting calibration samples for multivariate modelling
    Dantas, HA
    Galvao, RKH
    Araújo, MCU
    da Silva, EC
    Saldanha, TCB
    José, GE
    Pasquini, C
    Raimundo, IM
    Rohwedder, JJR
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2004, 72 (01) : 83 - 91