Clustering and training set selection methods for improving the accuracy of quantitative laser induced breakdown spectroscopy

被引:37
作者
Anderson, Ryan B. [1 ]
Bell, James F., III [2 ]
Wiens, Roger C. [3 ]
Morris, Richard V. [4 ]
Clegg, Samuel M. [3 ]
机构
[1] Cornell Univ, Dept Astron, Ithaca, NY 14853 USA
[2] Arizona State Univ, Sch Earth & Space Explorat, Tempe, AZ 85287 USA
[3] Los Alamos Natl Lab, Los Alamos, NM 87545 USA
[4] NASA, Lyndon B Johnson Space Ctr, Houston, TX 77058 USA
关键词
Laser-induced breakdown spectroscopy; Mars; ChemCam; Multivariate analysis; SAMPLES; CALIBRATION;
D O I
10.1016/j.sab.2012.04.004
中图分类号
O433 [光谱学];
学科分类号
0703 ; 070302 ;
摘要
We investigated five clustering and training set selection methods to improve the accuracy of quantitative chemical analysis of geologic samples by laser induced breakdown spectroscopy (LIBS) using partial least squares (PLS) regression. The LIBS spectra were previously acquired for 195 rock slabs and 31 pressed powder geostandards under 7 Torr CO2 at a stand-off distance of 7 m at 17 RI per pulse to simulate the operational conditions of the ChemCam LIBS instrument on the Mars Science Laboratory Curiosity rover. The clustering and training set selection methods, which do not require prior knowledge of the chemical composition of the test-set samples, are based on grouping similar spectra and selecting appropriate training spectra for the partial least squares (PLS2) model. These methods were: (1) hierarchical clustering of the full set of training spectra and selection of a subset for use in training; (2) k-means clustering of all spectra and generation of PLS2 models based on the training samples within each cluster; (3) iterative use of PLS2 to predict sample composition and k-means clustering of the predicted compositions to subdivide the groups of spectra; (4) soft independent modeling of class analogy (SIMCA) classification of spectra, and generation of PLS2 models based on the training samples within each class; (5) use of Bayesian information criteria (BIC) to determine an optimal number of clusters and generation of PLS2 models based on the training samples within each cluster. The iterative method and the k-means method using 5 clusters showed the best performance, improving the absolute quadrature root mean squared error (RMSE) by similar to 3 wt.%. The statistical significance of these improvements was similar to 85%. Our results show that although clustering methods can modestly improve results, a large and diverse training set is the most reliable way to improve the accuracy of quantitative LIBS. In particular, additional sulfate standards and specifically fabricated analog samples with Mars-like compositions may improve the accuracy of ChemCam measurements on Mars. Refinement of the iterative method, modifications of the basic k-means clustering algorithm, and classification based on specifically selected S. C and Si emission lines may also prove beneficial and merit further study. Published by Elsevier B.V.
引用
收藏
页码:24 / 32
页数:9
相关论文
共 24 条
  • [1] The influence of multivariate analysis methods and target grain size on the accuracy of remote quantitative chemical analysis of rocks using laser induced breakdown spectroscopy
    Anderson, Ryan B.
    Morris, Richard V.
    Clegg, Samuel M.
    Bell, James F., III
    Wiens, Roger C.
    Humphries, Seth D.
    Mertzman, Stanley A.
    Graff, Trevor G.
    McInroy, Rhonda
    [J]. ICARUS, 2011, 215 (02) : 608 - 627
  • [2] [Anonymous], 2006, MCLUST VERSION 3 R N
  • [3] [Anonymous], 2003, Information Theory, Inference and Learning Algorithms, DOI 10.2277/0521642981
  • [4] New procedure for quantitative elemental analysis by laser-induced plasma spectroscopy
    Ciucci, A
    Corsi, M
    Palleschi, V
    Rastelli, S
    Salvetti, A
    Tognoni, E
    [J]. APPLIED SPECTROSCOPY, 1999, 53 (08) : 960 - 964
  • [5] Multivariate analysis of remote laser-induced breakdown spectroscopy spectra using partial least squares, principal component analysis, and related techniques
    Clegg, Samuel M.
    Sklute, Elizabeth
    Dyar, M. Darby
    Barefield, James E.
    Wiens, Roger C.
    [J]. SPECTROCHIMICA ACTA PART B-ATOMIC SPECTROSCOPY, 2009, 64 (01) : 79 - 88
  • [6] Laser induced breakdown spectroscopy library for the Martian environment
    Cousin, A.
    Forni, A.
    Maurice, S.
    Gasnault, O.
    Fabre, C.
    Sautter, V.
    Wiens, R. C.
    Mazoyer, J.
    [J]. SPECTROCHIMICA ACTA PART B-ATOMIC SPECTROSCOPY, 2011, 66 (11-12) : 805 - 814
  • [7] Screening analysis to detect adulterations in Brazilian gasoline samples using distillation curves
    de Oliveira, FS
    Teixeira, LSG
    Araujo, MCU
    Korn, M
    [J]. FUEL, 2004, 83 (7-8) : 917 - 923
  • [8] Dyar M.D., 2011, LUNAR PLANET SCI, V42, P1258
  • [9] Esbensen K.H., 2004, MULTIVARIATE DATA AN, V5th
  • [10] Govindaraju K., 1994, GEOSTANDARD NEWSLETT, V118, P1