Procrustes Cross-Validation: A Bridge between Cross-Validation and Independent Validation Sets

Cited by: 38
Authors
Kucheryavskiy, Sergey [3]
Zhilin, Sergei [1]
Rodionova, Oxana [2]
Pomerantsev, Alexey [2]
Affiliations
[1] CSort Ltd, Germana Titova St 7, Barnaul 656023, Russia
[2] RAS, Semenov Fed Res Ctr Chem Phys, Moscow 119991, Russia
[3] Aalborg Univ, Dept Chem & Biosci, DK-6700 Esbjerg, Denmark
Keywords
CALIBRATION MODELS; CLASSIFICATION
DOI
10.1021/acs.analchem.0c02175
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Subject classification codes
070302; 081704
Abstract
In this paper, we propose a new approach for the validation of chemometric models. It is based on the k-fold cross-validation algorithm but, in contrast to conventional cross-validation, it makes it possible to create a new dataset that carries the sampling uncertainty estimated by the cross-validation procedure. This dataset, called a pseudo-validation set, can be used like an independent test set, making it possible to compute residual distances, explained variance, scores, and other results that cannot be obtained with conventional cross-validation. The paper describes the theoretical details of the proposed approach and its implementation, and presents experimental results obtained using simulated and real chemical datasets.
Pages: 11842-11850
Page count: 9