Procrustes Cross-Validation-A Bridge between Cross-Validation and Independent Validation Sets

被引:32
作者
Kucheryavskiy, Sergey [3 ]
Zhilin, Sergei [1 ]
Rodionova, Oxana [2 ]
Pomerantsev, Alexey [2 ]
机构
[1] CSort Ltd, Germana Titova St 7, Barnaul 656023, Russia
[2] RAS, Semenov Fed Res Ctr Chem Phys, Moscow 119991, Russia
[3] Aalborg Univ, Dept Chem & Biosci, DK-6700 Esbjerg, Denmark
关键词
CALIBRATION MODELS; CLASSIFICATION;
D O I
10.1021/acs.analchem.0c02175
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In this paper, we propose a new approach for validation of chemometric models. It is based on k-fold cross-validation algorithm, but in contrast to conventional cross-validation, our approach makes it possible to create a new dataset, which carries sampling uncertainty estimated by the cross-validation procedure. This dataset, called a pseudo-validation set, can be used similar to an independent test set, giving a possibility to compute residual distances, explained variance, scores, and other results, which cannot be obtained in the conventional cross-validation. The paper describes theoretical details of the proposed approach and its implementation as well as presents experimental results obtained using simulated and real chemical datasets.
引用
收藏
页码:11842 / 11850
页数:9
相关论文
共 19 条