Empirical Comparison Between Cross-Validation and Mutation-Validation in Model Selection

被引：0

作者：

Yu, Jinyang ^{[1
]}

Hamdan, Sami ^{[1
,2
]}

Sasse, Leonard ^{[1
,2
,6
]}

Morrison, Abigail ^{[3
,4
,5
]}

Patil, Kaustubh R. ^{[1
,2
]}

机构：

[1] Res Ctr Julich, Inst Neurosci & Med, Brain & Behav INM 7, Julich, Germany

[2] Heinrich Heine Univ Dusseldorf, Med Fac, Inst Syst Neurosci, Dusseldorf, Germany

[3] Res Ctr Julich, Inst Neurosciennce & Med INM 6, Julich, Germany

[4] Res Ctr Julich, Inst Adv Simulat IAS 6, Julich, Germany

[5] Rhein Westfal TH Aachen, Dept Comp Sci Software Engn 3, Aachen, Germany

[6] Max Planck Sch Cognit, Stephanstr 1a, Leipzig, Germany

来源：

ADVANCES IN INTELLIGENT DATA ANALYSIS XXII, PT II, IDA 2024 | 2024年 / 14642卷

关键词：

model selection; mutation validation; cross-validation;

D O I：

10.1007/978-3-031-58553-1_5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Mutation validation (MV) is a recently proposed approach for model selection, garnering significant interest due to its unique characteristics and potential benefits compared to the widely used cross-validation (CV) method. In this study, we empirically compared MV and k-fold CV using benchmark and real-world datasets. By employing Bayesian tests, we compared generalization estimates yielding three posterior probabilities: practical equivalence, CV superiority, and MV superiority. We also evaluated the differences in the capacity of the selected models and computational efficiency. We found that both MV and CV select models with practically equivalent generalization performance across various machine learning algorithms and the majority of benchmark datasets. MV exhibited advantages in terms of selecting simpler models and lower computational costs. However, in some cases MV selected overly simplistic models leading to underfitting and showed instability in hyperparameter selection. These limitations of MV became more evident in the evaluation of a real-world neuroscientific task of predicting sex at birth using brain functional connectivity.

引用

页码：56 / 67

页数：12

共 18 条

[11] MITCHELL T, 1989, ANNU REV COMPUT SCI, V4, P417
[12] pypi, xcpengine-container 1.0.1
[13] Raschka S, 2020, Arxiv, DOI arXiv:1811.12808
[14] Local-Global Parcellation of the Human Cerebral Cortex from Intrinsic Functional Connectivity MRI
Schaefer, Alexander
Kong, Ru
Gordon, Evan M.
Laumann, Timothy O.
Zuo, Xi-Nian
Holmes, Avram J.
Eickhoff, Simon B.
Yeo, B. T. Thomas
[J]. CEREBRAL CORTEX, 2018, 28 (09) : 3095 - 3114
[15] The Amsterdam Open MRI Collection, a set of multimodal MRI datasets for individual difference analyses
Snoek, Lukas
van der Miesen, Maite M.
Beemsterboer, Tinka
van der Leij, Andries
Eigenhuis, Annemarie
Steven Scholte, H.
[J]. SCIENTIFIC DATA, 2021, 8 (01)
[16] Vanschoren J., 2014, SIGKDD EXPLORATIONS, V15, P49, DOI [DOI 10.1145/2641190.2641198, 10.1145/ 2641190.2641198, 10.1145/2641190.2641198]
[17] Sex Classification by Resting State Brain Connectivity
Weis, Susanne
Patil, Kaustubh R.
Hoffstaedter, Felix
Nostro, Alessandra
Yeo, B. T. Thomas
Eickhoff, Simon B.
[J]. CEREBRAL CORTEX, 2020, 30 (02) : 824 - 835
[18] Model validation using mutated training labels: An exploratory study
Zhang, Jie M.
Harman, Mark
Guedj, Benjamin
Barr, Earl T.
Shawe-Taylor, John
[J]. NEUROCOMPUTING, 2023, 539

← 1 2 →