Empirical Comparison Between Cross-Validation and Mutation-Validation in Model Selection

Authors
Yu, Jinyang [1]
Hamdan, Sami [1, 2]
Sasse, Leonard [1, 2, 6]
Morrison, Abigail [3, 4, 5]
Patil, Kaustubh R. [1, 2]
Affiliations
[1] Res Ctr Julich, Inst Neurosci & Med, Brain & Behav INM 7, Julich, Germany
[2] Heinrich Heine Univ Dusseldorf, Med Fac, Inst Syst Neurosci, Dusseldorf, Germany
[3] Res Ctr Julich, Inst Neurosci & Med INM 6, Julich, Germany
[4] Res Ctr Julich, Inst Adv Simulat IAS 6, Julich, Germany
[5] Rhein Westfal TH Aachen, Dept Comp Sci Software Engn 3, Aachen, Germany
[6] Max Planck Sch Cognit, Stephanstr 1a, Leipzig, Germany
Source
ADVANCES IN INTELLIGENT DATA ANALYSIS XXII, PT II, IDA 2024 | 2024 / Vol. 14642
Keywords
model selection; mutation validation; cross-validation
DOI
10.1007/978-3-031-58553-1_5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Mutation validation (MV) is a recently proposed approach for model selection, garnering significant interest due to its unique characteristics and potential benefits compared to the widely used cross-validation (CV) method. In this study, we empirically compared MV and k-fold CV using benchmark and real-world datasets. By employing Bayesian tests, we compared generalization estimates yielding three posterior probabilities: practical equivalence, CV superiority, and MV superiority. We also evaluated the differences in the capacity of the selected models and computational efficiency. We found that both MV and CV select models with practically equivalent generalization performance across various machine learning algorithms and the majority of benchmark datasets. MV exhibited advantages in terms of selecting simpler models and lower computational costs. However, in some cases MV selected overly simplistic models leading to underfitting and showed instability in hyperparameter selection. These limitations of MV became more evident in the evaluation of a real-world neuroscientific task of predicting sex at birth using brain functional connectivity.
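The abstract does not say which Bayesian test was used to obtain the three posterior probabilities. As a rough illustration only, the sketch below uses the Bayesian correlated t-test, one common choice for comparing two procedures on the same resampled test sets. The function name, the region of practical equivalence (ROPE) half-width of 0.01, and the score differences are all hypothetical and are not taken from the paper.

```python
import numpy as np
from scipy import stats


def bayesian_correlated_ttest(diffs, test_train_ratio, rope=0.01):
    """Return (P(CV better), P(practically equivalent), P(MV better)).

    diffs: per-split score differences, assumed here to be score_MV - score_CV.
    test_train_ratio: n_test / n_train, used to correct the variance for the
        overlap between training sets of different splits.
    rope: half-width of the region of practical equivalence (an assumption).
    Assumes the differences are not all identical (non-zero variance).
    """
    diffs = np.asarray(diffs, dtype=float)
    n = diffs.size
    mean = diffs.mean()
    var = diffs.var(ddof=1)
    # Posterior of the mean difference: Student t with n - 1 degrees of freedom.
    scale = np.sqrt((1.0 / n + test_train_ratio) * var)
    posterior = stats.t(df=n - 1, loc=mean, scale=scale)
    p_cv = posterior.cdf(-rope)            # difference below the ROPE
    p_mv = 1.0 - posterior.cdf(rope)       # difference above the ROPE
    p_rope = 1.0 - p_cv - p_mv             # difference inside the ROPE
    return p_cv, p_rope, p_mv


# Hypothetical accuracy differences from 10 splits with a 90/10 train/test ratio.
example_diffs = [0.002, -0.001, 0.000, 0.003, -0.002,
                 0.001, -0.003, 0.000, 0.001, -0.001]
p_cv, p_rope, p_mv = bayesian_correlated_ttest(example_diffs, test_train_ratio=1 / 9)
print(f"P(CV better)={p_cv:.3f}  P(equivalent)={p_rope:.3f}  P(MV better)={p_mv:.3f}")
```

With differences this small relative to the ROPE, almost all posterior mass falls inside the equivalence region, which is the kind of outcome the abstract describes as "practically equivalent" generalization performance.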
Pages: 56-67
Page count: 12