Cross-validation failure: Small sample sizes lead to large error bars

Cited by: 383
Author
Varoquaux, Gael [1, 2, 3]
Affiliations
[1] INRIA Saclay Ile France, Parietal Project Team, Palaiseau, France
[2] CEA, Neurospin Bat 145, F-91191 Gif Sur Yvette, France
[3] Univ Paris Saclay, Saclay, France
Keywords
Cross-validation; Statistics; Decoding; fMRI; Model selection; MVPA; Biomarkers; VENTRAL TEMPORAL CORTEX; VOXEL PATTERN-ANALYSIS; BRAIN ACTIVITY; FMRI; CLASSIFICATION; CONNECTIVITY; CLASSIFIERS; PREDICTION; BIOMARKERS; MODEL;
DOI
10.1016/j.neuroimage.2017.06.061
Chinese Library Classification number
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
Predictive models ground many state-of-the-art developments in statistical brain image analysis: decoding, MVPA, searchlight, or extraction of biomarkers. The principled approach to establish their validity and usefulness is cross-validation, testing prediction on unseen data. Here, I would like to raise awareness of error bars of cross-validation, which are often underestimated. Simple experiments show that sample sizes of many neuroimaging studies inherently lead to large error bars, e.g. ±10% for 100 samples. The standard error across folds strongly underestimates them. These large error bars compromise the reliability of conclusions drawn with predictive models, such as biomarkers or methods developments where, unlike with cognitive neuroimaging MVPA approaches, more samples cannot be acquired by repeating the experiment across many subjects. Solutions to increase sample size must be investigated, tackling possible increases in heterogeneity of the data.
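The order of magnitude quoted in the abstract for 100 samples can be checked with a back-of-the-envelope calculation. The sketch below is illustrative only and is not taken from the paper's experiments: it assumes the test predictions behave as independent Bernoulli trials, assumes an accuracy near chance (0.5), and uses the normal approximation to the binomial distribution. The function name `binomial_half_width` is hypothetical.

```python
import math


def binomial_half_width(n_samples, accuracy=0.5, z=1.96):
    """Half-width of an approximate 95% interval on a measured accuracy.

    Assumption (for illustration): the n_samples test predictions are
    independent Bernoulli trials, so the normal approximation to the
    binomial distribution applies.
    """
    return z * math.sqrt(accuracy * (1.0 - accuracy) / n_samples)


if __name__ == "__main__":
    # Print the half-width, in percentage points, for a few sample sizes.
    for n in (30, 100, 1000):
        print(n, round(100 * binomial_half_width(n), 1))
```

Under these assumptions the half-width shrinks only as the inverse square root of the number of test samples, so a tenfold increase in sample size narrows the interval by a factor of about three.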
Pages: 68-77
Number of pages: 10