Cluster-level statistical inference in fMRI datasets: The unexpected behavior of random fields in high dimensions

被引:26
作者
Bansal, Ravi [1 ,2 ]
Peterson, Bradley S. [1 ,3 ]
机构
[1] Childrens Hosp Los Angeles, Inst Developing Mind, Los Angeles, CA 90027 USA
[2] Univ Southern Calif, Keck Sch Med, Dept Pediat, Los Angeles, CA 90033 USA
[3] Univ Southern Calif, Keck Sch Med, Dept Psychiat, Los Angeles, CA 90033 USA
关键词
Random fields; Cluster level inference; Permutation testing; Parametric testing; Family wise error rates; Euler characteristics; DEFICIT HYPERACTIVITY DISORDER; ATTENTION-DEFICIT/HYPERACTIVITY DISORDER; VOXEL-BASED MORPHOMETRY; GENERAL LINEAR-MODEL; FALSE DISCOVERY RATE; HUMAN BRAIN; FUNCTIONAL SPECIFICITY; CORTICAL THICKNESS; PERMUTATION TESTS; TOURETTE-SYNDROME;
D O I
10.1016/j.mri.2018.01.004
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Identifying regional effects of interest in MRI datasets usually entails testing a priori hypotheses across many thousands of brain voxels, requiring control for false positive findings in these multiple hypotheses testing. Recent studies have suggested that parametric statistical methods may have incorrectly modeled functional MRI data, thereby leading to higher false positive rates than their nominal rates. Nonparametric methods for statistical inference when conducting multiple statistical tests, in contrast, are thought to produce false positives at the nominal rate, which has thus led to the suggestion that previously reported studies should reanalyze their fMRI data using nonparametric tools. To understand better why parametric methods may yield excessive false positives, we assessed their performance when applied both to simulated datasets of 1D, 2D, and 3D Gaussian Random Fields (GRFs) and to 710 real-world, resting-state fMRI datasets. We showed that both the simulated 2D and 3D GRFs and the real-world data contain a small percentage ( < 6%) of very large clusters (on average 60 times larger than the average cluster size), which were not present in 1D GRFs. These unexpectedly large clusters were deemed statistically significant using parametric methods, leading to empirical familywise error rates (FWERs) as high as 65%: the high empirical FWERs were not a consequence of parametric methods failing to model spatial smoothness accurately, but rather of these very large clusters that are inherently present in smooth, high-dimensional random fields. In fact, when discounting these very large clusters, the empirical FWER for parametric methods was 3.24%. Furthermore, even an empirical FWER of 65% would yield on average less than one of those very large clusters in each brain-wide analysis. Nonparametric methods, in contrast, estimated distributions from those large clusters, and therefore, by construct rejected the large clusters as false positives at the nominal FWERs. Those rejected clusters were outlying values in the distribution of cluster size but cannot be distinguished from true positive findings without further analyses, including assessing whether fMRI signal in those regions correlates with other clinical, behavioral, or cognitive measures. Rejecting the large clusters, however, significantly reduced the statistical power of nonparametric methods in detecting true findings compared with parametric methods, which would have detected most true findings that are essential for making valid biological inferences in MRI data. Parametric analyses, in contrast, detected most true findings while generating relatively few false positives: on average, less than one of those very large clusters would be deemed a true finding in each brainwide analysis. We therefore recommend the continued use of parametric methods that model nonstationary smoothness for cluster-level, familywise control of false positives, particularly when using a Cluster Defining Threshold of 2.5 or higher, and subsequently assessing rigorously the biological plausibility of the findings, even for large clusters. Finally, because nonparametric methods yielded a large reduction in statistical power to detect true positive findings, we conclude that the modest reduction in false positive findings that nonparametric analyses afford does not warrant a re-analysis of previously published fMRI studies using nonparametric techniques.
引用
收藏
页码:101 / 115
页数:15
相关论文
共 87 条
[11]   SPM: A history [J].
Ashburner, John .
NEUROIMAGE, 2012, 62 (02) :791-800
[12]   Evidence for neuroplastic compensation in the cerebral cortex of persons with depressive illness [J].
Bansal, R. ;
Hellerstein, D. J. ;
Peterson, B. S. .
MOLECULAR PSYCHIATRY, 2018, 23 (02) :375-383
[13]   Statistical analyses of brain surfaces using Gaussian random fields on 2-D manifolds [J].
Bansal, Ravi ;
Staib, Lawrence H. ;
Xu, Dongrong ;
Zhu, Hongtu ;
Peterson, Bradley S. .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2007, 26 (01) :46-57
[14]   Serotonin signaling modulates the effects of familial risk for depression on cortical thickness [J].
Bansal, Ravi ;
Peterson, Bradley S. ;
Gingrich, Jay ;
Hao, Xuejun ;
Odgerel, Zagaa ;
Warner, Virginia ;
Wickramaratne, Priya J. ;
Talati, Ardesheer ;
Ansorge, Mark ;
Brown, Alan S. ;
Sourander, Andre ;
Weissman, Myrna M. .
PSYCHIATRY RESEARCH-NEUROIMAGING, 2016, 248 :83-93
[15]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[16]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[17]   FUNCTIONAL CONNECTIVITY IN THE MOTOR CORTEX OF RESTING HUMAN BRAIN USING ECHO-PLANAR MRI [J].
BISWAL, B ;
YETKIN, FZ ;
HAUGHTON, VM ;
HYDE, JS .
MAGNETIC RESONANCE IN MEDICINE, 1995, 34 (04) :537-541
[18]  
Bonferroni C., 1936, PUBBLICAZIONI R I SU, V8, P3, DOI DOI 10.4135/9781412961288.N455
[19]  
Bonferroni C.E., 1935, Studi in Onore del Professore Salvatore Ortu Carboni, P13
[20]   Reorganization and plastic changes of the human brain associated with skill learning and expertise [J].
Chang, Yongmin .
FRONTIERS IN HUMAN NEUROSCIENCE, 2014, 8