Accounting for multiple testing in the analysis of spatio-temporal environmental data

被引:24
作者
Cortes, Jose [1 ]
Mahecha, Miguel [2 ]
Reichstein, Markus [2 ]
Brenning, Alexander [1 ]
机构
[1] Friedrich Schiller Univ, Dept Geog, Jena, Germany
[2] Max Planck Inst Biogeochem, Jena, Germany
关键词
Gridded data; Multiple testing; Nonparametric statistics; Permutation methods; Spatial patterns; Statistical inference; FALSE DISCOVERY RATE; FIELD SIGNIFICANCE; TREND; MODIS; GIMMS;
D O I
10.1007/s10651-020-00446-4
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The statistical analysis of environmental data from remote sensing and Earth system simulations often entails the analysis of gridded spatio-temporal data, with a hypothesis test being performed for each grid cell. When the whole image or a set of grid cells are analyzed for a global effect, the problem of multiple testing arises. When no global effect is present, we expect alpha of all grid cells to be false positives, and spatially autocorrelated data can give rise to clustered spurious rejections that can be misleading in an analysis of spatial patterns. In this work, we review standard solutions for the multiple testing problem and apply them to spatio-temporal environmental data. These solutions are independent of the test statistic, and any test statistic can be used (e.g., tests for trends or change points in time series). Additionally, we introduce permutation methods and show that they have more statistical power. Real-world data are used to provide examples of the analysis, and the performance of each method is assessed in a simulation study. Unlike other simulation studies, our study compares the statistical power of the presented methods in a comprehensive simulation study. In conclusion, we present several statistically rigorous methods for analyzing spatio-temporal environmental data and controlling the false positives. These methods allow the use of any test statistic in a wide range of applications in environmental sciences and remote sensing.
引用
收藏
页码:293 / 318
页数:26
相关论文
共 40 条
[1]  
[Anonymous], 2019, GISS Surface Temperature Analysis (GISTEMP), version 4
[2]   Satellite observations of high northern latitude vegetation productivity changes between 1982 and 2008: ecological variability and regional differences (vol 6, pg 045501, 2011) [J].
Beck, Pieter S. A. ;
Goetz, Scott J. .
ENVIRONMENTAL RESEARCH LETTERS, 2012, 7 (02)
[3]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]  
Bonferroni Carlo, 1936, Istituto Superiore di Scienze Economiche e Commericiali di Firenze, V8, P3, DOI DOI 10.4135/9781412961288.N455
[6]  
Bretz F., 2010, Multiple Comparisons Using R
[7]  
Cao J, 2001, LECT NOTES STAT, V159, P169
[8]   Field Significance of Regression Patterns [J].
DelSole, Timothy ;
Yang, Xiaosong .
JOURNAL OF CLIMATE, 2011, 24 (19) :5094-5107
[9]   Multiple hypothesis testing in microarray experiments [J].
Dudoit, S ;
Shaffer, JP ;
Boldrick, JC .
STATISTICAL SCIENCE, 2003, 18 (01) :71-103
[10]   MODIFIED RANDOMIZATION TESTS FOR NONPARAMETRIC HYPOTHESES [J].
DWASS, M .
ANNALS OF MATHEMATICAL STATISTICS, 1957, 28 (01) :181-187