Fourth-corner correlation is a score test statistic in a log-linear trait-environment model that is useful in permutation testing

被引:16
作者
ter Braak, Cajo J. F. [1 ]
机构
[1] Wageningen Univ & Res, Biometris, Wageningen, Netherlands
关键词
Community ecology; Correspondence analysis; Fourth-corner; Permutation test; Score test statistic; Trait-environment association; SPECIES TRAITS; CROSS-CLASSIFICATIONS; FUNCTIONAL-GROUPS; HABITAT; ASSOCIATION; DISTURBANCE; VARIABLES; RESPONSES; TEMPLET;
D O I
10.1007/s10651-017-0368-0
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Ecologists wish to understand the role of traits of species in determining where each species occurs in the environment. For this, they wish to detect associations between species traits and environmental variables from three data tables, species count data from sites with associated environmental data and species trait data from data bases. These three tables leave a missing part, the fourth-corner. The fourth-corner correlations between quantitative traits and environmental variables, heuristically proposed 20 years ago, fill this corner. Generalized linear (mixed) models have been proposed more recently as a model-based alternative. This paper shows that the squared fourth-corner correlation times the total count is precisely the score test statistic for testing the linear-by-linear interaction in a Poisson log-linear model that also contains species and sites as main effects. For multiple traits and environmental variables, the score test statistic is proportional to the total inertia of a doubly constrained correspondence analysis. When the count data are over-dispersed compared to the Poisson or when there are other deviations from the model such as unobserved traits or environmental variables that interact with the observed ones, the score test statistic does not have the usual chi-square distribution. For these types of deviations, row- and column-based permutation methods (and their sequential combination) are proposed to control the type I error without undue loss of power (unless no deviation is present), as illustrated in a small simulation study. The issues for valid statistical testing are illustrated using the well-known Dutch Dune Meadow data set.
引用
收藏
页码:219 / 242
页数:24
相关论文
共 54 条
[1]   Leaf size, specific leaf area and microhabitat distribution of chaparral woody plants: contrasting patterns in species level and community level analyses [J].
Ackerly, DD ;
Knight, CA ;
Weiss, SB ;
Barton, K ;
Starmer, KP .
OECOLOGIA, 2002, 130 (03) :449-457
[2]  
[Anonymous], DATA ANAL COMMUNITY
[3]  
[Anonymous], 2015, Vector Generalized Linear and Additive Models: With an Implementation in R Internet
[4]  
[Anonymous], 1984, Theory and Application of Correspondence Analysis
[5]  
[Anonymous], 2006, Randomization, bootstrap and Monte Carlo methods in biology
[6]   Rao's score, Neyman's C(α) and Silvey's LM tests:: an essay on historical developments and some new results [J].
Bera, AK ;
Bilias, Y .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2001, 97 (01) :9-44
[7]  
Brookes M., 2011, The Matrix Reference Manual
[8]   The fourth-corner solution - using predictive models to understand how species traits interact with the environment [J].
Brown, Alexandra M. ;
Warton, David I. ;
Andrew, Nigel R. ;
Binns, Matthew ;
Cassis, Gerasimos ;
Gibb, Heloise .
METHODS IN ECOLOGY AND EVOLUTION, 2014, 5 (04) :344-352
[9]  
Cailliez F, 1976, INTRO ANAL DONNEES
[10]  
Cox DR., 1974, THEORETICAL STAT