Nonparametric Identifiability in Species Distribution and Abundance Models: Why it Matters and How to Diagnose a Lack of it Using Simulation

被引:0
作者
Sara Stoudt
Perry de Valpine
William Fithian
机构
[1] Bucknell University,Department of Mathematics
[2] University of California,Department of Environmental Science, Policy, and Management
[3] Berkeley,Department of Statistics
[4] University of California,undefined
[5] Berkeley,undefined
来源
Journal of Statistical Theory and Practice | 2023年 / 17卷
关键词
Identifiability; Model mis-specification; Parametric assumptions; Species abundance models; Species distribution models; Splines;
D O I
暂无
中图分类号
学科分类号
摘要
Strong parametric assumptions are often made when formulating statistical models in practice. In the field of ecology, these assumptions have sparked repeated debates about identifiability of species distribution and abundance models. We leverage econometrics literature to broaden the view of the problem. Nonparametric identifiability exists when a model could, in theory, be estimated without parametric assumptions. Even if in practice an ecologist will not fit a nonparametric model, the potential to do so means the data are informative for desired goals. Our approach for determining whether nonparametric identifiability holds in targeted parts of the model is based on relaxing particular parametric assumptions. We approximate a nonparametric relationship as a flexible, unpenalized spline fit to simulated data with increasing sample sizes. We show the importance of semi-parametric identifiability, nonparametric identifiability achieved in part of a model, with presence-only models, single-visit occupancy and abundance models, and capture–recapture models with detection heterogeneity. In each case, we use our simulation approach to illustrate that when nonparametric identifiability holds in a regression relationship, even a mis-specified parametric model may provide a useful approximation of properties of interest like prevalence and average occurrence and abundance, the fit of alternative models can be compared, and parametric assumptions can be checked. When semi-parametric identifiability does not hold, parametric assumptions create artificial identifiability, and alternative models cannot be distinguished empirically. We argue that ecologists, and modelers in general, should be most confident in results when a stronger form of identifiability holds.
引用
收藏
相关论文
共 201 条
[1]  
Lewbel A(2019)the identification zoo: meanings of identification in econometrics J Econ Lit 57 835-903
[2]  
Koopmans TC(1950)The identification of structural characteristics Ann Math Stat 21 165-181
[3]  
Reiersol O(1971)Identification in parametric models Econometrica 56 433-447
[4]  
Rothenberg TJ(1988)Conditions for identification in nonparametric and parametric models Econometrica 71 1491-1517
[5]  
Roehrig CS(2003)The nonparametric identification of treatment effects in duration models Econometrica 21 479-481
[6]  
Abbring JH(2010)Learning from data: semiparametric models versus faith-based inference Epidemiology 69 463-482
[7]  
Van den Berg GJ(2007)Estimation of treatment effects in randomized trials with non-compliance and a dichotomous outcome J R Stat Soc Ser B 3 143-155
[8]  
Van der Laan M(1992)Identifiability and exchangeability for direct and indirect effects Epidemiology 4 236-243
[9]  
Hubbard AE(2013)Presence-only modelling using maxent: when can we trust the inferences? Methods Ecol Evol 24 276-292
[10]  
Jewell N(2015)Is my species distribuiton model fit for purpose? matching data and models to applications Glob Ecol Biogeogr 74 369-377