Nonparametric Identifiability in Species Distribution and Abundance Models: Why it Matters and How to Diagnose a Lack of it Using Simulation

被引:3
作者
Stoudt, Sara [1 ]
de Valpine, Perry [2 ]
Fithian, William [3 ]
机构
[1] Bucknell Univ, Dept Math, 1 Dent Dr, Lewisburg, PA 17837 USA
[2] Univ Calif Berkeley, Dept Environm Sci Policy & Management, 110 Sproul Hall 5800, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Dept Stat, 110 Sproul Hall 5800, Berkeley, CA 94720 USA
关键词
Identifiability; Model mis-specification; Parametric assumptions; Species abundance models; Species distribution models; Splines; CAPTURE-RECAPTURE EXPERIMENTS; HABITAT SUITABILITY MODELS; ESTIMATING SITE OCCUPANCY; PRESENCE-ONLY DATA; N-MIXTURE MODELS; IDENTIFICATION; PROBABILITY; NONIDENTIFIABILITY; INFERENCE; DESIGN;
D O I
10.1007/s42519-023-00336-5
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Strong parametric assumptions are often made when formulating statistical models in practice. In the field of ecology, these assumptions have sparked repeated debates about identifiability of species distribution and abundance models. We leverage econometrics literature to broaden the view of the problem. Nonparametric identifiability exists when a model could, in theory, be estimated without parametric assumptions. Even if in practice an ecologist will not fit a nonparametric model, the potential to do so means the data are informative for desired goals. Our approach for determining whether nonparametric identifiability holds in targeted parts of the model is based on relaxing particular parametric assumptions. We approximate a nonparametric relationship as a flexible, unpenalized spline fit to simulated data with increasing sample sizes. We show the importance of semi-parametric identifiability, nonparametric identifiability achieved in part of a model, with presence-only models, single-visit occupancy and abundance models, and capture-recapture models with detection heterogeneity. In each case, we use our simulation approach to illustrate that when nonparametric identifiability holds in a regression relationship, even a mis-specified parametric model may provide a useful approximation of properties of interest like prevalence and average occurrence and abundance, the fit of alternative models can be compared, and parametric assumptions can be checked. When semi-parametric identifiability does not hold, parametric assumptions create artificial identifiability, and alternative models cannot be distinguished empirically. We argue that ecologists, and modelers in general, should be most confident in results when a stronger form of identifiability holds.
引用
收藏
页数:26
相关论文
共 73 条
[1]   The nonparametric identification of treatment effects in duration models [J].
Abbring, JH ;
Van den Berg, GJ .
ECONOMETRICA, 2003, 71 (05) :1491-1517
[2]  
Akaike Hirotogu, 1998, Selected Papers of Hirotugu Akaike, P199
[3]  
[Anonymous], About us
[4]   On the reliability of N-mixture models for count data [J].
Barker, Richard J. ;
Schofield, Matthew R. ;
Link, William A. ;
Sauer, John R. .
BIOMETRICS, 2018, 74 (01) :369-377
[5]  
Box G. E. P., 1979, Robustness in Statistics, P201, DOI 10.1016/B978-0-12-438150-6.50018-2
[6]   Evaluating resource selection functions [J].
Boyce, MS ;
Vernier, PR ;
Nielsen, SE ;
Schmiegelow, FKA .
ECOLOGICAL MODELLING, 2002, 157 (2-3) :281-300
[7]  
Casella G., 2021, Statistical Inference
[8]   Detecting parameter redundancy [J].
Catchpole, EA ;
Morgan, BJT .
BIOMETRIKA, 1997, 84 (01) :187-196
[9]   A hybrid symbolic-numerical method for determining model structure [J].
Choquet, R. ;
Cole, D. J. .
MATHEMATICAL BIOSCIENCES, 2012, 236 (02) :117-125
[10]  
Cole D, 2020, Parameter Redundancy and Identifiability, DOI [10.1201/9781315120003, DOI 10.1201/9781315120003]