Errors in Statistical Inference Under Model Misspecification: Evidence, Hypothesis Testing, and AIC

被引:43
|
作者
Dennis, Brian [1 ,2 ]
Ponciano, Jose Miguel [3 ]
Taper, Mark L. [3 ,4 ]
Lele, Subhash R. [5 ]
机构
[1] Univ Idaho, Dept Fish & Wildlife Sci, Moscow, ID 83843 USA
[2] Univ Idaho, Dept Stat Sci, Moscow, ID 83843 USA
[3] Univ Florida, Biol Dept, Gainesville, FL USA
[4] Montana State Univ, Dept Ecol, Bozeman, MT 59717 USA
[5] Univ Alberta, Dept Math & Stat Sci, Edmonton, AB, Canada
来源
FRONTIERS IN ECOLOGY AND EVOLUTION | 2019年 / 7卷
关键词
model misspecification; evidential statistics; evidence; error rates in model selection; Kullback-Leibler divergence; hypothesis testing; Akaike's information criterion; model selection; LIKELIHOOD RATIO; INFORMATION CRITERION; MULTIMODEL INFERENCE; AKAIKE INFORMATION; P VALUES; SELECTION; ECOLOGY; EQUIVALENCE; PARAMETERS; REGRESSION;
D O I
10.3389/fevo.2019.00372
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The methods for making statistical inferences in scientific analysis have diversified even within the frequentist branch of statistics, but comparison has been elusive. We approximate analytically and numerically the performance of Neyman-Pearson hypothesis testing, Fisher significance testing, information criteria, and evidential statistics (Royall, 1997). This last approach is implemented in the form of evidence functions: statistics for comparing two models by estimating, based on data, their relative distance to the generating process (i.e., truth) (Lele, 2004). A consequence of this definition is the salient property that the probabilities of misleading or weak evidence, error probabilities analogous to Type 1 and Type 2 errors in hypothesis testing, all approach 0 as sample size increases. Our comparison of these approaches focuses primarily on the frequency with which errors are made, both when models are correctly specified, and when they are misspecified, but also considers ease of interpretation. The error rates in evidential analysis all decrease to 0 as sample size increases even under model misspecification. Neyman-Pearson testing on the other hand, exhibits great difficulties under misspecification. The real Type 1 and Type 2 error rates can be less, equal to, or greater than the nominal rates depending on the nature of model misspecification. Under some reasonable circumstances, the probability of Type 1 error is an increasing function of sample size that can even approach 1! In contrast, under model misspecification an evidential analysis retains the desirable properties of always having a greater probability of selecting the best model over an inferior one and of having the probability of selecting the best model increase monotonically with sample size. We show that the evidence function concept fulfills the seeming objectives of model selection in ecology, both in a statistical as well as scientific sense, and that evidence functions are intuitive and easily grasped. We find that consistent information criteria are evidence functions but the MSE minimizing (or efficient) information criteria (e.g., AIC, AICc, TIC) are not. The error properties of the MSE minimizing criteria switch between those of evidence functions and those of Neyman-Pearson tests depending on models being compared.
引用
收藏
页数:28
相关论文
共 50 条
  • [41] Statistical planning and inference in accelerated life testing using the CHSS model
    Bagdonavicius, V
    Cheminade, O
    Nikulin, M
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2004, 126 (02) : 535 - 551
  • [42] Likelihood inference under the general response transformation model with heteroscedastic errors
    Chen, CR
    Wang, LC
    TAIWANESE JOURNAL OF MATHEMATICS, 2003, 7 (02): : 261 - 273
  • [43] Statistical Inference for Estimators in a Semiparametric EV Model with Linear Process Errors and Missing Responses
    Zhang, Jing-Jing
    Yang, Xue
    Mathematical Problems in Engineering, 2023, 2023
  • [44] INTRODUCTION TO BIOSTATISTICS .5. STATISTICAL-INFERENCE TECHNIQUES FOR HYPOTHESIS-TESTING WITH NONPARAMETRIC DATA
    GADDIS, GM
    GADDIS, ML
    ANNALS OF EMERGENCY MEDICINE, 1990, 19 (09) : 1054 - 1059
  • [45] Identification of Dominant Hydrological Mechanisms Using Bayesian Inference, Multiple Statistical Hypothesis Testing, and Flexible Models
    Prieto, Cristina
    Kavetski, Dmitri
    Le Vine, Nataliya
    Alvarez, Cesar
    Medina, Raul
    WATER RESOURCES RESEARCH, 2021, 57 (08)
  • [46] Testing treatment effects in unconfounded studies under model misspecification: Logistic regression, discretization, and their combination
    Cangul, M. Z.
    Chretien, Y. R.
    Gutman, R.
    Rubin, D. B.
    STATISTICS IN MEDICINE, 2009, 28 (20) : 2531 - 2551
  • [47] In Silico Model-Based Inference: A Contemporary Approach for Hypothesis Testing in Network Biology
    Klinke, David J., II
    BIOTECHNOLOGY PROGRESS, 2014, 30 (06) : 1247 - 1261
  • [48] Errors May Not Cue Recall of Corrective Feedback: Evidence Against the Mediation Hypothesis of the Testing Effect
    Leggett, Jack M., I
    Burt, Jennifer S.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 2021, 47 (01) : 65 - 74
  • [49] Bayesian phylogenetic inference under a statistical insertion-deletion model
    Lunter, G
    Miklós, I
    Drummond, A
    Jensen, JL
    Hein, J
    ALGORITHMS IN BIOINFORMATICS, PROCEEDINGS, 2003, 2812 : 228 - 244
  • [50] Sparse Statistical Model Inference for Analog Circuits under Process Variations
    Zhang, Yan
    Sankaranarayanan, Sriram
    Somenzi, Fabio
    2014 19TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2014, : 449 - 454