Statistical Inference After Model Selection

被引:48
|
作者
Berk, Richard [1 ,2 ]
Brown, Lawrence [1 ]
Zhao, Linda [1 ]
机构
[1] Univ Penn, Dept Stat, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Criminol, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
Model selection; Statistical inference; Mixtures of distributions; DANTZIG SELECTOR; LARGER;
D O I
10.1007/s10940-009-9077-7
中图分类号
DF [法律]; D9 [法律];
学科分类号
0301 ;
摘要
Conventional statistical inference requires that a model of how the data were generated be known before the data are analyzed. Yet in criminology, and in the social sciences more broadly, a variety of model selection procedures are routinely undertaken followed by statistical tests and confidence intervals computed for a "final" model. In this paper, we examine such practices and show how they are typically misguided. The parameters being estimated are no longer well defined, and post-model-selection sampling distributions are mixtures with properties that are very different from what is conventionally assumed. Confidence intervals and statistical tests do not perform as they should. We examine in some detail the specific mechanisms responsible. We also offer some suggestions for better practice and show though a criminal justice example using real data how proper statistical inference in principle may be obtained.
引用
收藏
页码:217 / 236
页数:20
相关论文
共 50 条
  • [31] Statistical Inference in Games
    Salant, Yuval
    Cherry, Josh
    ECONOMETRICA, 2020, 88 (04) : 1725 - 1752
  • [32] Model Selection and Parameter Inference in Phylogenetics Using Nested Sampling
    Russel, Patricio Maturana
    Brewer, Brendon J.
    Klaere, Steffen
    Bouckaert, Remco R.
    SYSTEMATIC BIOLOGY, 2019, 68 (02) : 219 - 233
  • [33] Bootstrap Approximation of Model Selection Probabilities for Multimodel Inference Frameworks
    Dajles, Andres
    Cavanaugh, Joseph
    ENTROPY, 2024, 26 (07)
  • [34] History of multimodel inference via model selection in wildlife science
    Lindberg, Mark S.
    Schmidt, Joshua H.
    Walker, Johann
    JOURNAL OF WILDLIFE MANAGEMENT, 2015, 79 (05) : 704 - 707
  • [35] Applying optimal model selection in principal stratification for causal inference
    Odondi, Lang'o
    McNamee, Roseanne
    STATISTICS IN MEDICINE, 2013, 32 (11) : 1815 - 1828
  • [36] Consistency of variational Bayes inference for estimation and model selection in mixtures
    Cherief-Abdellatif, Badr-Eddine
    Alquier, Pierre
    ELECTRONIC JOURNAL OF STATISTICS, 2018, 12 (02): : 2995 - 3035
  • [37] An asymptotic theory for model selection inference in general semiparametric problems
    Claeskens, Gerda
    Carroll, Raymond J.
    BIOMETRIKA, 2007, 94 (02) : 249 - 265
  • [38] Stepwise Model Fitting and Statistical Inference: Turning Noise into Signal Pollution
    Mundry, Roger
    Nunn, Charles L.
    AMERICAN NATURALIST, 2009, 173 (01) : 119 - 123
  • [39] Statistical inference in a growth curve quantile regression model for longitudinal data
    Cho, Hyunkeun Ryan
    BIOMETRICS, 2018, 74 (03) : 855 - 862
  • [40] Statistical Inference for Independent Component Analysis Based on Polynomial Spline Model
    Kawaguchi, Atsushi
    INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 478 - 480