Statistical Inference After Model Selection

被引:48
|
作者
Berk, Richard [1 ,2 ]
Brown, Lawrence [1 ]
Zhao, Linda [1 ]
机构
[1] Univ Penn, Dept Stat, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Criminol, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
Model selection; Statistical inference; Mixtures of distributions; DANTZIG SELECTOR; LARGER;
D O I
10.1007/s10940-009-9077-7
中图分类号
DF [法律]; D9 [法律];
学科分类号
0301 ;
摘要
Conventional statistical inference requires that a model of how the data were generated be known before the data are analyzed. Yet in criminology, and in the social sciences more broadly, a variety of model selection procedures are routinely undertaken followed by statistical tests and confidence intervals computed for a "final" model. In this paper, we examine such practices and show how they are typically misguided. The parameters being estimated are no longer well defined, and post-model-selection sampling distributions are mixtures with properties that are very different from what is conventionally assumed. Confidence intervals and statistical tests do not perform as they should. We examine in some detail the specific mechanisms responsible. We also offer some suggestions for better practice and show though a criminal justice example using real data how proper statistical inference in principle may be obtained.
引用
收藏
页码:217 / 236
页数:20
相关论文
共 50 条
  • [41] New metric learning model using statistical inference for kinship verification
    Qin, Xiaoqian
    Liu, Dakun
    Gui, Bin
    Wang, Dong
    APPLIED SOFT COMPUTING, 2020, 95
  • [42] Ensuring valid inference for Cox hazard ratios after variable selection
    Van Lancker, Kelly
    Dukes, Oliver
    Vansteelandt, Stijn
    BIOMETRICS, 2023, 79 (04) : 3096 - 3110
  • [43] Margin-adaptive model selection in statistical learning
    Arlot, Sylvain
    Bartlett, Peter L.
    BERNOULLI, 2011, 17 (02) : 687 - 713
  • [44] Natural Selection, Adaptive Topographies and the Problem of Statistical Inference: The Moraba scurra Controversy Under the Microscope
    Grodwohl, Jean-Baptiste
    JOURNAL OF THE HISTORY OF BIOLOGY, 2017, 50 (04) : 753 - 796
  • [45] The Limited Role of Formal Statistical Inference in Scientific Inference
    Hubbard, Raymond
    Haig, Brian D.
    Parsa, Rahul A.
    AMERICAN STATISTICIAN, 2019, 73 : 91 - 98
  • [46] Natural Selection, Adaptive Topographies and the Problem of Statistical Inference: The Moraba scurra Controversy Under the Microscope
    Jean-Baptiste Grodwohl
    Journal of the History of Biology, 2017, 50 : 753 - 796
  • [47] Inference and model selection in general causal time series with exogenous covariates
    Diop, Mamadou Lamine
    Kengne, William
    ELECTRONIC JOURNAL OF STATISTICS, 2022, 16 (01): : 116 - 157
  • [48] Testing for treeness: lateral gene transfer, phylogenetic inference, and model selection
    Velasco, Joel D.
    Sober, Elliott
    BIOLOGY & PHILOSOPHY, 2010, 25 (04) : 675 - 687
  • [49] Estimation and model selection based inference in single and multiple threshold models
    Gonzalo, J
    Pitarakis, JY
    JOURNAL OF ECONOMETRICS, 2002, 110 (02) : 319 - 352
  • [50] Procrustean Statistical Inference of Deformations
    Hossainali, M. Mashhadi
    Becker, M.
    Groten, E.
    JOURNAL OF GEODETIC SCIENCE, 2011, 1 (02) : 170 - 180