Statistical Inference After Model Selection

Cited by: 48
Authors
Berk, Richard [1 ,2 ]
Brown, Lawrence [1 ]
Zhao, Linda [1 ]
Affiliations
[1] Univ Penn, Dept Stat, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Criminol, Philadelphia, PA 19104 USA
Funding
US National Science Foundation;
Keywords
Model selection; Statistical inference; Mixtures of distributions; DANTZIG SELECTOR; LARGER;
DOI
10.1007/s10940-009-9077-7
Chinese Library Classification number
DF [Law]; D9 [Law];
Discipline classification code
0301 ;
Abstract
Conventional statistical inference requires that a model of how the data were generated be known before the data are analyzed. Yet in criminology, and in the social sciences more broadly, a variety of model selection procedures are routinely undertaken, followed by statistical tests and confidence intervals computed for a "final" model. In this paper, we examine such practices and show how they are typically misguided. The parameters being estimated are no longer well defined, and post-model-selection sampling distributions are mixtures with properties that are very different from what is conventionally assumed. Confidence intervals and statistical tests do not perform as they should. We examine in some detail the specific mechanisms responsible. We also offer some suggestions for better practice and show through a criminal justice example using real data how proper statistical inference in principle may be obtained.
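The abstract's claim that confidence intervals "do not perform as they should" after model selection can be illustrated with a small simulation. The sketch below is not taken from the paper: the two-predictor linear model, the rule "drop x2 unless its |t| exceeds 1.96", and every parameter value (`n`, `b1`, `b2`, `rho`) are assumptions chosen only for illustration. It compares how often a nominal 95% interval for the x1 coefficient covers the true value when the full model is always fitted versus when the interval is computed from whichever model the selection step kept.

```python
import numpy as np

def ols(X, y):
    """Return OLS coefficients and their conventional standard errors."""
    n, p = X.shape
    XtX_inv = np.linalg.inv(X.T @ X)
    beta = XtX_inv @ X.T @ y
    resid = y - X @ beta
    sigma2 = resid @ resid / (n - p)          # unbiased residual variance
    se = np.sqrt(sigma2 * np.diag(XtX_inv))
    return beta, se

def simulate(n_sims=2000, n=200, b1=1.0, b2=0.15, rho=0.7, z=1.96, seed=0):
    """Estimate coverage of nominal 95% intervals for b1.

    All defaults are illustrative assumptions, not values from the paper.
    Returns (coverage with the full model always fitted,
             coverage when the model is chosen by a t-test on x2).
    """
    rng = np.random.default_rng(seed)
    cov = np.array([[1.0, rho], [rho, 1.0]])  # correlated predictors
    cover_full = 0
    cover_selected = 0
    for _ in range(n_sims):
        X = rng.multivariate_normal(np.zeros(2), cov, size=n)
        y = X @ np.array([b1, b2]) + rng.standard_normal(n)

        # Full model: both predictors, no selection.
        beta_f, se_f = ols(X, y)
        cover_full += int(abs(beta_f[0] - b1) <= z * se_f[0])

        # Selection step: keep x2 only if it looks "significant".
        if abs(beta_f[1] / se_f[1]) > z:
            est, se = beta_f[0], se_f[0]
        else:
            beta_r, se_r = ols(X[:, :1], y)   # refit with x1 alone
            est, se = beta_r[0], se_r[0]
        cover_selected += int(abs(est - b1) <= z * se)

    return cover_full / n_sims, cover_selected / n_sims

if __name__ == "__main__":
    full, selected = simulate()
    print(f"coverage, full model always fitted: {full:.3f}")
    print(f"coverage, after model selection:    {selected:.3f}")
```

Under these assumptions the post-selection estimate of the x1 coefficient is drawn from a mixture of two sampling distributions (one per candidate model), so the conventional interval from the "final" model should cover the true value noticeably less often than the nominal rate. The size of the shortfall depends on the assumed values of `b2` and `rho`.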
Pages: 217-236
Page count: 20
Related papers
50 in total
  • [21] Multi-parameters Model Selection for Network Inference
    Tozzo, Veronica
    Barla, Annalisa
    COMPLEX NETWORKS AND THEIR APPLICATIONS VIII, VOL 1, 2020, 881 : 566 - 577
  • [22] Backpropagation neural network model with statistical inference in manufacturing processes
    de Leon-Delgado, Homero
    Praga-Alejo, Rolando J.
    Gonzalez-Gonzalez, David S.
    JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2025, 44
  • [23] The hierarchical theory of justification and statistical model selection
    Speekenbrink, M
    NEW DEVELOPMENTS IN PSYCHOMETRICS, 2003, : 331 - 338
  • [24] Robust model selection and the statistical classification of languages
    Garcia, Jesus E.
    Gonzalez-Lopez, V. A.
    Viola, M. L. L.
    XI BRAZILIAN MEETING ON BAYESIAN STATISTICS (EBEB 2012), 2012, 1490 : 160 - 170
  • [25] Inference after variable selection using restricted permutation methods
    Wang, Rui
    Lagakos, Stephen W.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2009, 37 (04): : 625 - 644
  • [26] A jackknife type approach to statistical model selection
    Lee, Hyunsook
    Babu, G. Jogesh
    Rao, C. R.
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (01) : 301 - 311
  • [27] Radar HRRP statistical recognition: Parametric model and model selection
    Du, Lan
    Liu, Hongwei
    Bao, Zheng
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2008, 56 (05) : 1931 - 1944
  • [28] Statistical Inference after an Adaptive Group Sequential Design: A Case Study
    Lothar T. Tremmel
    Drug information journal : DIJ / Drug Information Association, 2010, 44 (5): : 589 - 598
  • [29] Statistical Inference After an Adaptive Group Sequential Design: A Case Study
    Tremmel, Lothar T.
    DRUG INFORMATION JOURNAL, 2010, 44 (05): : 589 - 598
  • [30] Geometric statistical inference
    Periwal, V
    NUCLEAR PHYSICS B, 1999, 554 (03) : 719 - 730