An unbiased model comparison test using cross-validation

被引:0
|
作者
Bruce A. Desmarais
Jeffrey J. Harden
机构
[1] University of Massachusetts—Amherst,Department of Political Science
[2] University of Colorado Boulder,Department of Political Science
来源
Quality & Quantity | 2014年 / 48卷
关键词
Model selection; Cross-validation; Kullback–Leibler Divergence; Vuong test; Clarke test;
D O I
暂无
中图分类号
学科分类号
摘要
Social scientists often consider multiple empirical models of the same process. When these models are parametric and non-nested, the null hypothesis that two models fit the data equally well is commonly tested using methods introduced by Vuong (Econometrica 57(2):307–333, 1989) and Clarke (Am J Political Sci 45(3):724–744, 2001; J Confl Resolut 47(1):72–93, 2003; Political Anal 15(3):347–363, 2007). The objective of each is to compare the Kullback–Leibler Divergence (KLD) of the two models from the true model that generated the data. Here we show that both of these tests are based upon a biased estimator of the KLD, the individual log-likelihood contributions, and that the Clarke test is not proven to be consistent for the difference in KLDs. As a solution, we derive a test based upon cross-validated log-likelihood contributions, which represent an unbiased KLD estimate. We demonstrate the CVDM test’s superior performance via simulation, then apply it to two empirical examples from political science. We find that the test’s selection can diverge from those of the Vuong and Clarke tests and that this can ultimately lead to differences in substantive conclusions.
引用
收藏
页码:2155 / 2173
页数:18
相关论文
共 50 条
  • [1] An unbiased model comparison test using cross-validation
    Desmarais, Bruce A.
    Harden, Jeffrey J.
    QUALITY & QUANTITY, 2014, 48 (04) : 2155 - 2173
  • [2] Empirical Comparison Between Cross-Validation and Mutation-Validation in Model Selection
    Yu, Jinyang
    Hamdan, Sami
    Sasse, Leonard
    Morrison, Abigail
    Patil, Kaustubh R.
    ADVANCES IN INTELLIGENT DATA ANALYSIS XXII, PT II, IDA 2024, 2024, 14642 : 56 - 67
  • [3] On Cross-Validation for MLP Model Evaluation
    Karkkainen, Tommi
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2014, 8621 : 291 - 300
  • [4] Linear model selection by cross-validation
    Rao, CR
    Wu, Y
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2005, 128 (01) : 231 - 240
  • [5] No unbiased estimator of the variance of K-fold cross-validation
    Bengio, Y
    Grandvalet, Y
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 : 1089 - 1105
  • [6] A survey of cross-validation procedures for model selection
    Arlot, Sylvain
    Celisse, Alain
    STATISTICS SURVEYS, 2010, 4 : 40 - 79
  • [7] Cross-validation is dead. Long live cross-validation! Model validation based on resampling
    Knut Baumann
    Journal of Cheminformatics, 2 (Suppl 1)
  • [8] Regular, Median and Huber Cross-Validation: A Computational Comparison
    Yu, Chi-Wai
    Clarke, Bertrand
    STATISTICAL ANALYSIS AND DATA MINING, 2015, 8 (01) : 14 - 33
  • [9] Estimating the Number of Clusters Using Cross-Validation
    Fu, Wei
    Perry, Patrick O.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2020, 29 (01) : 162 - 173
  • [10] Confidence intervals for the Cox model test error from cross-validation
    Sun, Min Woo
    Tibshirani, Robert
    STATISTICS IN MEDICINE, 2023, 42 (25) : 4532 - 4541