Model averaging approaches to data subset selection

被引:2
作者
Neil E.T. [1 ]
Sitison J.W. [1 ]
机构
[1] Department of Physics, University of Colorado, Boulder, 80309, CO
关键词
Compendex;
D O I
10.1103/PhysRevE.108.045308
中图分类号
学科分类号
摘要
Model averaging is a useful and robust method for dealing with model uncertainty in statistical analysis. Often, it is useful to consider data subset selection at the same time, in which model selection criteria are used to compare models across different subsets of the data. Two different criteria have been proposed in the literature for how the data subsets should be weighted. We compare the two criteria closely in a unified treatment based on the Kullback-Leibler divergence and conclude that one of them is subtly flawed and will tend to yield larger uncertainties due to loss of information. Analytical and numerical examples are provided. © 2023 American Physical Society.
引用
收藏
相关论文
共 12 条
  • [1] Borsanyi S., Leading hadronic contribution to the muon magnetic moment from lattice QCD, Nature (London), 593, (2021)
  • [2] Borsanyi S., Ab initio calculation of the neutron-proton mass difference, Science, 347, (2015)
  • [3] Jay W. I., Neil E. T., Bayesian model averaging for analysis of lattice field theory results, Phys. Rev. D, 103, (2021)
  • [4] Neil E. T., Sitison J. W., Improved information criteria for Bayesian model averaging in lattice field theory
  • [5] Mehta P., Bukov M., Wang C.-H., Day A. G., Fisher C. R. C. K., Schwab D. J., A high-bias, low-variance introduction to machine learning for physicists, Phys. Rep, 810, (2019)
  • [6] Kullback S., Leibler R. A., On information and sufficiency, Ann. Math. Stat, 22, (1951)
  • [7] Larsen R. J., Marx M. L., An Introduction to Mathematical Statistics, (2005)
  • [8] Konishi S., Kitagawa G., Bayesian information criteria, Information Criteria and Statistical Modeling, pp. 211-237, (2008)
  • [9] Lepage G., Clark B., Davies C., Hornbostel K., Mackenzie P., Morningstar C., Trottier H., Constrained curve fitting, Nucl. Phys. Proc. Suppl, 106, (2002)
  • [10] Lepage G., lsqfit