Breakdown of statistical inference from some random experiments

被引:11
|
作者
Kupczynski, Marian [1 ]
De Raedt, Hans [2 ]
机构
[1] UQO, Dept Informat, Case Postale 1250,Succursale Hull, Gatineau, PQ J8X 3X7, Canada
[2] Univ Groningen, Zernike Inst Adv Mat, Nijenborgh 4, NL-9747 AG Groningen, Netherlands
关键词
Finite statistics; Statistical inference; Monte Carlo simulations; Internal protocols; Breakdown of significance tests; Sample inhomogeneity; Chi-square compatibility tests; Chi-square frequency histograms;
D O I
10.1016/j.cpc.2015.11.010
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Many experiments can be interpreted in terms of random processes operating according to some internal protocols. When experiments are costly or cannot be repeated only one or a few finite samples are available. In this paper we study data generated by pseudo-random computer experiments operating according to particular internal protocols. We show that the standard statistical analysis performed on a sample, containing 105 data points or more, may sometimes be highly misleading and statistical errors largely underestimated. Our results confirm in a dramatic way the dangers of standard asymptotic statistical inference if a sample is not homogeneous. We demonstrate that analyzing various subdivisions of samples by multiple chi-square tests and chi-square frequency graphs is very effective in detecting sample inhomogeneity. Therefore to assure correctness of the statistical inference the above mentioned chi-square tests and other non-parametric sample homogeneity tests should be incorporated in any statistical analysis of experimental data. If such tests are not performed the reported conclusions and estimates of the errors cannot be trusted. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:168 / 175
页数:8
相关论文
共 50 条