The Newcomb-Benford Law in Its Relation to Some Common Distributions

被引:30
作者
Formann, Anton K. [1 ]
机构
[1] Univ Vienna, Dept Psychol Basic Res, Vienna, Austria
关键词
SIGNIFICANT-DIGIT; 1ST DIGIT; RANDOM-VARIABLES; NUMBERS; RATIO; ZIPF;
D O I
10.1371/journal.pone.0010541
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
An often reported, but nevertheless persistently striking observation, formalized as the Newcomb-Benford law (NBL), is that the frequencies with which the leading digits of numbers occur in a large variety of data are far away from being uniform. Most spectacular seems to be the fact that in many data the leading digit 1 occurs in nearly one third of all cases. Explanations for this uneven distribution of the leading digits were, among others, scale-and base-invariance. Little attention, however, found the interrelation between the distribution of the significant digits and the distribution of the observed variable. It is shown here by simulation that long right-tailed distributions of a random variable are compatible with the NBL, and that for distributions of the ratio of two random variables the fit generally improves. Distributions not putting most mass on small values of the random variable (e. g. symmetric distributions) fail to fit. Hence, the validity of the NBL needs the predominance of small values and, when thinking of real-world data, a majority of small entities. Analyses of data on stock prices, the areas and numbers of inhabitants of countries, and the starting page numbers of papers from a bibliography sustain this conclusion. In all, these findings may help to understand the mechanisms behind the NBL and the conditions needed for its validity. That this law is not only of scientific interest per se, but that, in addition, it has also substantial implications can be seen from those fields where it was suggested to be put into practice. These fields reach from the detection of irregularities in data (e. g. economic fraud) to optimizing the architecture of computers regarding number representation, storage, and round-off errors.
引用
收藏
页数:13
相关论文
共 39 条
[21]   The relationship between Zipf's law and the distribution of first digits [J].
Irmay, S .
JOURNAL OF APPLIED STATISTICS, 1997, 24 (04) :383-393
[22]   From uniform distributions to Benford's law [J].
Janvresse, T ;
De la Rue, T .
JOURNAL OF APPLIED PROBABILITY, 2004, 41 (04) :1203-1210
[23]  
Johnson NL., 1994, Continuous Univariate Distributions, V1
[24]  
Judge G, 2009, J HUM RESOUR, V44, P1
[25]   On the ratio of two folded normal distributions [J].
Kim, HJ .
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2006, 35 (06) :965-977
[26]  
Knuth D. E., ART COMPUTER PROGRAM, V2
[27]   Survival distributions satisfying Benford's law [J].
Leemis, LM ;
Schmeiser, BW ;
Evans, DL .
AMERICAN STATISTICIAN, 2000, 54 (04) :236-241
[28]   On the peculiar distribution of the US stock indexes' digits [J].
Ley, E .
AMERICAN STATISTICIAN, 1996, 50 (04) :311-313
[29]   On the non-existence of a general Benford's law [J].
Lolbert, Tamas .
MATHEMATICAL SOCIAL SCIENCES, 2008, 55 (02) :103-106
[30]   The first-digit frequencies of prime numbers and Riemann zeta zeros [J].
Luque, Bartolo ;
Lacasa, Lucas .
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2009, 465 (2107) :2197-2216