A multivariate test for detecting fraud based on Benford's law, with application to music streaming data

被引:6
作者
Mumic, Nermina [1 ]
Filzmoser, Peter [1 ]
机构
[1] TU Wien, Inst Stat & Math Methods Econ, Wiedner Hauptstr 7, A-1040 Vienna, Austria
关键词
Benford's Law; Compositional data; Fraud detection; Multivariate testing;
D O I
10.1007/s10260-021-00582-6
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Benford's law became a prevalent concept for fraud and anomaly detection. It examines the frequencies of the leading digits of numbers in a collection of data and states that the leading digit is most often 1, with diminishing frequencies up to 9. In this paper we propose a multivariate approach to test whether the observed frequencies follow the theoretical Benford distribution. Our approach is based on the concept of compositional data, which examines the relative information between the frequencies of the leading digits. As a result, we introduce a multivariate test for Benford distribution. In simulation studies and examples we compare the multivariate test performance to the conventional chi-square and Kolmogorov-Smirnov test, where the multivariate test turns out to be more sensitive in many cases. A diagnostics plot based on relative information allows to reveal and interpret the possible deviations from the Benford distribution.
引用
收藏
页码:819 / 840
页数:22
相关论文
共 22 条
  • [1] AITCHISON J, 1982, J ROY STAT SOC B, V44, P139
  • [2] Anderson TW., 2003, INTRO MULTIVARIATE S
  • [3] Baxter MJ, 1999, DETECTING MULTIVARIA
  • [4] Benford Frank, 1938, Proc. Am. Philos. Soc., P551, DOI DOI 10.2307/984802
  • [5] A basic theory of Benford's Law
    Berger, Arno
    Hill, Theodore P.
    [J]. PROBABILITY SURVEYS, 2011, 8 : 1 - 126
  • [6] Newcomb-Benford law and the detection of frauds in international trade
    Cerioli, Andrea
    Barabesi, Lucio
    Cerasa, Andrea
    Menegatti, Mario
    Perrotta, Domenico
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (01) : 106 - 115
  • [7] Benford's Law and the Detection of Election Fraud
    Deckert, Joseph
    Myagkov, Mikhail
    Ordeshook, Peter C.
    [J]. POLITICAL ANALYSIS, 2011, 19 (03) : 245 - 268
  • [8] Isometric logratio transformations for compositional data analysis
    Egozcue, JJ
    Pawlowsky-Glahn, V
    Mateu-Figueras, G
    Barceló-Vidal, C
    [J]. MATHEMATICAL GEOLOGY, 2003, 35 (03): : 279 - 300
  • [9] Filzmoser P., 2018, Applied compositional data analysis with worked examples in R, DOI DOI 10.1007/978-3-319-96422-5
  • [10] Compositional data analysis and zeros in micro data
    Fry, JM
    Fry, TRL
    McLaren, KR
    [J]. APPLIED ECONOMICS, 2000, 32 (08) : 953 - 959