Bagging k-dependence probabilistic networks: An alternative powerful fraud detection tool

被引:25
作者
Louzada, Francisco [1 ]
Ara, Anderson [2 ]
机构
[1] Univ Sao Paulo, Inst Matemat & Ciencias Comp, Sao Carlos, SP, Brazil
[2] Univ Fed Sao Carlos, Dept Estat, BR-13560 Sao Carlos, SP, Brazil
关键词
Fraud detection; Probabilistic networks; Bayesian networks; Classification models; Bagging; Predictive performance; CLASSIFICATION;
D O I
10.1016/j.eswa.2012.04.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fraud is a global problem that has required more attention due to an accentuated expansion of modern technology and communication. When statistical techniques are used to detect fraud, whether a fraud detection model is accurate enough in order to provide correct classification of the case as a fraudulent or legitimate is a critical factor. In this context, the concept of bootstrap aggregating (bagging) arises. The basic idea is to generate multiple classifiers by obtaining the predicted values from the adjusted models to several replicated datasets and then combining them into a single predictive classification in order to improve the classification accuracy. In this paper, for the first time, we aim to present a pioneer study of the performance of the discrete and continuous k-dependence probabilistic networks within the context of bagging predictors classification. Via a large simulation study and various real datasets, we discovered that the probabilistic networks are a strong modeling option with high predictive capacity and with a high increment using the bagging procedure when compared to traditional techniques. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:11583 / 11592
页数:10
相关论文
共 35 条
  • [1] [Anonymous], 2004, Learning Bayesian Networks
  • [2] [Anonymous], 2007, 2007 INT C SERV SYST, DOI [10.1109/ICSSSM.2007.4280163, DOI 10.1109/ICSSSM.2007.4280163]
  • [3] [Anonymous], AI APPROACHES FRAUD
  • [4] Assessing the accuracy of prediction algorithms for classification: an overview
    Baldi, P
    Brunak, S
    Chauvin, Y
    Andersen, CAF
    Nielsen, H
    [J]. BIOINFORMATICS, 2000, 16 (05) : 412 - 424
  • [5] Bhowmik R, 2011, J EMERGING TRENDS CO, V2
  • [6] Improving the analysis of dependable systems by mapping fault trees into Bayesian networks
    Bobbio, A
    Portinale, L
    Minichino, M
    Ciancamerla, E
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2001, 71 (03) : 249 - 260
  • [7] Bolton RJ, 2002, STAT SCI, V17, P235
  • [8] Breiman L, 1998, ANN STAT, V26, P801
  • [9] Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1023/A:1018054314350
  • [10] Evaluation of gene structure prediction programs
    Burset, M
    Guigo, R
    [J]. GENOMICS, 1996, 34 (03) : 353 - 367