EnsemFDet: An Ensemble Approach to Fraud Detection based on Bipartite Graph

被引:19
作者
Ren, Yuxiang [1 ]
Zhu, Hao [2 ]
Zhang, Jiawei [1 ]
Dai, Peng [3 ]
Bo, Liefeng [3 ]
机构
[1] Florida State Univ, Dept Comp Sci, IFM Lab, Tallahassee, FL 32306 USA
[2] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT, Australia
[3] JD Finance Amer Corp, AI Lab, Mountain View, CA USA
来源
2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021) | 2021年
关键词
Bipartite Graph; Ensembles; Fraud Detection; Graph Mining;
D O I
10.1109/ICDE51399.2021.00197
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fraud detection is extremely critical for e-commerce business platforms. Utilizing graph structure data and identifying unexpected dense subgraphs as suspicious is a category of commonly used fraud detection methods. Among them, spectral methods solve the problem efficiently but hurt the performance due to the relaxed constraints. Heuristic methods cannot be accelerated with parallel computation and fail to control the scope of returned suspicious nodes. These drawbacks affect the real-world applications of existing graph-based methods. In this paper, we propose an Ensemble based Fraud DETection (ENSEMFDET) method to scale up fraud detection in bipartite graphs. By oversampling the graph and solving the subproblems, the ensemble approach further votes suspicious nodes without sacrificing the prediction accuracy. Extensive experiments have been done on real transaction data from JD.com and demonstrate the effectiveness, practicability, and scalability of ENSEMFDET.
引用
收藏
页码:2039 / 2044
页数:6
相关论文
共 23 条
  • [1] End-to-end neural network architecture for fraud scoring in card payments
    Ander Gomez, Jon
    Arevalo, Juan
    Paredes, Roberto
    Nin, Jordi
    [J]. PATTERN RECOGNITION LETTERS, 2018, 105 : 175 - 181
  • [2] [Anonymous], 2012, P INT C WORLD WID WE
  • [3] Finding Subgraphs with Maximum Total Density and Limited Overlap
    Balalau, Oana Denisa
    Bonchi, Francesco
    Chany, T-H. Hubert
    Gullo, Francesco
    Sozioz, Mauro
    [J]. WSDM'15: PROCEEDINGS OF THE EIGHTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2015, : 379 - 388
  • [4] Beutel A., 2013, P 22 INT C WORLD WID, P119, DOI DOI 10.1145/2488388.2488400
  • [5] Cao Q., 2012, P USENIX S NETW SYST, P197
  • [6] Charikar M., 2000, Approximation Algorithms for Combinatorial Optimization. Third International Workshop, APPROX 2000. Proceedings (Lecture Notes in Computer Science Vol.1913), P84
  • [7] Accelerating Graph Mining Algorithms via Uniform Random Edge Sampling
    Ciao, Ruohan
    Xu, Huanle
    Hu, Pili
    Lau, Wing Cheong
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016,
  • [8] FRAUDAR: Bounding Graph Fraud in the Face of Camouflage
    Hooi, Bryan
    Song, Hyun Ah
    Beutel, Alex
    Shah, Neil
    Shin, Kijung
    Faloutsos, Christos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 895 - 904
  • [9] Data clustering: 50 years beyond K-means
    Jain, Anil K.
    [J]. PATTERN RECOGNITION LETTERS, 2010, 31 (08) : 651 - 666
  • [10] A General Suspiciousness Metric for Dense Blocks in Multimodal Data
    Jiang, Meng
    Beutel, Alex
    Cui, Peng
    Hooi, Bryan
    Yang, Shiqiang
    Faloutsos, Christos
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 781 - 786