Fraud detection with natural language processing

Cited by: 2
Authors
Boulieris, Petros [1]
Pavlopoulos, John [1, 2]
Xenos, Alexandros [1]
Vassalos, Vasilis [1]
Affiliations
[1] Athens Univ Econ & Business, Dept Informat, Athens, Greece
[2] Stockholm Univ, Dept Comp & Syst Sci, Stockholm, Sweden
Keywords
Fraud detection; Natural language processing; E-banking; Feature engineering; Varying class imbalance
DOI
10.1007/s10994-023-06354-5
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Automated fraud detection can help organisations safeguard user accounts, a task made very challenging by the great sparsity of known fraud transactions. Many approaches in the literature focus on credit card fraud and ignore the growing field of online banking. However, publicly available data is lacking for both, which hinders the progress of the field and limits the investigation of potential solutions. With this work, we: (a) introduce FraudNLP, the first anonymised, publicly available dataset for online fraud detection; (b) benchmark machine and deep learning methods with multiple evaluation measures; and (c) argue that online actions follow rules similar to those of natural language and hence can be approached successfully with natural language processing methods.
Pages: 5087-5108
Page count: 22