Feature engineering strategies for credit card fraud detection

被引:176
作者
Bahnsen, Alejandro Correa [1 ]
Aouada, Djamila [1 ]
Stojanovic, Aleksandar [1 ]
Ottersten, Bjoern [1 ]
机构
[1] Univ Luxembourg, Interdisciplinary Ctr Secur Reliabil & Trust, Luxembourg, Luxembourg
关键词
Cost-sensitive learning; Fraud detection; Preprocessing; Von Mises distribution; TRANSACTION AGGREGATION;
D O I
10.1016/j.eswa.2015.12.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Every year billions of Euros are lost worldwide due to credit card fraud. Thus, forcing financial institutions to continuously improve their fraud detection systems. In recent years, several studies have proposed the use of machine learning and data mining techniques to address this problem. However, most studies used some sort of misclassification measure to evaluate the different solutions, and do not take into account the actual financial costs associated with the fraud detection process. Moreover, when constructing a credit card fraud detection model, it is very important how to extract the right features from the transactional data. This is usually done by aggregating the transactions in order to observe the spending behavioral patterns of the customers. In this paper we expand the transaction aggregation strategy, and propose to create a new set of features based on analyzing the periodic behavior of the time of a transaction using the von Mises distribution. Then, using a real credit card fraud dataset provided by a large European card processing company, we compare state-of-the-art credit card fraud detection models, and evaluate how the different sets of features have an impact on the results. By including the proposed periodic features into the methods, the results show an average increase in savings of 13%. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:134 / 142
页数:9
相关论文
共 32 条
  • [1] American Institute of CPAs, 2011, INT FINANCIAL REPORT
  • [2] [Anonymous], J OPERATIONAL RES SO
  • [3] [Anonymous], CREDIT SCORING CREDI
  • [4] Bachmayer S., 2008, Artificial Immune Systems, V5132, P119
  • [5] Bahnsen A.C., 2014, P 2014 SIAM INT C DA, P677
  • [6] Example-dependent cost-sensitive decision trees
    Bahnsen, Alejandro Correa
    Aouada, Djamila
    Ottersten, Bjoern
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (19) : 6609 - 6619
  • [7] Example-Dependent Cost-Sensitive Logistic Regression for Credit Scoring
    Bahnsen, Alejandro Correa
    Aouada, Djamila
    Ottersten, Bjorn
    [J]. 2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2014, : 263 - 269
  • [8] Cost Sensitive Credit Card Fraud Detection using Bayes Minimum Risk
    Bahnsen, Alejandro Correa
    Stojanovic, Aleksandar
    Aouada, Djamila
    Ottersten, Bjoern
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 1, 2013, : 333 - 338
  • [9] Data mining for credit card fraud: A comparative study
    Bhattacharyya, Siddhartha
    Jha, Sanjeev
    Tharakunnel, Kurian
    Westland, J. Christopher
    [J]. DECISION SUPPORT SYSTEMS, 2011, 50 (03) : 602 - 613
  • [10] Bishop C., 2006, Pattern recognition and machine learning, P423