FFD: A Federated Learning Based Method for Credit Card Fraud Detection

被引:128
作者
Yang, Wensi [1 ,2 ]
Zhang, Yuhang [1 ,2 ]
Ye, Kejiang [1 ]
Li, Li [1 ]
Xu, Cheng-Zhong [3 ]
机构
[1] Chinese Acad Sci, Shengzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Univ Macau, Dept Comp & Informat Sci, Fac Sci & Technol, State Key Lab IoT Smart City, Taipa, Macao, Peoples R China
来源
BIG DATA - BIGDATA 2019 | 2019年 / 11514卷
基金
中国国家自然科学基金;
关键词
Federated learning; Credit card fraud; Skewed dataset; SYSTEM;
D O I
10.1007/978-3-030-23551-2_2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Credit card fraud has caused a huge loss to both banks and consumers in recent years. Thus, an effective Fraud Detection System (FDS) is important to minimize the loss for banks and cardholders. Based on our analysis, the credit card transaction dataset is very skewed, there are much fewer samples of frauds than legitimate transactions. Furthermore, due to the data security and privacy, different banks are usually not allowed to share their transaction datasets. These problemsmake FDS difficult to learn the patterns of frauds and also difficult to detect them. In this paper, we propose a framework to train a fraud detection model using behavior features with federated learning, we term this detection framework FFD (Federated learning for Fraud Detection). Different from the traditional FDS trained with data centralized in the cloud, FFD enables banks to learn fraud detection model with the training data distributed on their own local database. Then, a shared FDS is constructed by aggregating locally-computed updates of fraud detection model. Banks can collectively reap the benefits of shared model without sharing the dataset and protect the sensitive information of cardholders. Furthermore, an oversampling approach is combined to balance the skewed dataset. We evaluate the performance of our credit card FDS with FFD framework on a large scale dataset of real-world credit card transactions. Experimental results show that the federated learning based FDS achieves an average of test AUC to 95.5%, which is about 10% higher than traditional FDS.
引用
收藏
页码:18 / 32
页数:15
相关论文
共 32 条
[1]   Fraud detection system: A survey [J].
Abdallah, Aisha ;
Maarof, Mohd Aizaini ;
Zainal, Anazida .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 68 :90-113
[2]  
[Anonymous], 2017, Google
[3]  
[Anonymous], ARXIV160205629
[4]  
Bahnsen A. C., 2014, P 2014 SIAM INT C DA, P677
[5]  
Bahnsen A.C., P 2013 12 INT C MACH, V1, P333
[6]   Feature engineering strategies for credit card fraud detection [J].
Bahnsen, Alejandro Correa ;
Aouada, Djamila ;
Stojanovic, Aleksandar ;
Ottersten, Bjoern .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 51 :134-142
[7]  
Bian Y., 2016, PACIS 2016 Proceedings, P315
[8]  
Bolton R., 2001, CREDIT SCORING CREDI, V7, P235
[9]  
Bolton RJ, 2002, STAT SCI, V17, P235
[10]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)