The role of diversity and ensemble learning in credit card fraud detection

被引:0
作者
Gian Marco Paldino
Bertrand Lebichot
Yann-Aël Le Borgne
Wissam Siblini
Frédéric Oblé
Giacomo Boracchi
Gianluca Bontempi
机构
[1] Université Libre de Bruxelles,Machine Learning Group, Computer Science Departement, Faculty of Sciences
[2] Politecnico di Milano,Dipartimento di Elettronica, Informazione e Bioingegneria
[3] Research,undefined
[4] Development and Innovation,undefined
来源
Advances in Data Analysis and Classification | 2024年 / 18卷
关键词
Finance; Fraud detection; Concept drift; Ensemble learning; Diversity; 68T05 Learning and adaptive systems in artificial intelligence;
D O I
暂无
中图分类号
学科分类号
摘要
The number of daily credit card transactions is inexorably growing: the e-commerce market expansion and the recent constraints for the Covid-19 pandemic have significantly increased the use of electronic payments. The ability to precisely detect fraudulent transactions is increasingly important, and machine learning models are now a key component of the detection process. Standard machine learning techniques are widely employed, but inadequate for the evolving nature of customers behavior entailing continuous changes in the underlying data distribution. his problem is often tackled by discarding past knowledge, despite its potential relevance in the case of recurrent concepts. Appropriate exploitation of historical knowledge is necessary: we propose a learning strategy that relies on diversity-based ensemble learning and allows to preserve past concepts and reuse them for a faster adaptation to changes. In our experiments, we adopt several state-of-the-art diversity measures and we perform comparisons with various other learning approaches. We assess the effectiveness of our proposed learning strategy on extracts of two real datasets from two European countries, containing more than 30 M and 50 M transactions, provided by our industrial partner, Worldline, a leading company in the field.
引用
收藏
页码:193 / 217
页数:24
相关论文
共 146 条
[1]  
Alippi C(2013)Just-in-time classifiers for recurrent concepts IEEE Trans Neural Netw Learn Syst 24 620-634
[2]  
Boracchi G(2021)Data engineering for fraud detection Decis Support Syst 150 602-613
[3]  
Roveri M(2011)Data mining for credit card fraud: a comparative study Decis Support Syst 50 235-255
[4]  
Baesens B(2002)Statistical fraud detection: a review Stat Sci 17 317-331
[5]  
Höppner S(2019)Combining unsupervised and supervised learning in credit card fraud detection Inf Sci 557 2997-3020
[6]  
Verdonck T(2017)Outlier-free merging of homogeneous groups of pre-classified observations under contamination J Stat Comput Simul 87 106-115
[7]  
Bhattacharyya S(2018)Newcomb–benford law and the detection of frauds in international trade Proc Natl Acad Sci 116 67-74
[8]  
Jha S(1999)Distributed data mining in credit card fraud detection IEEE Intell Syst Their Appl 14 321-357
[9]  
Tharakunnel K(2002)Smote: synthetic minority over-sampling technique J Artif Intell Res 16 261-283
[10]  
Westland JC(1989)The cn2 induction algorithm Mach Learn 3 4915-4928