Transaction aggregation as a strategy for credit card fraud detection

被引:158
作者
Whitrow, C. [1 ]
Hand, D. J. [1 ,2 ]
Juszczak, P. [1 ]
Weston, D. [1 ]
Adams, N. M. [2 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Inst Math Sci, London, England
[2] Univ London Imperial Coll Sci Technol & Med, Dept Math, London, England
基金
英国工程与自然科学研究理事会;
关键词
Fraud detection; Supervised classification; Credit cards; Preprocessing;
D O I
10.1007/s10618-008-0116-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of preprocessing transaction data for supervised fraud classification is considered. It is impractical to present an entire series of transactions to a fraud detection system, partly because of the very high dimensionality of such data but also because of the heterogeneity of the transactions. Hence, a framework for transaction aggregation is considered and its effectiveness is evaluated against transaction-level detection, using a variety of classification methods and a realistic cost-based performance measure. These methods are applied in two case studies using real data. Transaction aggregation is found to be advantageous in many but not all circumstances. Also, the length of the aggregation period has a large impact upon performance. Aggregation seems particularly effective when a random forest is used for classification. Moreover, random forests were found to perform better than other classification methods, including SVMs, logistic regression and KNN. Aggregation also has the advantage of not requiring precisely labeled data and may be more robust to the effects of population drift.
引用
收藏
页码:30 / 55
页数:26
相关论文
共 28 条
[1]   Comparing classifiers when the misallocation costs are uncertain [J].
Adams, NM ;
Hand, DJ .
PATTERN RECOGNITION, 1999, 32 (07) :1139-1147
[2]   CARDWATCH: A neural network based database mining system for credit card fraud detection [J].
Aleskerov, E ;
Freisleben, B ;
Rao, B .
PROCEEDINGS OF THE IEEE/IAFE 1997 COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING (CIFER), 1997, :220-226
[3]  
[Anonymous], IEEE INT C NETW SENS
[4]  
*APACS, 2006, FRAUD FACTS 2006
[5]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[6]  
Bolton RJ, 2002, STAT SCI, V17, P235
[7]  
BOLTON RJ, 2001, C CRED SCOR CRED CON, V7
[8]  
Brause R., 1999, NEURAL DATA MINING C, P103
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   Distributed data mining in credit card fraud detection [J].
Chan, PK ;
Fan, W ;
Prodromidis, AL ;
Stolfo, SJ .
IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1999, 14 (06) :67-74