Decouple then Combine: A Simple and Effective Framework for Fraud Transaction Detection

被引：0

作者：

Tang, Pengwei ^{[1
]}

Tang, Huayi ^{[1
]}

Wang, Wenhan ^{[2
]}

Su, Hanjing ^{[2
]}

Liu, Yong ^{[1
]}

机构：

[1] Renmin Univ China, Beijing, Peoples R China

[2] Tencent Inc, Shenzhen, Peoples R China

来源：

ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222 | 2023年 / 222卷

基金：

中国国家自然科学基金; 北京市自然科学基金;

关键词：

Imbalance Learning; Fraud Detection; Tabular Data;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the popularity of electronic mobile and online payment, the demand for detecting financial fraudulent transactions is increasing. Although numerous efforts are devoted to tackling this problem, there are still two key challenges that are not well resolved, i.e., the class imbalance ratio of test samples are extremely larger than that of training samples and amount of detected fraudulent transactions do not be considered. In this paper, we propose a simple and effective framework composed of majority and minority branches to address the above issues. The input samples of majority and minority branches come from vanilla and re-adjusted distribution, respectively. Parameters of each branch are optimized individually, by which the representation learning for majority and minority samples are decoupled. Besides, an extra loss re-weighted by amount is added in the majority branch to improve the recall amount of detected fraudulent transactions. Theoretical results show that under the proposed framework, minimizing the empirical risk is guaranteed to achieve small generalization risk on more imbalanced data with high probability. Experiments on real-world datasets from Tencent Wechat payments demonstrate that our framework achieves superior performance than competitive methods in terms of both number and money of detected fraudulent transactions.

引用

页数：16

共 24 条

[1] Batista G.E., 2004, ACM SIGKDD EXPL NEWS, V6, P20, DOI [10.1145/1007730.1007735, 10.1145/1007730.1007735.2, DOI 10.1145/1007730.1007735]
[2] Cao KD, 2019, ADV NEUR IN, V32
[3] Class-Balanced Loss Based on Effective Number of Samples
Cui, Yin
Jia, Menglin
Lin, Tsung-Yi
Song, Yang
Belongie, Serge
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9260 - 9269
[4] Fan W, 1999, MACHINE LEARNING, PROCEEDINGS, P97
[5] EUSBoost: Enhancing ensembles for highly imbalanced data-sets by evolutionary undersampling
Galar, Mikel
Fernandez, Alberto
Barrenechea, Edurne
Herrera, Francisco
[J]. PATTERN RECOGNITION, 2013, 46 (12) : 3460 - 3471
[6] Learning from Imbalanced Data
He, Haibo
Garcia, Edwardo A.
[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) : 1263 - 1284
[7] Karakoulas Grigoris, 1998, Advances in Neural Information Processing Systems, V11
[8] Focal Loss for Dense Object Detection
Lin, Tsung-Yi
Goyal, Priya
Girshick, Ross
He, Kaiming
Dollar, Piotr
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2999 - 3007
[9] Lin WL, 2021, PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, P3670
[10] Intention-aware Heterogeneous Graph Attention Networks for Fraud Transactions Detection
Liu, Can
Sun, Li
Ao, Xiang
Feng, Jinghua
He, Qing
Yang, Hao
[J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 3280 - 3288

← 1 2 3 →