Decouple then Combine: A Simple and Effective Framework for Fraud Transaction Detection

被引:0
作者
Tang, Pengwei [1 ]
Tang, Huayi [1 ]
Wang, Wenhan [2 ]
Su, Hanjing [2 ]
Liu, Yong [1 ]
机构
[1] Renmin Univ China, Beijing, Peoples R China
[2] Tencent Inc, Shenzhen, Peoples R China
来源
ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222 | 2023年 / 222卷
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Imbalance Learning; Fraud Detection; Tabular Data;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the popularity of electronic mobile and online payment, the demand for detecting financial fraudulent transactions is increasing. Although numerous efforts are devoted to tackling this problem, there are still two key challenges that are not well resolved, i.e., the class imbalance ratio of test samples are extremely larger than that of training samples and amount of detected fraudulent transactions do not be considered. In this paper, we propose a simple and effective framework composed of majority and minority branches to address the above issues. The input samples of majority and minority branches come from vanilla and re-adjusted distribution, respectively. Parameters of each branch are optimized individually, by which the representation learning for majority and minority samples are decoupled. Besides, an extra loss re-weighted by amount is added in the majority branch to improve the recall amount of detected fraudulent transactions. Theoretical results show that under the proposed framework, minimizing the empirical risk is guaranteed to achieve small generalization risk on more imbalanced data with high probability. Experiments on real-world datasets from Tencent Wechat payments demonstrate that our framework achieves superior performance than competitive methods in terms of both number and money of detected fraudulent transactions.
引用
收藏
页数:16
相关论文
共 24 条
  • [1] Batista G.E., 2004, ACM SIGKDD EXPL NEWS, V6, P20, DOI [10.1145/1007730.1007735, 10.1145/1007730.1007735.2, DOI 10.1145/1007730.1007735]
  • [2] Cao KD, 2019, ADV NEUR IN, V32
  • [3] Class-Balanced Loss Based on Effective Number of Samples
    Cui, Yin
    Jia, Menglin
    Lin, Tsung-Yi
    Song, Yang
    Belongie, Serge
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9260 - 9269
  • [4] Fan W, 1999, MACHINE LEARNING, PROCEEDINGS, P97
  • [5] EUSBoost: Enhancing ensembles for highly imbalanced data-sets by evolutionary undersampling
    Galar, Mikel
    Fernandez, Alberto
    Barrenechea, Edurne
    Herrera, Francisco
    [J]. PATTERN RECOGNITION, 2013, 46 (12) : 3460 - 3471
  • [6] Learning from Imbalanced Data
    He, Haibo
    Garcia, Edwardo A.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) : 1263 - 1284
  • [7] Karakoulas Grigoris, 1998, Advances in Neural Information Processing Systems, V11
  • [8] Focal Loss for Dense Object Detection
    Lin, Tsung-Yi
    Goyal, Priya
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2999 - 3007
  • [9] Lin WL, 2021, PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, P3670
  • [10] Intention-aware Heterogeneous Graph Attention Networks for Fraud Transactions Detection
    Liu, Can
    Sun, Li
    Ao, Xiang
    Feng, Jinghua
    He, Qing
    Yang, Hao
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 3280 - 3288