Online Payment Fraud Detection for Big Data

被引:0
|
作者
Tawde, Samiksha Dattaprasad [1 ]
Arora, Sandhya [1 ]
Thakur, Yashasvee Shitalkumar [1 ]
机构
[1] MKSSSs Cummins Coll Engn Women, Dept Comp Engn, Pune, Maharashtra, India
关键词
PySpark; Fraud Detection; Big Data; Resilient Distributed Datasets; Decision Tree; Random Forest; Logistic Regression;
D O I
10.1007/978-3-031-50583-6_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern economic life is now greatly facilitated by online payment systems, which allow for seamless financial transactions. However, the risk of online payment fraud has greatly increased along with the growth of digital transactions. This calls for the creation of sophisticated fraud detection systems that can instantly evaluate huge amounts of transaction data. This study suggests a novel method for identifying online payment fraud by utilizing big data management techniques, more specifically PySpark's capabilities. PySpark uses Resilient Distributed Datasets (RDD), data structure and stores data in RAM instead of writing it to disk after each operation. RDD operations are lazy i.e., they will not execute unless an action operation is called on them. After preprocessing the data Machine Learning algorithms from Spark ML package are applied, the ML library of PySpark provides optimized Machine Learning capabilities for Classification problems that require distributed computing. Further, models of classification algorithms that qualify with the best metrics are developed on our dataset and used for making accurate detections. Our Fraud detection system aims to assist Large organizations in assessing their enormous amount of transaction data to detect possible anomalies or fraudulent activities.
引用
收藏
页码:324 / 337
页数:14
相关论文
共 50 条
  • [31] Effective detection of sophisticated online banking fraud on extremely imbalanced data
    Wei, Wei
    Li, Jinjiu
    Cao, Longbing
    Ou, Yuming
    Chen, Jiahang
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2013, 16 (04): : 449 - 475
  • [32] Effective detection of sophisticated online banking fraud on extremely imbalanced data
    Wei Wei
    Jinjiu Li
    Longbing Cao
    Yuming Ou
    Jiahang Chen
    World Wide Web, 2013, 16 : 449 - 475
  • [33] Leveraging Adversarial Augmentation on Imbalance Data for Online Trading Fraud Detection
    Teng, Hu
    Wang, Cheng
    Yang, Qing
    Chen, Xue
    Li, Rui
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 1602 - 1614
  • [34] Online Anomaly Detection over Big Data Streams
    Rettig, Laura
    Khayati, Mourad
    Cudre-Mauroux, Philippe
    Piorkowski, Michal
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1113 - 1122
  • [35] Online Transaction Fraud Detection Techniques: A Review of Data Mining Approaches
    Sagar, B. B.
    Singh, Pratibha
    Mallika, S.
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 3756 - 3761
  • [36] A Distributed Approach of Big Data Mining for Financial Fraud Detection in a Supply Chain
    Zhou, Hangjun
    Sun, Guang
    Fu, Sha
    Fan, Xiaoping
    Jiang, Wangdong
    Hu, Shuting
    Li, Lingjiao
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 64 (02): : 1091 - 1105
  • [37] Improving Medicare Fraud Detection through Big Data Size Reduction Techniques
    Wang, Huanjing
    Hancock, John T., III
    Khoshgoftaar, Taghi M.
    2023 IEEE INTERNATIONAL CONFERENCE ON SERVICE-ORIENTED SYSTEM ENGINEERING, SOSE, 2023, : 208 - 217
  • [38] Unsupervised Machine Learning for Card Payment Fraud Detection
    Parreno-Centeno, Mario
    Ali, Mohammed Aamir
    Guan, Yu
    van Moorsel, Aad
    RISKS AND SECURITY OF INTERNET AND SYSTEMS (CRISIS 2019), 2020, 12026 : 247 - 262
  • [39] Boosting Fraud Detection in Mobile Payment with Prior Knowledge
    Sun, Quan
    Tang, Tao
    Chai, Hongfeng
    Wu, Jie
    Chen, Yang
    APPLIED SCIENCES-BASEL, 2021, 11 (10):
  • [40] Medicare Fraud Detection using Random Forest with Class Imbalanced Big Data
    Bauder, Richard A.
    Khoshgoftaar, Taghi M.
    2018 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2018, : 80 - 87