Modeling Insurance Fraud Detection Using Imbalanced Data Classification

被引:38
|
作者
Hassan, Amira Kamil Ibrahim [1 ,2 ]
Abraham, Ajith [1 ,3 ]
机构
[1] Sudan Univ Sci & Technol, Dept Comp Sci, Khartoum, Sudan
[2] MIR Labs, Auburn, WA USA
[3] VSB Tech Univ Ostrava, IT4Innovat, Ostrava, Czech Republic
关键词
Insurance fraud detection; Imbalanced data; Decision tree; Support vector machine and artificial neural network; AUTOMOBILE INSURANCE; CLAIMS;
D O I
10.1007/978-3-319-27400-3_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an innovative insurance fraud detection method to deal with the imbalanced data distribution. The idea is based on building insurance fraud detection models using Decision tree (DT), Support vector machine (SVM) and Artificial Neural Network (ANN), on data partitions derived from under-sampling (with-replacement and without-replacement) of the majority class and merging it with the minority class. Throughout the paper, ten-fold cross validation method of testing is used. Its originality lies in the use of several partitioning under-sampling approaches and choosing the best. Results from a publicly available automobile insurance fraud detection data set demonstrate that DT performs slightly better than other algorithms, so DT model was used to compare between different partitioning-under-sampling approaches. Empirical results illustrate that the proposed model gave better results.
引用
收藏
页码:117 / 127
页数:11
相关论文
共 50 条
  • [21] Data misrepresentation detection for insurance underwriting fraud prevention
    Vandervorst, Felix
    Verbeke, Wouter
    Verdonck, Tim
    DECISION SUPPORT SYSTEMS, 2022, 159
  • [22] A fraud detection approach with data mining in health insurance
    Kirlidog, Melih
    Asuk, Cuneyt
    WORLD CONFERENCE ON BUSINESS, ECONOMICS AND MANAGEMENT (BEM-2012), 2012, 62 : 989 - 994
  • [23] Credit Card Fraud Detection Using Tree-based Algorithms For Highly Imbalanced Data
    Rezaei, Abdolazim
    Yazdinejad, Mohsen
    Sookhak, Mehdi
    2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,
  • [24] Effective detection of sophisticated online banking fraud on extremely imbalanced data
    Wei, Wei
    Li, Jinjiu
    Cao, Longbing
    Ou, Yuming
    Chen, Jiahang
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2013, 16 (04): : 449 - 475
  • [25] Effective detection of sophisticated online banking fraud on extremely imbalanced data
    Wei Wei
    Jinjiu Li
    Longbing Cao
    Yuming Ou
    Jiahang Chen
    World Wide Web, 2013, 16 : 449 - 475
  • [26] Enhancing credit card fraud detection: highly imbalanced data case
    Breskuviene, Dalia
    Dzemyda, Gintautas
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [27] Fraud Detection and Frequent Pattern Matching in Insurance claims using Data Mining Techniques
    Verma, Aayushi
    Taneja, Anu
    Arora, Anuja
    2017 TENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2017, : 84 - 90
  • [28] Application of classification methods to individual disability income insurance fraud detection
    Peng, Yi
    Kou, Gang
    Sabatka, Alan
    Matza, Jeff
    Chen, Zhengxin
    Khazanchi, Deepak
    Shi, Yong
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 3, PROCEEDINGS, 2007, 4489 : 852 - +
  • [29] Fraud Claims Detection in Insurance Using Machine Learning
    Kalra, Hritik
    Singh, Ranvir
    Kumar, T. Senthil
    JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 327 - 331
  • [30] Health Care Insurance Fraud Detection Using Blockchain
    Saldamli, Gokay
    Reddy, Vamshi
    Bojja, Krishna S.
    Gururaja, Manjunatha K.
    Doddaveerappa, Yashaswi
    Tawalbeh, Loai
    2020 SEVENTH INTERNATIONAL CONFERENCE ON SOFTWARE DEFINED SYSTEMS (SDS), 2020, : 145 - 152