Fraud Detection in Healthcare Insurance Claims Using Machine Learning

被引:7
|
作者
Nabrawi, Eman [1 ,2 ]
Alanazi, Abdullah [1 ,2 ]
机构
[1] King Saud Ibn Abdulaziz Univ Hlth Sci, Hlth Informat Dept, POB 3660, Riyadh 11481, Saudi Arabia
[2] King Abdullah Int Med Res Ctr, Riyadh 14611, Saudi Arabia
关键词
fraud; insurance claims; artificial neural networks (ANN); logistic regression (LR); random forest (RF); Saudi Arabia;
D O I
10.3390/risks11090160
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Healthcare fraud is intentionally submitting false claims or producing misinterpretation of facts to obtain entitlement payments. Thus, it wastes healthcare financial resources and increases healthcare costs. Subsequently, fraud poses a substantial financial challenge. Therefore, supervised machine and deep learning analytics such as random forest, logistic regression, and artificial neural networks are successfully used to detect healthcare insurance fraud. This study aims to develop a health model that automatically detects fraud from health insurance claims in Saudi Arabia. The model indicates the greatest contributing factor to fraud with optimal accuracy. The labeled imbalanced dataset used three supervised deep and machine learning methods. The dataset was obtained from three healthcare providers in Saudi Arabia. The applied models were random forest, logistic regression, and artificial neural networks. The SMOT technique was used to balance the dataset. Boruta object feature selection was applied to exclude insignificant features. Validation metrics were accuracy, precision, recall, specificity, F1 score, and area under the curve (AUC). Random forest classifiers indicated policy type, education, and age as the most significant features with an accuracy of 98.21%, 98.08% precision, 100% recall, an F1 score of 99.03%, specificity of 80%, and an AUC of 90.00%. Logistic regression resulted in an accuracy of 80.36%, 97.62% precision, 80.39% recall, an F1 score of 88.17%, specificity of 80%, and an AUC of 80.20%. ANN revealed an accuracy of 94.64%, 98.00% precision, 96.08% recall, an F1 score of 97.03%, a specificity of 80%, and an AUC of 88.04%. This predictive analytics study applied three successful models, each of which yielded acceptable accuracy and validation metrics; however, further research on a larger dataset is advised.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Correction to: Fraud Detection Using Machine Learning and Deep Learning
    Akash Gandhar
    Kapil Gupta
    Aman Kumar Pandey
    Dharm Raj
    SN Computer Science, 5 (7)
  • [22] How machine learning is transforming the insurance sector: case of fraud detection in Morocco
    Hamdoun, Nabila
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2021, 6 (04) : 273 - 282
  • [23] Performance comparative study of machine learning algorithms for automobile insurance fraud detection
    Itri, Bouzgarne
    Mohamed, Youssfi
    Mohammed, Qbadou
    Omar, Bouattane
    2019 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS 2019), 2019,
  • [24] CODM: an outlier detection method for medical insurance claims fraud
    Gao, Yongchang
    Guan, Haowen
    Gong, Bin
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2019, 20 (03) : 404 - 411
  • [25] Medicare Fraud Detection using Machine Learning Methods
    Bauder, Richard A.
    Khoshgoftaar, Taghi M.
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 858 - 865
  • [26] Detection and Prevention of Medical Fraud using Machine Learning
    Unal, Ceyda
    Erbuga, Gokce Sinem
    ACTA INFOLOGICA, 2024, 8 (02): : 100 - 117
  • [27] Claims auditing in automobile insurance: Fraud detection and deterrence objectives
    Tennyson, S
    Salsas-Forn, P
    JOURNAL OF RISK AND INSURANCE, 2002, 69 (03) : 289 - 308
  • [28] Fraud Detection and Frequent Pattern Matching in Insurance claims using Data Mining Techniques
    Verma, Aayushi
    Taneja, Anu
    Arora, Anuja
    2017 TENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2017, : 84 - 90
  • [29] Credit Card Fraud Detection Using Machine Learning
    Sailusha, Ruttala
    Gnaneswar, V
    Ramesh, R.
    Rao, G. Ramakoteswara
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 1264 - 1270
  • [30] Assessment of Healthcare Claims Rejection Risk Using Machine Learning
    Chimmad, Anundhara
    Saripalli, Prasad
    Tirumala, Venu
    2017 IEEE 19TH INTERNATIONAL CONFERENCE ON E-HEALTH NETWORKING, APPLICATIONS AND SERVICES (HEALTHCOM), 2017,