Medicare Fraud Detection Using Graph Analysis: A Comparative Study of Machine Learning and Graph Neural Networks

被引:9
作者
Yoo, Yeeun [1 ]
Shin, Jinho [1 ]
Kyeong, Sunghyon [2 ]
机构
[1] KakaoBank, Div Res & Dev, Seongnam Si 13529, South Korea
[2] KakaoBank, Div Data Intelligence, Seongnam Si 13529, South Korea
关键词
Graph neural network; graph centrality measure; machine learning; medicare fraud detection; CLASSIFICATION; MODEL;
D O I
10.1109/ACCESS.2023.3305962
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Insurance companies have focused on medicare fraud detection to reduce financial losses and reputational harm because medicare fraud causes tens of billions of dollars in damage annually. This study demonstrates that medicare fraud detection can be significantly enhanced by introducing graph analysis with considering the relationships among medical providers, beneficiaries, and physicians. We use open-source tabular datasets containing beneficiary information, inpatient claims, outpatient claims, and indications about potential fraudulent providers. We then aggregated them into a single dataset by converting them into a graph structure. Furthermore, we developed medicare fraud detection models using two approaches to reflect graph information, i.e., graph neural network (GNN) models and traditional machine learning models using graph centrality measures. Therefore, the machine learning model with graph centrality features showed improved precision of 4 percent point (%p), recall of 24 %p, and F1-score of 14 %p compared to the best GNN model. The improvement in recall to this extent could result in substantial cost savings of 3.1 billion euros and 5 billion dollars in the United States and Europe, respectively, benefiting governmental institutions and insurance companies involved in healthcare insurance operations. Furthermore, the required learning time of the best GNN model was approximately 250-300 times more than that of the best machine-learning model. This outcome suggests that successful and efficient detection of medicare fraud can be achieved if graph centrality measures are used to capture the relationships among medical providers, physicians, and beneficiaries.
引用
收藏
页码:88278 / 88294
页数:17
相关论文
共 84 条
[71]  
Weber M, 2019, Arxiv, DOI arXiv:1908.02591
[72]   Graph Neural Networks in Recommender Systems: A Survey [J].
Wu, Shiwen ;
Sun, Fei ;
Zhang, Wentao ;
Xie, Xu ;
Cui, Bin .
ACM COMPUTING SURVEYS, 2023, 55 (05)
[73]   GFCNet: Utilizing graph feature collection networks for coronavirus knowledge graph embeddings [J].
Xie, Zhiwen ;
Zhu, Runjie ;
Liu, Jin ;
Zhou, Guangyou ;
Huang, Jimmy Xiangji ;
Cui, Xiaohui .
INFORMATION SCIENCES, 2022, 608 :1557-1571
[74]   PoxNet22: A Fine-Tuned Model for the Classification of Monkeypox Disease Using Transfer Learning [J].
Yasmin, Farhana ;
Hassan, Md. Mehedi ;
Hasan, Mahade ;
Zaman, Sadika ;
Kaushal, Chetna ;
El-Shafai, Walid ;
Soliman, Naglaa F. .
IEEE ACCESS, 2023, 11 :24053-24076
[75]   Big data analytics for default prediction using graph theory [J].
Yildirim, Mustafa ;
Okay, Feyza Yildirim ;
Ozdemir, Suat .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 176
[76]  
Yoo Y., 2022, ICECET, V12, P1, DOI [10.1109/ICECET55527.2022.9872963, DOI 10.1109/ICECET55527.2022.9872963]
[77]   SybilGuard: Defending against sybil attacks via social networks [J].
Yu, Haifeng ;
Kaminsky, Michael ;
Gibbons, Phillip B. ;
Flaxman, Abraham D. .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2008, 16 (03) :576-589
[78]   SybilLimit: A near-optimal social network defense against sybil attacks [J].
Yu, Haifeng ;
Kaminsky, Michael ;
Gibbons, Phillip B. ;
Xiao, Feng .
PROCEEDINGS OF THE 2008 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, 2008, :3-+
[79]   FRAUDRE: Fraud Detection Dual-Resistant to Graph Inconsistency and Imbalance [J].
Zhang, Ge ;
Wu, Jia ;
Yang, Jian ;
Beheshti, Amin ;
Xue, Shan ;
Zhou, Chuan ;
Sheng, Quan Z. .
2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, :867-876
[80]  
Zhao JA, 2021, AAAI CONF ARTIF INTE, V35, P4697