Medicare Fraud Detection Using Graph Analysis: A Comparative Study of Machine Learning and Graph Neural Networks

Cited by: 9
Authors
Yoo, Yeeun [1]
Shin, Jinho [1]
Kyeong, Sunghyon [2]
Affiliations
[1] KakaoBank, Div Res & Dev, Seongnam Si 13529, South Korea
[2] KakaoBank, Div Data Intelligence, Seongnam Si 13529, South Korea
Keywords
Graph neural network; graph centrality measure; machine learning; medicare fraud detection; CLASSIFICATION; MODEL
DOI
10.1109/ACCESS.2023.3305962
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Insurance companies have focused on Medicare fraud detection to reduce financial losses and reputational harm, because Medicare fraud causes tens of billions of dollars in damage annually. This study demonstrates that Medicare fraud detection can be significantly enhanced by introducing graph analysis that considers the relationships among medical providers, beneficiaries, and physicians. We used open-source tabular datasets containing beneficiary information, inpatient claims, outpatient claims, and labels indicating potentially fraudulent providers, and aggregated them into a single dataset by converting them into a graph structure. We then developed Medicare fraud detection models using two approaches to reflect graph information: graph neural network (GNN) models and traditional machine learning models using graph centrality measures. The machine learning model with graph centrality features improved precision by 4 percentage points (%p), recall by 24 %p, and F1-score by 14 %p compared to the best GNN model. A recall improvement of this extent could result in substantial cost savings of 5 billion dollars in the United States and 3.1 billion euros in Europe, benefiting governmental institutions and insurance companies involved in healthcare insurance operations. Furthermore, the training time of the best GNN model was approximately 250 to 300 times longer than that of the best machine learning model. This outcome suggests that successful and efficient detection of Medicare fraud can be achieved if graph centrality measures are used to capture the relationships among medical providers, physicians, and beneficiaries.
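The graph centrality idea in the abstract can be illustrated with a minimal sketch. The abstract does not say which centrality measures were used or how the graph was built, so everything here is an assumption: the toy claim rows, the identifiers (`PRV1`, `BEN1`, `PHY1`), the helper names, and the choice of degree centrality are hypothetical and are not the authors' method. The sketch links each provider to the beneficiaries and physicians on its claims, then derives one centrality score per provider that could be appended to a tabular feature set.

```python
from collections import defaultdict

# Hypothetical toy claims as (provider, beneficiary, physician) rows.
# These values are illustrative only and do not come from the study's dataset.
claims = [
    ("PRV1", "BEN1", "PHY1"),
    ("PRV1", "BEN2", "PHY1"),
    ("PRV1", "BEN3", "PHY2"),
    ("PRV2", "BEN3", "PHY2"),
]


def build_graph(rows):
    """Build an undirected graph linking each provider to the
    beneficiaries and physicians that appear on its claims."""
    adjacency = defaultdict(set)
    for provider, beneficiary, physician in rows:
        for other in (beneficiary, physician):
            adjacency[provider].add(other)
            adjacency[other].add(provider)
    return adjacency


def degree_centrality(adjacency):
    """Degree centrality: number of neighbours divided by (n - 1),
    where n is the number of nodes in the graph."""
    n = len(adjacency)
    if n < 2:
        return {node: 0.0 for node in adjacency}
    return {node: len(nbrs) / (n - 1) for node, nbrs in adjacency.items()}


graph = build_graph(claims)
centrality = degree_centrality(graph)

# Keep only provider nodes; each score becomes one extra tabular feature
# that a conventional classifier could consume alongside claim aggregates.
provider_features = {
    node: score for node, score in centrality.items() if node.startswith("PRV")
}
print(provider_features)
```

In this toy graph `PRV1` touches five of the six other nodes while `PRV2` touches two, so `PRV1` receives the higher score. Other measures, such as betweenness or closeness, would slot into the same pipeline by swapping the scoring function.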
Pages: 88278-88294
Page count: 17