Explainable unsupervised anomaly detection for healthcare insurance data

被引:0
|
作者
De Meulemeester, Hannes [1 ]
De Smet, Frank [2 ,3 ]
van Dorst, Johan [2 ]
Derroitte, Elise [2 ]
De Moor, Bart [1 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn, ESAT STADIUS, Kasteelpk Arenberg 10, B-3001 Leuven, Belgium
[2] Christian Hlth Insurance Fund, B-1031 Brussels, Belgium
[3] Katholieke Univ Leuven, Dept Publ Hlth & Primary Care Environm & Hlth, B-3000 Leuven, Belgium
关键词
Health insurance; Anomaly detection; Unsupervised machine learning; OUTLIER DETECTION;
D O I
10.1186/s12911-024-02823-6
中图分类号
R-058 [];
学科分类号
摘要
BackgroundWaste and fraud are important problems for health insurers to deal with. With the advent of big data, these insurers are looking more and more towards data mining and machine learning methods to help in detecting waste and fraud. However, labeled data is costly and difficult to acquire as it requires expert investigators and known care providers with atypical behavior.MethodsIn this work we show how recent advances in machine learning can be used to set up a workflow that can aid investigators in discovering practitioners or groups of practitioners with unusual resource use in order to more efficiently combat waste and fraud. We combine three different techniques, which have not been used in the context of healthcare insurance anomaly detection: categorical embeddings to deal with high-cardinality categorical variables, state-of-the-art unsupervised anomaly detection techniques to detect anomalies and Shapley additive explanations (SHAP) to explain the model output.ResultsThe method has been evaluated on providers with a known anomalous profile and with the help of experts of the largest health insurance fund in Belgium. The quantitative experiments show that categorical embeddings offer a significant improvement compared to standard methods and that the state-of-the-art unsupervised anomaly detection techniques generally show an improvement over traditional methods. In a practical setting, the proposed workflow with SHAP was able to detect a previously unknown, anomalous trend among general practitioners.ConclusionsThe proposed workflow is able to detect known care providers with atypical behaviour and helps expert investigators in making informed decisions concerning possible fraud or overconsumption in the health insurance field.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] An explainable unsupervised anomaly detection framework for Industrial Internet of Things
    Abudurexiti, Yilixiati
    Han, Guangjie
    Zhang, Fan
    Liu, Li
    COMPUTERS & SECURITY, 2025, 148
  • [2] Learning Representations from Healthcare Time Series Data for Unsupervised Anomaly Detection
    Pereira, Joao
    Silveira, Margarida
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 84 - 90
  • [3] Unsupervised Anomaly Detection in Transactional Data
    Bouguessa, Mohamed
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 526 - 531
  • [4] Explainable Unsupervised Multi-Sensor Industrial Anomaly Detection and Categorization
    Ameli, Mina
    Becker, Philipp Aaron
    Lankers, Katharina
    van Ackeren, Markus
    Baehring, Holger
    Maass, Wolfgang
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 1468 - 1475
  • [5] Tree-based Kendall's τ Maximization for Explainable Unsupervised Anomaly Detection
    Kong, Lanfang
    Huet, Alexis
    Rossi, Dario
    Sozio, Mauro
    23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 1073 - 1078
  • [6] Unsupervised Anomaly Detection in Sequential Process Data
    Bulut, Okan
    Gorgun, Guher
    He, Surina
    ZEITSCHRIFT FUR PSYCHOLOGIE-JOURNAL OF PSYCHOLOGY, 2024, 232 (02): : 74 - 94
  • [7] Unsupervised Anomaly Detection in Data Quality Control
    Poon, Lex
    Farshidi, Siamak
    Li, Na
    Zhao, Zhiming
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 2327 - 2336
  • [8] Unsupervised Anomaly Detection on Temporal Multiway Data
    Duc Nguyen
    Phuoc Nguyen
    Kien Do
    Rana, Santu
    Gupta, Sunil
    Truyen Tran
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 1059 - 1066
  • [9] SoftPatch: Unsupervised Anomaly Detection with Noisy Data
    Jiang, Xi
    Liu, Jianlin
    Wang, Jinbao
    Nie, Qian
    Wu, Kai
    Liu, Yong
    Wang, Chengjie
    Zheng, Feng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [10] Explainable Action-Recognition Based Approach for Unsupervised Video Anomaly Detection
    Yang, Hui
    Khouadjia, Mostepha
    Seghouani, Nacera
    Ma, Yue
    Delmas, Serge
    ADVANCES IN VISUAL COMPUTING, ISVC 2024, PT I, 2025, 15046 : 182 - 195