Predicting Medical Event Occurrence Using Medical Insurance Claims Big Data

被引:0
|
作者
Yoshimoto, Hiromasa [1 ]
Mitsutake, Naohiro [2 ]
Goda, Kazuo [1 ]
机构
[1] Univ Tokyo, Inst Ind Sci, Tokyo, Japan
[2] Inst Hlth Econ & Policy, Tokyo, Japan
来源
MEDINFO 2023 - THE FUTURE IS ACCESSIBLE | 2024年 / 310卷
关键词
Insurance claims; perdition model; machine learning; sparse data; low-frequency event;
D O I
10.3233/SHTI231046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Medical events are often infrequent, thus becomes hard to predict. In this paper, we focus on predictor that forecasts whether a medical event would occur in the next year, and analyzes the impact of event's frequency and data size via predictor's performance. In the experiment, we made 1572 predictors for medical events using Medical Insurance Claims (MICs) data from 800,000 participants and 205.8 mil- lion claims over 8 years. The result revealed that (a) forecasting error will be increased when predicting low-frequency events, and (b) increasing the number of training dataset reduces errors. This result suggests that increasing data size is a key to solve low frequency problems. However, we still need additional methods to cope with sparse and imbalanced data.
引用
收藏
页码:654 / 658
页数:5
相关论文
共 50 条
  • [1] Challenges of using medical insurance claims data for utilization analysis
    Tyree, Patrick T.
    Lind, Bonnie K.
    Lafferty, William E.
    AMERICAN JOURNAL OF MEDICAL QUALITY, 2006, 21 (04) : 269 - 275
  • [2] Predicting Medical Provider Specialties to Detect Anomalous Insurance Claims
    Bauder, Richard A.
    Khoshgoftaar, Taghi M.
    Richter, Aaron
    Herland, Matthew
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 784 - 790
  • [3] Health insurance and the demand for medical care: Instrumental variable estimates using health insurer claims data
    Dunn, Abe
    JOURNAL OF HEALTH ECONOMICS, 2016, 48 : 74 - 88
  • [4] Predicting Flood Property Insurance Claims over CONUS, Fusing Big Earth Observation Data
    Yang, Qing
    Shen, Xinyi
    Yang, Feifei
    Anagnostou, Emmanouil N.
    He, Kang
    Mo, Chongxun
    Seyyedi, Hojjat
    Kettner, Albert J.
    Zhang, Qingyuan
    BULLETIN OF THE AMERICAN METEOROLOGICAL SOCIETY, 2022, 103 (03) : E791 - E809
  • [5] Estimating episodes of care using linked medical claims data
    Williams, G
    Baxter, R
    Kelman, C
    Rainsford, C
    He, HX
    Gu, LF
    Vickers, D
    Hawkins, S
    AL 2002: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2002, 2557 : 660 - 671
  • [6] Predicting Motor Insurance Claims Using Telematics Data-XGBoost versus Logistic Regression
    Pesantez-Narvaez, Jessica
    Guillen, Montserrat
    Alcaniz, Manuela
    RISKS, 2019, 7 (02)
  • [7] Validation of medical service insurance claims as a surrogate for ascertaining vitiligo cases
    Bell, M.
    Lui, H.
    Lee, T. K.
    Kalia, S.
    ARCHIVES OF DERMATOLOGICAL RESEARCH, 2023, 315 (03) : 541 - 550
  • [8] Validation of medical service insurance claims as a surrogate for ascertaining vitiligo cases
    M. Bell
    H. Lui
    T. K. Lee
    S. Kalia
    Archives of Dermatological Research, 2023, 315 : 541 - 550
  • [9] Medical Provider Specialty Predictions for the Detection of Anomalous Medicare Insurance Claims
    Herland, Matthew
    Bauder, Richard A.
    Khoshgoftaar, Taghi M.
    2017 IEEE 18TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI 2017), 2017, : 579 - 588
  • [10] Predicting risk of suicidal behavior from insurance claims data vs. linked data from insurance claims and electronic health records
    Simon, Gregory E.
    Shortreed, Susan M.
    Johnson, Eric
    Yaseen, Zimri S.
    Stone, Marc
    Mosholder, Andrew D.
    Ahmedani, Brian K.
    Coleman, Karen J.
    Coley, R. Yates
    Penfold, Robert B.
    Toh, Sengwee
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2024, 33 (01)