Reinforced Risk Prediction With Budget Constraint Using Irregularly Measured Data From Electronic Health Records

被引:1
作者
Pan, Yinghao [1 ]
Laber, Eric B. [2 ]
Smith, Maureen A. [3 ]
Zhao, Ying-Qi [4 ]
机构
[1] Univ N Carolina, Dept Math & Stat, Charlotte, NC USA
[2] North Carolina State Univ, Dept Stat, Raleigh, NC USA
[3] Univ Wisconsin, Dept Populat Hlth Sci, Madison, WI USA
[4] Fred Hutchinson Canc Res Ctr, Publ Hlth Sci Div, Seattle, WA 98109 USA
基金
美国国家卫生研究院;
关键词
Classification with reject option; Cost-effective; Electronic health records; Functional principal component analysis; Reinforcement learning; FUNCTIONAL DATA-ANALYSIS; CLASSIFICATION; PERFORMANCE; DISEASE; SPARSE;
D O I
10.1080/01621459.2021.1978467
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Uncontrolled glycated hemoglobin (HbA1c) levels are associated with adverse events among complex diabetic patients. These adverse events present serious health risks to affected patients and are associated with significant financial costs. Thus, a high-quality predictive model that could identify high-risk patients so as to inform preventative treatment has the potential to improve patient outcomes while reducing healthcare costs. Because the biomarker information needed to predict risk is costly and burdensome, it is desirable that such a model collect only as much information as is needed on each patient so as to render an accurate prediction. We propose a sequential predictive model that uses accumulating patient longitudinal data to classify patients as: high-risk, low-risk, or uncertain. Patients classified as high-risk are then recommended to receive preventative treatment and those classified as low-risk are recommended to standard care. Patients classified as uncertain are monitored until a high-risk or low-risk determination is made. We construct the model using claims and enrollment files from Medicare, linked with patient electronic health records (EHR) data. The proposed model uses functional principal components to accommodate noisy longitudinal data and weighting to deal with missingness and sampling bias. The proposed method demonstrates higher predictive accuracy and lower cost than competing methods in a series of simulation experiments and application to data on complex patients with diabetes. Supplementary materials for this article are available online.
引用
收藏
页码:1090 / 1101
页数:12
相关论文
共 35 条
  • [2] [Anonymous], 2017, NAT DIAB STAT REP
  • [3] Bartlett PL, 2008, J MACH LEARN RES, V9, P1823
  • [4] Bather J., 2000, DECISION THEORY INTR, V180
  • [5] Incorporating Patient Preferences into Estimation of Optimal Individualized Treatment Rules
    Butler, Emily L.
    Laber, Eric B.
    Davis, Sonia M.
    Kosorok, Michael R.
    [J]. BIOMETRICS, 2018, 74 (01) : 18 - 26
  • [6] Prediction model for high glycated hemoglobin concentration among ethnic Chinese in Taiwan
    Chien, Kuo-Liong
    Lin, Hung-Ju
    Lee, Bai-Chin
    Hsu, Hsiu-Ching
    Chen, Ming-Fong
    [J]. CARDIOVASCULAR DIABETOLOGY, 2010, 9
  • [7] CHOW CK, 1970, IEEE T INFORM THEORY, V16, P41, DOI 10.1109/TIT.1970.1054406
  • [8] Measuring performance in primary care: What patient outcome indicators do physicians value?
    Dassow, Paul L.
    [J]. JOURNAL OF THE AMERICAN BOARD OF FAMILY MEDICINE, 2007, 20 (01) : 1 - 8
  • [9] Can Claims Data Algorithms Identify the Physician of Record?
    DuGoff, Eva H.
    Walden, Emily
    Ronk, Katie
    Palta, Mari
    Smith, Maureen
    [J]. MEDICAL CARE, 2018, 56 (03) : e16 - e20
  • [10] Electronic Health Records and Community Health Surveillance of Childhood Obesity
    Flood, Tracy L.
    Zhao, Ying-Qi
    Tomayko, Emily J.
    Tandias, Aman
    Carrel, Aaron L.
    Hanrahan, Lawrence P.
    [J]. AMERICAN JOURNAL OF PREVENTIVE MEDICINE, 2015, 48 (02) : 234 - 240