Healthcare Cost Prediction for Heterogeneous Patient Profiles Using Deep Learning Models with Administrative Claims Data

Cited by: 0
Authors
Morid, Mohammad Amin [1]
Sheng, Olivia R. Liu [2]
Affiliations
[1] Santa Clara Univ, Leavey Sch Business, Dept Informat Syst & Analyt, Santa Clara, CA 95053 USA
[2] Arizona State Univ, W P Carey Sch Business, Dept Informat Syst, Tempe, AZ 85281 USA
Keywords
high-need patients; heterogeneity; cost prediction; risk adjustment model; representation learning; channel-wise deep learning; RISK-ADJUSTMENT; RHEUMATOID-ARTHRITIS; IDENTIFYING PATIENTS; ANALYTICS; SELECTION; CAPITATION; RECORDS;
DOI
10.1287/isre.2021.0643
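Chinese Library Classification (CLC)
G25 [Library Science, Librarianship]; G35 [Information Science, Information Work]
Discipline classification code
1205; 120501
Abstract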
Accurate and fair patient cost predictions, which can lead to healthcare payer cost savings, are essential to support effective decision making regarding health management policies and resource allocations. Patient cost prediction models utilize administrative claims (AC) data collected from multiple healthcare providers, which payers (e.g., government agencies and private insurance companies) rely on for various reimbursement purposes. Both the variety of patient clinical profiles and the multisource nature of the big data from ACs introduce heterogeneity, which undermines both the generalization power and the algorithmic fairness of cost prediction models. In particular, the prediction performance and economic outcomes (such as underpayments and overpayments) of these models for high-need (HN) patients with multiple and complex chronic conditions differ from those for healthy patients, as their underlying heterogeneous medical profiles are distinct. This study, grounded in sociotechnical considerations for patient cost prediction, presents two key design insights. First, we designed a channel-wise deep learning framework to reduce AC data heterogeneity through effective representation learning, with a separate channel for each type of code as well as each type of cost. Second, we incorporated humanistic outcomes and a multichannel entropy measurement into a flexible evaluation design for patient heterogeneity. We evaluate the effectiveness of the proposed channel-wise framework both internally and externally using two real-world data sets containing approximately 111,000 and 134,000 individuals, respectively. On average, channel-wise models reduce prediction errors by 23% compared with the most competitive single-channel counterparts, leading to respective reductions of 16.4% and 19.3% in overpayments and underpayments for patients. The reduction in bias for predictions involving HN patients is more significant than for other patient groups. Our findings offer important implications for decision makers in healthcare and other fields facing similar sociotechnical challenges related to the interplay between diverse population behaviors and data heterogeneity.
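The channel-wise idea described in the abstract (a separate channel for each type of code and each type of cost) can be illustrated with a minimal sketch. This is not the authors' architecture: the channel names, input dimensions, single dense layer per channel, and random untrained weights are all assumptions made purely for illustration. It only shows the structural point that each input type is encoded by its own parameters before the representations are fused for a single cost prediction.

```python
import numpy as np

rng = np.random.default_rng(0)


def relu(x):
    return np.maximum(x, 0.0)


class Channel:
    """Encoder for one input type (hypothetical: a single dense layer)."""

    def __init__(self, in_dim, out_dim):
        self.W = rng.normal(scale=0.1, size=(in_dim, out_dim))
        self.b = np.zeros(out_dim)

    def encode(self, x):
        return relu(x @ self.W + self.b)


class ChannelWiseModel:
    """Encodes every channel separately, then fuses them for one output."""

    def __init__(self, channel_dims, hidden=8):
        # One independent encoder per channel: no weights are shared.
        self.channels = {
            name: Channel(dim, hidden) for name, dim in channel_dims.items()
        }
        self.w_out = rng.normal(scale=0.1, size=hidden * len(channel_dims))

    def predict(self, inputs):
        parts = [self.channels[name].encode(inputs[name]) for name in self.channels]
        fused = np.concatenate(parts, axis=1)  # shape: (batch, hidden * n_channels)
        return fused @ self.w_out              # one predicted cost per patient


# Hypothetical channels and feature sizes (assumptions, not from the paper).
channel_dims = {
    "diagnosis_codes": 50,
    "procedure_codes": 30,
    "medical_cost": 12,
    "pharmacy_cost": 12,
}
model = ChannelWiseModel(channel_dims)
batch = {name: rng.random((4, dim)) for name, dim in channel_dims.items()}
pred = model.predict(batch)
print(pred.shape)
```

A single-channel counterpart would instead concatenate all raw inputs first and pass them through one shared encoder; the sketch keeps the encoders separate so that each data type gets its own learned representation.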
Pages: 26