Healthcare Cost Prediction for Heterogeneous Patient Profiles Using Deep Learning Models with Administrative Claims Data

被引:0
|
作者
Morid, Mohammad Amin [1 ]
Sheng, Olivia R. Liu [2 ]
机构
[1] Santa Clara Univ, Leavey Sch Business, Dept Informat Syst & Analyt, Santa Clara, CA 95053 USA
[2] Arizona State Univ, W P Carey Sch Business, Dept Informat Syst, Tempe, AZ 85281 USA
关键词
high-need patients; heterogeneity; cost prediction; risk adjustment model; representation learning; channel-wise deep learning; RISK-ADJUSTMENT; RHEUMATOID-ARTHRITIS; IDENTIFYING PATIENTS; ANALYTICS; SELECTION; CAPITATION; RECORDS;
D O I
10.1287/isre.2021.0643
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Accurate and fair patient cost predictions, which can lead to healthcare payer cost savings, are essential to support effective decision making regarding health management policies and resource allocations. Patient cost prediction models utilize administrative claims (AC) data collected from multiple healthcare providers, which payers (e.g., government agencies and private insurance companies) rely on for various reimbursement purposes. Both the variety of patient clinical profiles and the multisource nature of the big data from ACs introduce heterogeneity, which undermines both the generalization power and the algorithmic fairness of cost prediction models. In particular, the prediction performance and economic outcomes-such as both underpayments and overpayments-of these models for high-need (HN) patients with multiple and complex chronic conditions differ from those of healthy patients, as their underlying heterogeneous medical profiles are distinct. This study, grounded in sociotechnical considerations for patient cost prediction, presents two key design insights. First, we designed a channel-wise deep learning framework to reduce AC data heterogeneity through effective representation learning, with a separate channel each type of code as well as each type of cost. Second, we incorporated humanistic outcomes and a multichannel entropy measurement into a flexible evaluation design for patient heterogeneity. We evaluate the effectiveness of the proposed channel-wise framework both internally and externally using two real-world data sets containing approximately 111,000 and 134,000 individuals, respectively. On average, channel-wise models substantially reduce prediction errors by 23% compared with the most competitive single-channel counterparts, leading to respective reductions of 16.4% and 19.3% in overpayments and underpayments for patients. The reduction in bias for predictions involving HN patients is more significant than for other patient groups. Our findings offer important implications for decision makers in healthcare and other fields facing similar sociotechnical challenges related to the interplay between diverse population behaviors and data heterogeneity.
引用
收藏
页数:26
相关论文
共 32 条
  • [1] Development of Deep Learning Models for Predicting In-Hospital Mortality Using an Administrative Claims Database: Retrospective Cohort Study
    Matsui, Hiroki
    Yamana, Hayato
    Fushimi, Kiyohide
    Yasunaga, Hideo
    JMIR MEDICAL INFORMATICS, 2022, 10 (02)
  • [2] Time Series Prediction Using Deep Learning Methods in Healthcare
    Morid, Mohammad Amin
    Sheng, Olivia R. Liu
    Dunbar, Joseph
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2023, 14 (01)
  • [3] HTTPS: Heterogeneous Transfer learning for spliT Prediction System evaluated on healthcare data☆
    Syu, Jia-Hao
    Fojcik, Marcin
    Cupek, Rafal
    Lin, Jerry Chun-Wei
    INFORMATION FUSION, 2025, 113
  • [4] Learning hidden patterns from patient multivariate time series data using convolutional neural networks: A case study of healthcare cost prediction
    Morid, Mohammad Amin
    Sheng, Olivia R. Liu
    Kawamoto, Kensaku
    Abdelrahman, Samir
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 111
  • [5] Comparison of statistical and machine learning models for healthcare cost data: a simulation study motivated by Oncology Care Model (OCM) data
    Mazumdar, Madhu
    Lin, Jung-Yi Joyce
    Zhang, Wei
    Li, Lihua
    Liu, Mark
    Dharmarajan, Kavita
    Sanderson, Mark
    Isola, Luis
    Hu, Liangyuan
    BMC HEALTH SERVICES RESEARCH, 2020, 20 (01)
  • [6] Extending business failure prediction models with textual website content using deep learning
    Borchert, Philipp
    Coussement, Kristof
    De Caigny, Arno
    De Weerdt, Jochen
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2023, 306 (01) : 348 - 357
  • [7] New Hybrid Deep Learning Models to Predict Cost From Healthcare Providers in Smart Hospitals
    Bhatti, Muhammad Hamza Rafique
    Javaid, Nadeem
    Mansoor, Babar
    Alrajeh, Nabil
    Aslam, Muhammad
    Asad, Muhammad
    IEEE ACCESS, 2023, 11 : 136988 - 137010
  • [8] Models solely using claims-based administrative data are poor predictors of rheumatoid arthritis disease activity
    Brian C. Sauer
    Chia-Chen Teng
    Neil A. Accortt
    Zachary Burningham
    David Collier
    Mona Trivedi
    Grant W. Cannon
    Arthritis Research & Therapy, 19
  • [9] Models solely using claims-based administrative data are poor predictors of rheumatoid arthritis disease activity
    Sauer, Brian C.
    Teng, Chia-Chen
    Accortt, Neil A.
    Burningham, Zachary
    Collier, David
    Trivedi, Mona
    Cannon, Grant W.
    ARTHRITIS RESEARCH & THERAPY, 2017, 19
  • [10] Cost prediction for water reuse equipment using interpretable machine learning models
    Chen, Kan
    Zhang, Yuezheng
    Hu, Naixin
    Ye, Chao
    Ma, Ji
    Zheng, Tong
    JOURNAL OF WATER PROCESS ENGINEERING, 2024, 63