Healthcare Cost Prediction for Heterogeneous Patient Profiles Using Deep Learning Models with Administrative Claims Data

Cited by: 0
Authors
Morid, Mohammad Amin [1]
Sheng, Olivia R. Liu [2]
Affiliations
[1] Santa Clara Univ, Leavey Sch Business, Dept Informat Syst & Analyt, Santa Clara, CA 95053 USA
[2] Arizona State Univ, W P Carey Sch Business, Dept Informat Syst, Tempe, AZ 85281 USA
Keywords
high-need patients; heterogeneity; cost prediction; risk adjustment model; representation learning; channel-wise deep learning; RISK-ADJUSTMENT; RHEUMATOID-ARTHRITIS; IDENTIFYING PATIENTS; ANALYTICS; SELECTION; CAPITATION; RECORDS
DOI
10.1287/isre.2021.0643
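Chinese Library Classification (CLC) number
G25 [Library science, librarianship]; G35 [Information science, information work]
Discipline classification code
1205; 120501
Abstract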
Accurate and fair patient cost predictions, which can lead to healthcare payer cost savings, are essential to support effective decision making regarding health management policies and resource allocations. Patient cost prediction models utilize administrative claims (AC) data collected from multiple healthcare providers, which payers (e.g., government agencies and private insurance companies) rely on for various reimbursement purposes. Both the variety of patient clinical profiles and the multisource nature of the big data from ACs introduce heterogeneity, which undermines both the generalization power and the algorithmic fairness of cost prediction models. In particular, the prediction performance and economic outcomes (such as underpayments and overpayments) of these models for high-need (HN) patients with multiple and complex chronic conditions differ from those for healthy patients, as their underlying heterogeneous medical profiles are distinct. This study, grounded in sociotechnical considerations for patient cost prediction, presents two key design insights. First, we designed a channel-wise deep learning framework to reduce AC data heterogeneity through effective representation learning, with a separate channel for each type of code as well as each type of cost. Second, we incorporated humanistic outcomes and a multichannel entropy measurement into a flexible evaluation design for patient heterogeneity. We evaluate the effectiveness of the proposed channel-wise framework both internally and externally using two real-world data sets containing approximately 111,000 and 134,000 individuals, respectively. On average, channel-wise models reduce prediction errors by 23% compared with the most competitive single-channel counterparts, leading to respective reductions of 16.4% and 19.3% in overpayments and underpayments for patients. The reduction in bias for predictions involving HN patients is more significant than for other patient groups. Our findings offer important implications for decision makers in healthcare and other fields facing similar sociotechnical challenges related to the interplay between diverse population behaviors and data heterogeneity.
Pages: 26