Healthcare Cost Prediction for Heterogeneous Patient Profiles Using Deep Learning Models with Administrative Claims Data

被引：0

作者：

Morid, Mohammad Amin ^{[1
]}

Sheng, Olivia R. Liu ^{[2
]}

机构：

[1] Santa Clara Univ, Leavey Sch Business, Dept Informat Syst & Analyt, Santa Clara, CA 95053 USA

[2] Arizona State Univ, W P Carey Sch Business, Dept Informat Syst, Tempe, AZ 85281 USA

来源：

INFORMATION SYSTEMS RESEARCH | 2025年

关键词：

high-need patients; heterogeneity; cost prediction; risk adjustment model; representation learning; channel-wise deep learning; RISK-ADJUSTMENT; RHEUMATOID-ARTHRITIS; IDENTIFYING PATIENTS; ANALYTICS; SELECTION; CAPITATION; RECORDS;

D O I：

10.1287/isre.2021.0643

中图分类号：

G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];

学科分类号：

1205 ; 120501 ;

摘要：

Accurate and fair patient cost predictions, which can lead to healthcare payer cost savings, are essential to support effective decision making regarding health management policies and resource allocations. Patient cost prediction models utilize administrative claims (AC) data collected from multiple healthcare providers, which payers (e.g., government agencies and private insurance companies) rely on for various reimbursement purposes. Both the variety of patient clinical profiles and the multisource nature of the big data from ACs introduce heterogeneity, which undermines both the generalization power and the algorithmic fairness of cost prediction models. In particular, the prediction performance and economic outcomes-such as both underpayments and overpayments-of these models for high-need (HN) patients with multiple and complex chronic conditions differ from those of healthy patients, as their underlying heterogeneous medical profiles are distinct. This study, grounded in sociotechnical considerations for patient cost prediction, presents two key design insights. First, we designed a channel-wise deep learning framework to reduce AC data heterogeneity through effective representation learning, with a separate channel each type of code as well as each type of cost. Second, we incorporated humanistic outcomes and a multichannel entropy measurement into a flexible evaluation design for patient heterogeneity. We evaluate the effectiveness of the proposed channel-wise framework both internally and externally using two real-world data sets containing approximately 111,000 and 134,000 individuals, respectively. On average, channel-wise models substantially reduce prediction errors by 23% compared with the most competitive single-channel counterparts, leading to respective reductions of 16.4% and 19.3% in overpayments and underpayments for patients. The reduction in bias for predictions involving HN patients is more significant than for other patient groups. Our findings offer important implications for decision makers in healthcare and other fields facing similar sociotechnical challenges related to the interplay between diverse population behaviors and data heterogeneity.

引用

页数：26

共 32 条

[31] Prediction models incorporating second metacarpal cortical index for osteoporosis in rheumatoid arthritis: Externally validated machine learning models developed using data from the KURAMA cohort
Saito, Ryohei
Fujii, Takayuki
Murata, Koichi
Onishi, Akira
Murakami, Kosaku
Tanaka, Masao
Ohmura, Koichiro
Yasuda, Tadashi
Morinobu, Akio
Matsuda, Shuichi
INTERNATIONAL JOURNAL OF RHEUMATIC DISEASES, 2024, 27 (10)
[32] Data-driven prediction of prolonged air leak after video-assisted thoracoscopic surgery for lung cancer: Development and validation of machine-learning-based models using real-world data through the ePath system
Tou, Saori
Matsumoto, Koutarou
Hashinokuchi, Asato
Kinoshita, Fumihiko
Nakaguma, Hideki
Kozuma, Yukio
Sugeta, Rui
Nohara, Yasunobu
Yamashita, Takanori
Wakata, Yoshifumi
Takenaka, Tomoyoshi
Iwatani, Kazunori
Soejima, Hidehisa
Yoshizumi, Tomoharu
Nakashima, Naoki
Kamouchi, Masahiro
LEARNING HEALTH SYSTEMS, 2024,

← 1 2 3 4 →