Predictive modeling of total healthcare costs using pharmacy claims data - A comparison of alternative econometric cost modeling techniques

Cited by: 61
Authors
Powers, CA [1]
Meyer, CM [1]
Roebuck, MC [1]
Vaziri, B [1]
Affiliations
[1] Caremark, Hunt Valley, MD USA
Keywords
predictive modeling; risk assessment; two-part model; log model; generalized linear model; pharmacy
DOI
10.1097/01.mlr.0000182408.54390.00
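Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Discipline classification code
Abstract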
Objective: We sought to evaluate several statistical modeling approaches for predicting prospective total annual health costs (medical plus pharmacy) of health plan participants using Pharmacy Health Dimensions (PHD), a pharmacy claims-based risk index.
Methods: We undertook a 2-year (baseline year/follow-up year) longitudinal analysis of integrated medical and pharmacy claims; included were plan participants younger than 65 years of age with continuous medical and pharmacy coverage (n = 344,832). PHD drug categories, age, gender, and pharmacy costs were derived across the baseline year. Annual total health costs were calculated for each plan participant in the follow-up year. Models examined included ordinary least squares (OLS) regression, log-transformed OLS regression with smearing estimator, and three two-part models using OLS regression, log-OLS regression with smearing estimator, and generalized linear modeling (GLM), respectively. A 10% random sample was withheld for model validation, which was assessed via adjusted R², mean absolute prediction error, specificity, and positive predictive value.
Results: Most PHD drug categories were significant independent predictors of total costs. Among the models tested, the OLS model had the lowest mean absolute prediction error and the highest adjusted R². The log-OLS and two-part log-OLS models did not predict costs accurately because of log-scale heteroscedasticity. The two-part model using GLM had a lower adjusted R² but similar performance on the other assessment measures compared with the OLS and two-part OLS models.
Conclusion: The PHD system, derived solely from pharmacy claims data, can be used to predict future total health costs. Using PHD with a simple OLS model may provide predictive accuracy similar to that of more advanced econometric models.
Pages: 1065-1072
Number of pages: 8
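The abstract compares plain OLS with log-transformed OLS retransformed by a smearing estimator, validated on a withheld 10% sample via mean absolute prediction error. The sketch below illustrates that comparison in Python with NumPy. It is not the authors' code: the data are synthetic, the two covariates (`age`, `drug_flag`) are hypothetical stand-ins for the PHD drug categories, and the helper names are invented for illustration. The synthetic errors are homoscedastic on the log scale, so this example does not reproduce the heteroscedasticity problem the abstract reports.

```python
import numpy as np

def fit_ols(X, y):
    """Least-squares coefficients for y ~ X (X must already hold an intercept column)."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta

def fit_log_ols_smearing(X, y):
    """Fit OLS on log(y) for strictly positive y; return coefficients and smearing factor."""
    log_y = np.log(y)
    beta = fit_ols(X, log_y)
    residuals = log_y - X @ beta
    # Smearing factor: mean of exponentiated log-scale residuals.
    smear = np.mean(np.exp(residuals))
    return beta, smear

def predict_log_ols_smearing(X, beta, smear):
    """Retransform log-scale predictions to the cost scale using the smearing factor."""
    return np.exp(X @ beta) * smear

def mean_absolute_prediction_error(y_true, y_pred):
    return np.mean(np.abs(y_true - y_pred))

# Synthetic, strictly positive "costs" (assumption: illustrative data only).
rng = np.random.default_rng(0)
n = 5000
age = rng.uniform(18, 64, n)
drug_flag = rng.integers(0, 2, n)  # hypothetical indicator for one drug category
X = np.column_stack([np.ones(n), age, drug_flag])
cost = np.exp(5.0 + 0.02 * age + 0.8 * drug_flag + rng.normal(0.0, 1.0, n))

# Withhold a 10% random sample for validation, as described in the abstract.
idx = rng.permutation(n)
n_val = n // 10
val, train = idx[:n_val], idx[n_val:]

# Plain OLS on the cost scale.
beta_ols = fit_ols(X[train], cost[train])
pred_ols = X[val] @ beta_ols

# Log-OLS with smearing retransformation.
beta_log, smear = fit_log_ols_smearing(X[train], cost[train])
pred_log = predict_log_ols_smearing(X[val], beta_log, smear)

mape_ols = mean_absolute_prediction_error(cost[val], pred_ols)
mape_log = mean_absolute_prediction_error(cost[val], pred_log)
print(f"smearing factor: {smear:.3f}")
print(f"MAPE (OLS): {mape_ols:.1f}  MAPE (log-OLS + smearing): {mape_log:.1f}")
```

Without the smearing factor, `np.exp(X @ beta_log)` estimates something closer to the median cost than the mean, which is why the retransformation step matters when the target is total (mean) cost. A two-part variant would first model the probability of any cost and then apply one of these models to participants with positive costs.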