Retrospective Study on the Influencing Factors and Prediction of Hospitalization Expenses for Chronic Renal Failure in China Based on Random Forest and LASSO Regression

被引:48
作者
Dai, Pingping [1 ,2 ]
Chang, Weifu [1 ]
Xin, Zirui [1 ,2 ]
Cheng, Haiwei [3 ]
Ouyang, Wei [1 ,2 ]
Luo, Aijing [4 ]
机构
[1] Cent South Univ, Xiangya Hosp 3, Key Lab Med Informat Res, Changsha, Peoples R China
[2] Cent South Univ, Sch Life Sci, Dept Med Informat, Changsha, Peoples R China
[3] Cent South Univ, Dept Sociol, Changsha, Peoples R China
[4] Cent South Univ, Xiangya Hosp 2, Changsha, Peoples R China
关键词
random forest; LASSO regression; chronic renal failure; hospitalization costs; influencing factors; prediction; CHRONIC KIDNEY-DISEASE; SYSTEM; COST;
D O I
10.3389/fpubh.2021.678276
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Aim: With the improvement in people's living standards, the incidence of chronic renal failure (CRF) is increasing annually. The increase in the number of patients with CRF has significantly increased pressure on China's medical budget. Predicting hospitalization expenses for CRF can provide guidance for effective allocation and control of medical costs. The purpose of this study was to use the random forest (RF) method and least absolute shrinkage and selection operator (LASSO) regression to predict personal hospitalization expenses of hospitalized patients with CRF and to evaluate related influencing factors. Methods: The data set was collected from the first page of data of the medical records of three tertiary first-class hospitals for the whole year of 2016. Factors influencing hospitalization expenses for CRF were analyzed. Random forest and least absolute shrinkage and selection operator regression models were used to establish a prediction model for the hospitalization expenses of patients with CRF, and comparisons and evaluations were carried out. Results: For CRF inpatients, statistically significant differences in hospitalization expenses were found for major procedures, medical payment method, hospitalization frequency, length of stay, number of other diagnoses, and number of procedures. The R-2 of LASSO regression model and RF regression model are 0.6992 and 0.7946, respectively. The mean absolute error (MAE) and root mean square error (RMSE) of the LASSO regression model were 0.0268 and 0.043, respectively, and the MAE and RMSE of the RF prediction model were 0.0171 and 0.0355, respectively. In the RF model, and the weight of length of stay was the highest (0.730). Conclusions: The hospitalization expenses of patients with CRF are most affected by length of stay. The RF prediction model is superior to the LASSO regression model and can be used to predict the hospitalization expenses of patients with CRF. Health administration departments may consider formulating accurate individualized hospitalization expense reimbursement mechanisms accordingly.
引用
收藏
页数:11
相关论文
共 39 条
[1]  
Al-Shdaifat Emad Adel, 2013, Indian J Med Sci, V67, P103
[2]   DRGs in Transfusion Medicine and Hemotherapy in Germany [J].
Bauer, Matthaeus ;
Ostermann, Helmut .
TRANSFUSION MEDICINE AND HEMOTHERAPY, 2012, 39 (02) :60-66
[3]  
Berry M.J., 2000, MASTERING DATA MININ
[4]   Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics [J].
Boulesteix, Anne-Laure ;
Janitza, Silke ;
Kruppa, Jochen ;
Koenig, Inke R. .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 2 (06) :493-507
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Prediction of medical expenses for gastric cancer based on process mining [J].
Cao, Yongzhong ;
Guo, Yalu ;
She, Qiang ;
Zhu, Junwu ;
Li, Bin .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (15)
[7]  
Vieira LGD, 2019, ARQ NEURO-PSIQUIAT, V77, P393, DOI [10.1590/0004-282X20190056, 10.1590/0004-282x20190056]
[8]   Renal Targeted Therapies of Antihypertensive and Cardiovascular Drugs for Patients With Stages 3 Through 5d Kidney Disease [J].
Dhaybi, O. A. I. ;
Bakris, G. L. .
CLINICAL PHARMACOLOGY & THERAPEUTICS, 2017, 102 (03) :450-458
[9]   An efficient convolutional neural network for coronary heart disease prediction [J].
Dutta, Aniruddha ;
Batabyal, Tamal ;
Basu, Meheli ;
Acton, Scott T. .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 159
[10]   A random forest classifier for the prediction of energy expenditure and type of physical activity from wrist and hip accelerometers [J].
Ellis, Katherine ;
Kerr, Jacqueline ;
Godbole, Suneeta ;
Lanckriet, Gert ;
Wing, David ;
Marshall, Simon .
PHYSIOLOGICAL MEASUREMENT, 2014, 35 (11) :2191-2203