A boosting ensemble learning based hybrid light gradient boosting machine and extreme gradient boosting model for predicting house prices

Cited by: 40
Authors
Sibindi, Racheal [1 ,4 ]
Mwangi, Ronald Waweru [2 ]
Waititu, Anthony Gichuhi [3 ]
Affiliations
[1] Pan African Univ Inst Basic Sci, Dept Math Technol & Innovat, Nairobi, Kenya
[2] Jomo Kenyatta Univ Agr & Technol, Sch Comp & Informat Technol, Dept Comp, Nairobi, Kenya
[3] Jomo Kenyatta Univ Agr & Technol, Stat & Actuarial Sci Dept, Nairobi, Kenya
[4] Pan African Univ, Inst Basic Sci, Dept Math Technol & Innovat, Nairobi, Kenya
Keywords
boosting ensemble learning; light gradient boosting machine; extreme gradient boosting;
DOI
10.1002/eng2.12599
Chinese Library Classification (CLC)
TP39 [Computer applications];
Subject classification codes
081203; 0835
Abstract
The implementation of tree-ensemble models has become increasingly important in solving classification and prediction problems, and boosting ensemble techniques have been widely used as individual machine learning algorithms for predicting house prices. One such technique is the LGBM algorithm, which employs a leaf-wise growth strategy that reduces loss and improves accuracy during training but is prone to overfitting. The XGBoost algorithm, by contrast, uses a level-wise growth strategy, which is slower to compute and therefore incurs higher computation time; nevertheless, XGBoost provides a regularization parameter and implements column sampling and weight shrinkage on new trees, which combats overfitting. This study develops a hybrid LGBM and XGBoost model that aims to prevent overfitting by minimizing variance while improving accuracy. Bayesian hyperparameter optimization is applied to the base learners to find the best combination of hyperparameters; this reduced variance (overfitting) in the hybrid model because the regularization parameter values were optimized. The hybrid model is compared with the LGBM, XGBoost, AdaBoost, and GBM algorithms to evaluate its ability to produce accurate house price predictions using the MSE, MAE, and MAPE evaluation metrics. The hybrid LGBM and XGBoost model outperformed the other models, with an MSE, MAE, and MAPE of 0.193, 0.285, and 0.156, respectively.
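The abstract describes the pipeline only at a high level. The sketch below shows one plausible reading of it in Python: tune LGBM and XGBoost base learners with Bayesian hyperparameter search (here via scikit-optimize's BayesSearchCV), combine their predictions, and score the hybrid with MSE, MAE, and MAPE. The combination rule (an unweighted average), the search spaces, and the placeholder data are assumptions made for illustration, not details taken from the paper.

```python
# Minimal sketch of a hybrid LGBM + XGBoost regressor with Bayesian tuning.
# Search spaces, averaging scheme, and data are assumptions, not the paper's setup.
import numpy as np
from lightgbm import LGBMRegressor
from xgboost import XGBRegressor
from skopt import BayesSearchCV                      # pip install scikit-optimize
from sklearn.model_selection import train_test_split
from sklearn.metrics import (mean_squared_error, mean_absolute_error,
                             mean_absolute_percentage_error)

def tune(estimator, search_space, X, y):
    """Bayesian hyperparameter search for one base learner."""
    opt = BayesSearchCV(estimator, search_space, n_iter=25, cv=5,
                        scoring="neg_mean_squared_error", random_state=0)
    opt.fit(X, y)
    return opt.best_estimator_

# Placeholder data; in the paper this would be house-price features and prices.
X, y = np.random.rand(500, 8), np.random.rand(500)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

# Tune each base learner separately (assumed, illustrative search spaces).
lgbm = tune(LGBMRegressor(),
            {"num_leaves": (15, 127),
             "learning_rate": (0.01, 0.3, "log-uniform"),
             "n_estimators": (100, 1000)},
            X_tr, y_tr)
xgb = tune(XGBRegressor(),
           {"max_depth": (3, 10),
            "learning_rate": (0.01, 0.3, "log-uniform"),
            "reg_lambda": (0.1, 10.0, "log-uniform"),   # regularization parameter
            "n_estimators": (100, 1000)},
           X_tr, y_tr)

# Hybrid prediction: unweighted average of the tuned base learners
# (the paper may weight or stack them differently).
pred = 0.5 * lgbm.predict(X_te) + 0.5 * xgb.predict(X_te)

print("MSE :", mean_squared_error(y_te, pred))
print("MAE :", mean_absolute_error(y_te, pred))
print("MAPE:", mean_absolute_percentage_error(y_te, pred))
```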
Pages: 19