Advanced Machine Learning Algorithms for House Price Prediction: Case Study in Kuala Lumpur

被引:0
作者
Abdul-Rahman, Shuzlina [1 ]
Mutalib, Sofianita [1 ]
Zulkifley, Nor Hamizah [2 ]
Ibrahim, Ismail [3 ]
机构
[1] Univ Teknol MARA, Fac Comp & Math Sci, Res Initiat Grp Intelligent Syst, Shah Alam, Selangor, Malaysia
[2] Univ Teknol MARA, Fac Comp & Math Sci, Shah Alam, Selangor, Malaysia
[3] PETRONAS Digital Sdn Bhd, Data Sci Dept, Kuala Lumpur, Malaysia
关键词
House price; house price prediction; machine learning; property; regression analysis;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
House price is affected significantly by several factors and determining a reasonable house price involves a calculative process. This paper proposes advanced machine learning (ML) approaches for house price prediction. Two recent advanced ML algorithms, namely LightGBM and XGBoost were compared with two traditional approaches: multiple regression analysis and ridge regression. This study utilizes a secondary dataset called `Property Listing in Kuala Lumpur', gathered from Kaggle and Google Map, containing 21984 observations with 11 variables, including a target variable. The performance of the ML models was evaluated using mean absolute error (MAE), root mean square error (RMSE), and adjusted r-squared value. The findings revealed that the house price prediction model based on XGBoost showed the highest performance by generating the lowest MAE and RMSE, and the closest adjusted r-squared value to one, consistently outperformed other ML models. A new dataset which consists of 1300 samples was deployed at the model deployment stage. It was found that the percentage of the variance between the actual and predicted price was relatively small, which indicated that this model is reliable and acceptable. This study can greatly assist in predicting future house prices and the establishment of real estate policies.
引用
收藏
页码:736 / 745
页数:10
相关论文
共 49 条
  • [1] Abdul-Rahman S, 2021, INT J ADV COMPUT SC, V12, P434
  • [2] Abdullahi A., 2018, ATBU Journal of Environmental Technology, V11, P26
  • [3] Alfiyatin AN, 2017, INT J ADV COMPUT SC, V8, P323, DOI 10.14569/IJACSA.2017.081042
  • [4] Byrne BM., 2016, Basic Concepts, Applications, and Programming, V3, DOI 10.4324/9781315757421
  • [5] Analysis of housing prices in Petaling district, Malaysia using functional relationship model
    Chang, Yun Fah
    Choong, Wei Cheng
    Looi, Sing Yan
    Pan, Wei Yeing
    Goh, Hong Lip
    [J]. INTERNATIONAL JOURNAL OF HOUSING MARKETS AND ANALYSIS, 2019, 12 (05) : 884 - 905
  • [6] LightGBM-PPI: Predicting protein-protein interactions through LightGBM with multi-information fusion
    Chen, Cheng
    Zhang, Qingmei
    Ma, Qin
    Yu, Bin
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 191 : 54 - 64
  • [7] FORECASTING SPATIAL DYNAMICS OF THE HOUSING MARKET USING SUPPORT VECTOR MACHINE
    Chen, Jieh-Haur
    Ong, Chuan Fan
    Zheng, Linzi
    Hsu, Shu-Chien
    [J]. INTERNATIONAL JOURNAL OF STRATEGIC PROPERTY MANAGEMENT, 2017, 21 (03) : 273 - 283
  • [8] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [9] Choong W. C., 2018, STAT ANAL HOUSING PR
  • [10] Damia Abd Samad P. H., 2019, INDONES J ELECT ENG, V16, P1050, DOI 10.11591/ijeecs.v16.i2.pp1050-1058