Building energy performance prediction: A reliability analysis and evaluation of feature selection methods

被引:17
作者
Olu-Ajayi, Razak [1 ]
Alaka, Hafiz [1 ]
Sulaimon, Ismail [1 ]
Balogun, Habeeb [1 ]
Wusu, Godoyon [1 ]
Yusuf, Wasiu [1 ]
Adegoke, Muideen [1 ]
机构
[1] Univ Hertfordshire, Big Data Technol & Innovat Lab, Hatfield AL10 9AB, England
关键词
Feature selection; Building energy performance; Energy efficiency; Machine learning; Energy prediction; RANDOM FOREST; CONSUMPTION PREDICTION; LOAD PREDICTION; NEURAL-NETWORK; COOLING LOAD; MACHINE; MODELS; ALGORITHMS; FRAMEWORK; ACCURACY;
D O I
10.1016/j.eswa.2023.120109
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advancement of smart meters using evolving technologies such as the Internet of Things (IoT) is producing more data for the training of energy prediction models. Since many machine learning techniques were not premeditated to handle a large number of irrelevant features, it has engendered the search for optimal techniques to decrease the generated features and potentially identify the most relevant features that have an impact on building energy efficiency. Feature selection is considered one of the most suitable methods of pinpointing the best features combination. However, the fraction of studies that deliver comprehensive insights on the incor-poration of feature selection with machine learning is still limited, notwithstanding the capabilities of feature selection to produce a good result in terms of accuracy and speed. To address this gap, this study investigates feature selection methods centred on building energy consumption prediction using machine learning. This study conducted a comparative analysis of 14 machine learning algorithms on 5 different data sizes and explored the effect of 7 feature selection methods on model performance for predicting energy consumption in buildings. Furthermore, this study identifies the most effective feature selection methods and machine learning models for energy use prediction. The experimental results demonstrate that feature selection can affect model's perfor-mance positively or negatively, depending on the algorithm employed. Nevertheless, the filter method was noted as the most appropriate method for most Machine Learning (ML) classification algorithms. Moreover, Gradient Boosting (GB) was identified as the most effective model for predicting energy performance in buildings. Additionally, the reliability analysis confirms the hypothesis that "the larger the data, the more accurate the result" but only for specific algorithms such as Deep Neural Networks (DNN). This study also presents the theoretical and practical implications of this research.
引用
收藏
页数:17
相关论文
共 113 条
  • [31] Dalal KR, 2018, PROCEEDINGS OF THE 2018 3RD INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2018), P270, DOI 10.1109/ICICT43934.2018.9034256
  • [32] Dandotiya B., 2020, IMPACTS CLIMATE CHAN, DOI [10.4018/978-1-7998-3343-7.ch007, DOI 10.4018/978-1-7998-3343-7.CH007]
  • [33] Dash M., 1997, Intelligent Data Analysis, V1
  • [34] Identification of bacteriophage virion proteins by the ANOVA feature selection and analysis
    Ding, Hui
    Feng, Peng-Mian
    Chen, Wei
    Lin, Hao
    [J]. MOLECULAR BIOSYSTEMS, 2014, 10 (08) : 2229 - 2235
  • [35] A comparative analysis of data-driven methods in building energy benchmarking
    Ding, Yong
    Liu, Xue
    [J]. ENERGY AND BUILDINGS, 2020, 209
  • [36] A Few Useful Things to Know About Machine Learning
    Domingos, Pedro
    [J]. COMMUNICATIONS OF THE ACM, 2012, 55 (10) : 78 - 87
  • [37] Applying support vector machines to predict building energy consumption in tropical region
    Dong, B
    Cao, C
    Lee, SE
    [J]. ENERGY AND BUILDINGS, 2005, 37 (05) : 545 - 553
  • [38] Hourly energy consumption prediction of an office building based on ensemble learning and energy consumption pattern classification
    Dong, Zhenxiang
    Liu, Jiangyan
    Liu, Bin
    Li, Kuining
    Li, Xin
    [J]. ENERGY AND BUILDINGS, 2021, 241
  • [39] A Framework for Selecting Deep Learning Hyper-parameters
    Donoghue, Jim O'
    Roantree, Mark
    [J]. DATA SCIENCE, 2015, 9147 : 120 - 132
  • [40] Duy Tang Hoang, 2019, Intelligent Computing Methodologies. 15th International Conference, ICIC 2019. Proceedings: Lecture Notes in Artificial Intelligence (LNAI 11645), P283, DOI 10.1007/978-3-030-26766-7_26