Building energy performance prediction: A reliability analysis and evaluation of feature selection methods

Cited by: 17

Authors:

Olu-Ajayi, Razak [1]

Alaka, Hafiz [1]

Sulaimon, Ismail [1]

Balogun, Habeeb [1]

Wusu, Godoyon [1]

Yusuf, Wasiu [1]

Adegoke, Muideen [1]

Affiliations:

[1] Univ Hertfordshire, Big Data Technol & Innovat Lab, Hatfield AL10 9AB, England

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2023 / Vol. 225

Keywords:

Feature selection; Building energy performance; Energy efficiency; Machine learning; Energy prediction; RANDOM FOREST; CONSUMPTION PREDICTION; LOAD PREDICTION; NEURAL-NETWORK; COOLING LOAD; MACHINE; MODELS; ALGORITHMS; FRAMEWORK; ACCURACY

DOI:

10.1016/j.eswa.2023.120109

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The advancement of smart meters using evolving technologies such as the Internet of Things (IoT) is producing more data for the training of energy prediction models. Since many machine learning techniques were not premeditated to handle a large number of irrelevant features, it has engendered the search for optimal techniques to decrease the generated features and potentially identify the most relevant features that have an impact on building energy efficiency. Feature selection is considered one of the most suitable methods of pinpointing the best features combination. However, the fraction of studies that deliver comprehensive insights on the incorporation of feature selection with machine learning is still limited, notwithstanding the capabilities of feature selection to produce a good result in terms of accuracy and speed. To address this gap, this study investigates feature selection methods centred on building energy consumption prediction using machine learning. This study conducted a comparative analysis of 14 machine learning algorithms on 5 different data sizes and explored the effect of 7 feature selection methods on model performance for predicting energy consumption in buildings. Furthermore, this study identifies the most effective feature selection methods and machine learning models for energy use prediction. The experimental results demonstrate that feature selection can affect model's performance positively or negatively, depending on the algorithm employed. Nevertheless, the filter method was noted as the most appropriate method for most Machine Learning (ML) classification algorithms. Moreover, Gradient Boosting (GB) was identified as the most effective model for predicting energy performance in buildings. Additionally, the reliability analysis confirms the hypothesis that "the larger the data, the more accurate the result" but only for specific algorithms such as Deep Neural Networks (DNN). This study also presents the theoretical and practical implications of this research.
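For readers unfamiliar with the term, a filter method scores each feature against the target independently of any learning algorithm and keeps the top-ranked ones. The sketch below is only an illustration of that general idea, not the procedure used in the study: the Pearson-correlation score, the synthetic data, the feature names (`outdoor_temp`, `floor_area`, `noise_a`, `noise_b`) and the helper `filter_select` are all assumptions introduced here.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)


def filter_select(columns, target, k):
    """Rank features by |correlation with the target| and keep the top k names."""
    scores = {name: abs(pearson(values, target)) for name, values in columns.items()}
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:k], scores


# Synthetic data with hypothetical feature names (not the study's dataset).
rng = random.Random(0)
n = 500
columns = {
    "outdoor_temp": [rng.gauss(15, 8) for _ in range(n)],
    "floor_area": [rng.uniform(50, 500) for _ in range(n)],
    "noise_a": [rng.gauss(0, 1) for _ in range(n)],
    "noise_b": [rng.gauss(0, 1) for _ in range(n)],
}
# Assumed toy relationship: energy use depends only on area and temperature.
energy = [
    0.5 * area - 3.0 * temp + rng.gauss(0, 5)
    for area, temp in zip(columns["floor_area"], columns["outdoor_temp"])
]

selected, scores = filter_select(columns, energy, k=2)
print(selected)
```

Because the score is computed before any model is trained, the same reduced feature set can be passed to whichever regressor is being compared, which is what makes filter methods cheap relative to approaches that retrain a model for every candidate subset.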

References

Pages: 17

113 entries in total

