CRISP-DM-Based Data-Driven Approach for Building Energy Prediction Utilizing Indoor and Environmental Factors

被引:0
作者
Elkabalawy, Moaaz [1 ]
Al-Sakkaf, Abobakr [1 ]
Abdelkader, Eslam Mohammed [2 ]
Alfalah, Ghasan [3 ]
机构
[1] Concordia Univ, Dept Bldg Civil & Environm Engn, Montreal, PQ H3G 1M8, Canada
[2] Cairo Univ, Fac Engn, Struct Engn Dept, Giza 12613, Egypt
[3] King Saud Univ, Coll Architecture & Planning, Dept Architecture & Bldg Sci, Riyadh 11362, Saudi Arabia
关键词
smart buildings; sustainability; data mining; data-driven energy prediction models; CRISP-DM; supervised machine learning; occupancy prediction; feature selection analysis; SUPPORT VECTOR MACHINES; LOAD PREDICTION; COOLING LOAD; HEATING LOAD; CONSUMPTION; OCCUPANCY; MODEL; VERIFICATION; SELECTION; SYSTEMS;
D O I
10.3390/su16177249
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The significant energy consumption associated with the built environment demands comprehensive energy prediction modelling. Leveraging their ability to capture intricate patterns without extensive domain knowledge, supervised data-driven approaches present a marked advantage in adaptability over traditional physical-based building energy models. This study employs various machine learning models to predict energy consumption for an office building in Berkeley, California. To enhance the accuracy of these predictions, different feature selection techniques, including principal component analysis (PCA), decision tree regression (DTR), and Pearson correlation analysis, were adopted to identify key attributes of energy consumption and address collinearity. The analyses yielded nine influential attributes: heating, ventilation, and air conditioning (HVAC) system operating parameters, indoor and outdoor environmental parameters, and occupancy. To overcome missing occupancy data in the datasets, we investigated the possibility of occupancy-based Wi-Fi prediction using different machine learning algorithms. The results of the occupancy prediction modelling indicate that Wi-Fi can be used with acceptable accuracy in predicting occupancy count, which can be leveraged to analyze occupant comfort and enhance the accuracy of building energy models. Six machine learning models were tested for energy prediction using two different datasets: one before and one after occupancy prediction. Using a 10-fold cross-validation with an 8:2 training-to-testing ratio, the Random Forest algorithm emerged superior, exhibiting the highest R2 value of 0.92 and the lowest RMSE of 3.78 when occupancy data were included. Additionally, an error propagation analysis was conducted to assess the impact of the occupancy-based Wi-Fi prediction model's error on the energy prediction model. The results indicated that Wi-Fi-based occupancy prediction can improve the data inputs for building energy models, leading to more accurate energy consumption predictions. The findings underscore the potential of integrating the developed energy prediction models with fault detection systems, model predictive controllers, and energy load shape analysis, ultimately enhancing energy management practices.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] A data-driven approach for multi-scale GIS-based building energy modeling for analysis, planning and support decision making
    Ali, Usman
    Shamsi, Mohammad Haris
    Bohacek, Mark
    Purcell, Karl
    Hoare, Cathal
    Mangina, Eleni
    O'Donnell, James
    APPLIED ENERGY, 2020, 279
  • [42] A data-driven life-cycle optimisation approach for building retrofitting: A comprehensive assessment on economy, energy and environment
    Luo, X. J.
    Oyedele, Lukumon O.
    JOURNAL OF BUILDING ENGINEERING, 2021, 43 (43):
  • [43] A simple load model based on hybrid mechanism and data-driven approach for district heating in building complex
    Yang, Junhong
    Zhao, Tong
    Peng, Mengbo
    Cui, Mianshan
    Zhu, Junda
    ENERGY AND BUILDINGS, 2024, 322
  • [44] A Systematic Review of Building Energy Consumption Prediction: From Perspectives of Load Classification, Data-Driven Frameworks, and Future Directions
    Chen, Guanzhong
    Lu, Shengze
    Zhou, Shiyu
    Tian, Zhe
    Kim, Moon Keun
    Liu, Jiying
    Liu, Xinfeng
    APPLIED SCIENCES-BASEL, 2025, 15 (06):
  • [45] Data-driven, long-term prediction of building performance under climate change: Building energy demand and BIPV energy generation analysis across Turkey
    Tamer, Tolga
    Dino, Ipek Gursel
    Akgul, Cagla Meral
    RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2022, 162
  • [46] An Energy Data-Driven Approach for Operating Status Recognition of Machine Tools Based on Deep Learning
    Yan, Wei
    Lu, Chenxun
    Liu, Ying
    Zhang, Xumei
    Zhang, Hua
    SENSORS, 2022, 22 (17)
  • [47] A hybrid physics-based and data-driven approach for long-term VRFB aging prediction
    Cai, Mingxuan
    Yang, Bo
    Liu, Qi
    Zhu, Jiajie
    JOURNAL OF CONTROL AND DECISION, 2025,
  • [48] A data-driven approach for tool wear recognition and quantitative prediction based on radar map feature fusion
    Li, Xuebing
    Liu, Xianli
    Yue, Caixu
    Liu, Shaoyang
    Zhang, Bowen
    Li, Rongyi
    Liang, Steven Y.
    Wang, Lihui
    MEASUREMENT, 2021, 185
  • [49] A novel methodology to explain and evaluate data-driven building energy performance models based on interpretable machine learning
    Fan, Cheng
    Xiao, Fu
    Yan, Chengchu
    Liu, Chengliang
    Li, Zhengdao
    Wang, Jiayuan
    APPLIED ENERGY, 2019, 235 : 1551 - 1560
  • [50] Effect of physical, environmental, and social factors on prediction of building energy consumption for public buildings based on real-world big data
    Zhang, Yuhang
    Zhang, Yi
    Zhang, Yi
    Zhang, Chengxu
    ENERGY, 2022, 261