A machine learning framework to estimate residential electricity demand based on smart meter electricity, climate, building characteristics, and socioeconomic datasets

被引:8
作者
Peplinski, Mckenna [1 ]
Dilkina, Bistra [2 ]
Chen, Mo [3 ]
Silva, Sam J. [4 ,5 ]
Ban-Weiss, GeorgeA. [1 ]
Sanders, Kelly T. [1 ]
机构
[1] Univ Southern Calif, Dept Civil & Environm Engn, Los Angeles, CA 90007 USA
[2] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA USA
[3] Calif Air Resources Board, Sacramento, CA USA
[4] Univ Southern Calif, Dept Earth Sci, Los Angeles, CA USA
[5] Univ Southern Calif, Dept Civil & Environm Engn, Los Angeles, CA USA
关键词
Smart meter; Residential electricity; Machine learning; Climate change; Building energy; Energy forecasting; FORECASTING ENERGY-CONSUMPTION; SUPPORT VECTOR REGRESSION; ARTIFICIAL NEURAL-NETWORK; FEATURE-SELECTION; UNITED-STATES; USE INTENSITY; PREDICTION; PERFORMANCE; IMPACT; MODEL;
D O I
10.1016/j.apenergy.2023.122413
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Due to the substantial portion of total electricity use attributed to the residential sector and projected rises in demand, anticipating future energy needs in the context of a warming climate will be essential to maintain grid reliability and plan for future infrastructure investments. Machine learning has become a popular tool for forecasting residential electricity demand, but previous studies have been limited by lack of access to high spatiotemporal resolution at a regional scale, which reduces a model's ability to capture the relationship between electricity and its driving factors. In this study, we develop and execute a machine learning framework to predict residential electricity demand at varying temporal and spatial resolutions using hourly smart meter electricity records from roughly 58,000 homes provided by Southern California Edison as well as local weather data, building characteristics, and socioeconomic indicators. The best performing model at the household level, multilayer perceptron (MLP), was able to predict electricity demand most accurately at a monthly resolution, achieving an r2 of 0.45, while the most accurate annual and daily models (also MLP) had r2 values of 0.34 and 0.38, respectively. The results also show that models trained with data aggregated to the census tract level were more accurate (e.g., r2 = 0.82 for the monthly MLP model) than at the household level across all three temporal resolutions analyzed. Total square footage and various climate indicators had the highest feature importance values. Square footage was ranked first in feature importance for the annual and daily models, while the month of the year, which is strongly tied to temperature, was most important to the monthly model. Through this analysis we gain insight into factors that drive electricity demand and the usefulness of machine learning for predicting residential electricity use.
引用
收藏
页数:16
相关论文
共 135 条
  • [1] Forecasting highly fluctuating electricity load using machine learning models based on multimillion observations
    Abdallah, Mohamed
    Abu Talib, Manar
    Hosny, Mariam
    Abu Waraga, Omnia
    Nasir, Qassim
    Arshad, Muhammad Arbab
    [J]. ADVANCED ENGINEERING INFORMATICS, 2022, 53
  • [2] Ahmad Tohari, 2019, ICIC Express Letters, V13, P93, DOI 10.24507/icicel.13.02.93
  • [3] Supervised based machine learning models for short, medium and long-term energy prediction in distinct building environment
    Ahmad, Tanveer
    Chen, Huanxin
    Huang, Ronggeng
    Guo Yabin
    Wang, Jiangyu
    Shair, Jan
    Akram, Hafiz Muhammad Azeem
    Mohsan, Syed Agha Hassnain
    Kazim, Muhammad
    [J]. ENERGY, 2018, 158 : 17 - 32
  • [4] Response of residential electricity demand to price: The effect of measurement error
    Alberini, Anna
    Filippini, Massimo
    [J]. ENERGY ECONOMICS, 2011, 33 (05) : 889 - 895
  • [5] Data preprocessing in predictive data mining
    Alexandropoulos, Stamatios-Aggelos N.
    Kotsiantis, Sotiris B.
    Vrahatis, Michael N.
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2019, 34
  • [6] A review of data-driven building energy consumption prediction studies
    Amasyali, Kadir
    El-Gohary, Nora M.
    [J]. RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2018, 81 : 1192 - 1205
  • [7] Anguita D, 2014, K - fold cross validation for error rate estimate in support vector machines, P2009
  • [8] [Anonymous], 2009, HOUSEHOLD ENERGY USE
  • [9] [Anonymous], 2021, CAPITA US RESIDENTIA
  • [10] [Anonymous], 2023, Electric power annual: Table 1.2. summary statistics for the United States, 2010-2020