Estimation of the building energy use intensity in the urban scale by integrating GIS and big data technology

被引:155
作者
Ma, Jun [1 ]
Cheng, Jack C. P. [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Civil & Environm Engn, Kowloon, Hong Kong, Peoples R China
关键词
Artificial Neural Network (ANN); Big Data; Energy use intensity (EUI); Feature selection; Geographic information system (GIS); Support Vector Regression (SVR); RIDGE-REGRESSION; NEURAL-NETWORKS; CLIMATE-CHANGE; LOW-INCOME; CONSUMPTION; PREDICTION; SELECTION; IMPACT; OPTIMIZATION; MODELS;
D O I
10.1016/j.apenergy.2016.08.079
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Buildings are the major source of energy consumption in urban areas. Accurate modeling and forecasting of the building energy use intensity (EUI) in the urban scale have many important applications, such as energy benchmarking and urban energy infrastructure planning. The use of Big Data technology is expected to have the capability of integrating a large number of predictors and giving an accurate prediction of the energy use intensity of buildings in the urban scale. However, past research has often used Big Data technology in estimating energy consumption of a single building rather than the urban scale, due to several challenges such as data collection and feature engineering. This paper therefore proposes a geographic information system integrated data mining methodology framework for estimating the building EUI in the urban scale, including preprocessing, feature selection, and algorithm optimization. Based on 216 prepared features, a case study on estimating the site EUI of 3640 multi-family residential buildings in New York City, was tested and validated using the proposed methodology framework. A comparative study on the feature selection strategies and the commonly used regression algorithms was also included in the case study. The results show that the framework was able to help produce lower estimation errors than previous research, and the model built by the Support Vector Regression algorithm on the features selected by Elastic Net has the least cross-validation mean squared error. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:182 / 192
页数:11
相关论文
共 59 条
[1]   Using multiple regression analysis to develop energy consumption indicators for commercial buildings in the US [J].
Amiri, Shideh Shams ;
Mottahedi, Mohammad ;
Asadi, Somayeh .
ENERGY AND BUILDINGS, 2015, 109 :209-216
[2]  
[Anonymous], 1999, Ph.D. Thesis
[3]  
[Anonymous], 2015, REGR AN
[4]  
[Anonymous], 2009, NEURAL NETWORKS LEAR
[5]  
[Anonymous], 2015, GEOGR INF SYST
[6]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[7]   Comparing the effectiveness of weatherization treatments for low-income, American, urban housing stocks in different climates [J].
Bradshaw, Jonathan L. ;
Bou-Zeid, Elie ;
Harris, Robert H. .
ENERGY AND BUILDINGS, 2014, 69 :535-543
[8]   Comparison of feature selection methods using ANNs in MCP-wind speed methods. A case study [J].
Carta, Jose A. ;
Cabrera, Pedro ;
Matias, Jose M. ;
Castellano, Fernando .
APPLIED ENERGY, 2015, 158 :490-507
[9]   A non-linear case-based reasoning approach for retrieval of similar cases and selection of target credits in LEED projects [J].
Cheng, Jack C. P. ;
Ma, Lucky J. .
BUILDING AND ENVIRONMENT, 2015, 93 :349-361
[10]   A data-driven study of important climate factors on the achievement of LEED-EB credits [J].
Cheng, Jack C. P. ;
Ma, Lucky J. .
BUILDING AND ENVIRONMENT, 2015, 90 :232-244