A Review on Data Preprocessing Techniques Toward Efficient and Reliable Knowledge Discovery From Building Operational Data

被引:167
作者
Fan, Cheng [1 ,2 ]
Chen, Meiling [1 ,2 ]
Wang, Xinghua [3 ]
Wang, Jiayuan [1 ,2 ]
Huang, Bufu [3 ]
机构
[1] Shenzhen Univ, Sino Australia Joint Res Ctr BIM & Smart Construc, Shenzhen, Peoples R China
[2] Shenzhen Univ, Dept Construct Management & Real Estate, Shenzhen, Peoples R China
[3] eSight Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
data preprocessing; building operational data analysis; data science; knowledge discovery; building energy management; LEARNING-BASED METHODOLOGY; FAULT-DETECTION ANALYSIS; DATA MINING TECHNIQUES; ENERGY-CONSUMPTION; ASSOCIATION; DIAGNOSIS; SYSTEM; PERFORMANCE; PREDICTION; FRAMEWORK;
D O I
10.3389/fenrg.2021.652801
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
The rapid development in data science and the increasing availability of building operational data have provided great opportunities for developing data-driven solutions for intelligent building energy management. Data preprocessing serves as the foundation for valid data analyses. It is an indispensable step in building operational data analysis considering the intrinsic complexity of building operations and deficiencies in data quality. Data preprocessing refers to a set of techniques for enhancing the quality of the raw data, such as outlier removal and missing value imputation. This article serves as a comprehensive review of data preprocessing techniques for analysing massive building operational data. A wide variety of data preprocessing techniques are summarised in terms of their applications in missing value imputation, outlier detection, data reduction, data scaling, data transformation, and data partitioning. In addition, three state-of-the-art data science techniques are proposed to tackle practical data challenges in the building field, i.e., data augmentation, transfer learning, and semi-supervised learning. In-depth discussions have been presented to describe the pros and cons of existing preprocessing methods, possible directions for future research and potential applications in smart building energy management. The research outcomes are helpful for the development of data-driven research in the building field.
引用
收藏
页数:17
相关论文
共 72 条
[1]  
[Anonymous], 2014, INT C LEARN REPR BAN
[2]  
[Anonymous], 2019, Energy Efficiency: Buildings, The global exchange for energy efficiency policies, data and analysis
[3]   Systematic approach to provide building occupants with feedback to reduce energy consumption [J].
Ashouri, Milad ;
Fung, Benjamin C. M. ;
Haghighat, Fariborz ;
Yoshino, Hiroshi .
ENERGY, 2020, 194
[4]   Development of building energy saving advisory: A data mining approach [J].
Ashouri, Milad ;
Haghighat, Fariborz ;
Fung, Benjamin C. M. ;
Lazrak, Amine ;
Yoshino, Hiroshi .
ENERGY AND BUILDINGS, 2018, 172 :139-151
[5]   Fault detection analysis using data mining techniques for a cluster of smart office buildings [J].
Capozzoli, Alfonso ;
Lauro, Fiorella ;
Khan, Imran .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (09) :4324-4338
[6]   Artificial neural network model for forecasting sub-hourly electricity usage in commercial buildings [J].
Chae, Young Tae ;
Horesh, Raya ;
Hwang, Youngdeok ;
Lee, Young M. .
ENERGY AND BUILDINGS, 2016, 111 :184-194
[7]  
Chapelle O., 2009, Semi-Supervised Learning, V20, P542
[8]  
Chollet F., 2018, DEEP LEARNING WITH R
[9]   Smart meter monitoring and data mining techniques for predicting refrigeration system performance [J].
Chou, Jui-Sheng ;
Hsu, Yu-Chien ;
Lin, Liang-Tse .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (05) :2144-2156
[10]  
Cornacchia M., 2018, IEEE APPL IM PATT RE