Unsupervised learning for feature projection: Extracting patterns from multidimensional building measurements

被引:4
作者
Xiao, Chunze [1 ]
Khayatian, Fazel [2 ]
Dall'O, Giuliano [3 ]
机构
[1] UCL, Bartlett Sch Environm Energy & Resources, London, England
[2] Swiss Fed Labs Mat Sci & Technol, Empa, Urban Energy Syst Lab, Dubendorf, Switzerland
[3] Politecn Milan, Dept Architecture Built Environm & Construct Engn, Milan, Italy
关键词
Unsupervised learning; Building performance; Dimensionality reduction; Data representation; ENERGY-CONSUMPTION; DIMENSIONALITY; PREDICTION; SIMULATION;
D O I
10.1016/j.enbuild.2020.110228
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Data visualization is an important resource for decision makers to obtain information from large datasets. Based on the data obtained from either predictions or measurements, different strategies are combined and tested to reduce the energy demand, whilst keeping the indoor comfort at suitable level. Although the information expressed from data representation can significantly influence the decisions, little research has focused on extracting features from building measurements. This paper provides an indepth view into representation of building data, and applies three dimensionality reduction algorithms Principle Component Analysis (PCA), autoencoder and t-Distributed Stochastic Neighbour Embedding (t-SNE) on measurements from a teaching building. Results show that whilst PCA returns linear representations, it also has the least data compression, which can be useful for obtaining more general features. On the other hand, t-SNE returns the most compressed data, which is suitable for seeking large margins within a dataset. However, t-SNE may be unsuitable for datasets with recurring step-like temporal profiles. Autoencoder is the best overall option, as they capture the nonlinearities within a dataset whilst avoiding excessive data compression. Fine-tuning the hyperparameters of studied the algorithms, and the perils of relying on poorly tuned models is discussed at the end of the study. (C) 2020 The Authors. Published by Elsevier B.V.
引用
收藏
页数:18
相关论文
共 41 条
[31]   SILHOUETTES - A GRAPHICAL AID TO THE INTERPRETATION AND VALIDATION OF CLUSTER-ANALYSIS [J].
ROUSSEEUW, PJ .
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 1987, 20 :53-65
[32]   Exploring HVAC system sizing under uncertainty [J].
Sun, Yuming ;
Gu, Li ;
Wu, C. F. Jeff ;
Augenbroe, Godfried .
ENERGY AND BUILDINGS, 2014, 81 :243-252
[33]   International study on energy end-use data among industrial SMEs (small and medium-sized enterprises) and energy end-use efficiency improvement opportunities [J].
Thollander, Patrik ;
Paramonova, Svetlana ;
Cornelis, Erwin ;
Kimura, Osamu ;
Trianni, Andrea ;
Karlsson, Magnus ;
Cagno, Enrico ;
Morales, Ines ;
Jimenez Navarro, Juan Pablo .
JOURNAL OF CLEANER PRODUCTION, 2015, 104 :282-296
[34]  
UN Environment and International Energy Agency, 2017, ZER EM EFF RES BUILD, P14
[35]  
United Nations Environment Programme-Sustainable Buildings & Climate Initiative, 2009, BUILD CLIM CHANG SUM, P9
[36]   Energy use in buildings in a long-term perspective [J].
Urge-Vorsatz, Diana ;
Petrichenko, Ksenia ;
Staniec, Maja ;
Eom, Jiyong .
CURRENT OPINION IN ENVIRONMENTAL SUSTAINABILITY, 2013, 5 (02) :141-151
[37]  
van der Maaten L, 2008, J MACH LEARN RES, V9, P2579
[38]   Generalized Autoencoder: A Neural Network Framework for Dimensionality Reduction [J].
Wang, Wei ;
Huang, Yan ;
Wang, Yizhou ;
Wang, Liang .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, :496-+
[39]  
Wattenberg Martin, 2016, Distill, DOI [DOI 10.23915/DISTILL.00002, 10.23915/distill.00002]
[40]   A method of formulating energy load profile for domestic buildings in the UK [J].
Yao, RM ;
Steemers, K .
ENERGY AND BUILDINGS, 2005, 37 (06) :663-671