A case study to examine the imputation of missing data to improve clustering analysis of building electrical demand

被引:10
|
作者
Inman, Daniel [1 ]
Elmore, Ryan [2 ]
Bush, Brian [1 ]
机构
[1] Natl Renewable Energy Lab, Strateg Energy Anal Ctr, Golden, CO 80401 USA
[2] Natl Renewable Energy Lab, Computat Sci Ctr, Golden, CO 80401 USA
关键词
Clustering; missing data; building electrical demand; FAULT-DETECTION; SYSTEMS;
D O I
10.1177/0143624415573215
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Building performance data are widely used for daily operation, improving building efficiency, identifying and diagnosing performance problems, and commissioning. In this study, the authors explore the use of missing data imputation and clustering on an electrical demand dataset. The objective was to compare four approaches of data imputation and clustering analysis. Results of this study suggest that using multiple imputation to fill in missing data prior to performing clustering analysis results in more informative clusters. Commonly used methods to fill in missing data lead to changes in cluster membership that are not suggestive of a change in the building's performance, but instead is a result of the choice of imputation method used.Practical application: The authors demonstrate, through the use of a case study, the application of a statistically sound method for filling in missing data in large buildings performance datasets. The methods used in this analysis are available through the open-source programming language R and are straight forward to implement. The approach demonstrated in this case study could aid buildings analysts with fault detection and continuous commissioning of large commercial buildings.
引用
收藏
页码:628 / 637
页数:10
相关论文
共 50 条
  • [21] Sensitivity Analysis of Missing Data: Case Studies Using Model-Based Multiple Imputation
    Zhang, Jie
    DRUG INFORMATION JOURNAL, 2009, 43 (04): : 475 - 484
  • [22] Multiple imputation of missing fMRI data in whole brain analysis
    Vaden, Kenneth I., Jr.
    Gebregziabher, Mulugeta
    Kuchinsky, Stefanie E.
    Eckert, Marl A.
    NEUROIMAGE, 2012, 60 (03) : 1843 - 1855
  • [23] Missing data analyses: a hybrid multiple imputation algorithm using Gray System Theory and entropy based on clustering
    Tian, Jing
    Yu, Bing
    Yu, Dan
    Ma, Shilong
    APPLIED INTELLIGENCE, 2014, 40 (02) : 376 - 388
  • [24] Mediation Analysis with Missing Data Through Multiple Imputation and Bootstrap
    Zhang, Zhiyong
    Wang, Lijuan
    Tong, Xin
    Quantitative Psychology Research, 2015, 140 : 341 - 355
  • [25] Sensitivity Analysis of Missing Data: Case Studies Using Model-Based Multiple Imputation
    Jie Zhang
    Drug information journal : DIJ / Drug Information Association, 2009, 43 (4): : 475 - 484
  • [26] Missing Data Characteristics and the Choice of Imputation Technique: An Empirical Study
    Alade, Oyekale Abel
    Sallehuddin, Roselina
    Radzi, Nor Haizan Mohamed
    Selamat, Ali
    EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 88 - 97
  • [27] Imputation of data Missing Not at Random: Artificial generation and benchmark analysis
    Pereira, Ricardo Cardoso
    Abreu, Pedro Henriques
    Rodrigues, Pedro Pereira
    Figueiredo, Mario A. T.
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [28] Clustering of meteorological data to improve agricultural decisions: a case study with SIMAGRO-RS
    de Oliveira, Marcos Antonio, Jr.
    Varone, Flavio A.
    Fraisse, Clyde W.
    Araujo, Ricardo Matsumura
    Cavalheiro, Gerson Geraldo H.
    PROCEEDINGS OF THE 20TH BRAZILIAN SYMPOSIUM ON INFORMATIONS SYSTEMS, SBSI 2024, 2024,
  • [29] Fault detection based on Bayesian network and missing data imputation for building energy systems
    Wang, Zhanwei
    Wang, Lin
    Tan, Yingying
    Yuan, Junfei
    APPLIED THERMAL ENGINEERING, 2021, 182
  • [30] Handling missing data: analysis of a challenging data set using multiple imputation
    Pampaka, Maria
    Hutcheson, Graeme
    Williams, Julian
    INTERNATIONAL JOURNAL OF RESEARCH & METHOD IN EDUCATION, 2016, 39 (01) : 19 - 37