A case study to examine the imputation of missing data to improve clustering analysis of building electrical demand

被引:10
|
作者
Inman, Daniel [1 ]
Elmore, Ryan [2 ]
Bush, Brian [1 ]
机构
[1] Natl Renewable Energy Lab, Strateg Energy Anal Ctr, Golden, CO 80401 USA
[2] Natl Renewable Energy Lab, Computat Sci Ctr, Golden, CO 80401 USA
关键词
Clustering; missing data; building electrical demand; FAULT-DETECTION; SYSTEMS;
D O I
10.1177/0143624415573215
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Building performance data are widely used for daily operation, improving building efficiency, identifying and diagnosing performance problems, and commissioning. In this study, the authors explore the use of missing data imputation and clustering on an electrical demand dataset. The objective was to compare four approaches of data imputation and clustering analysis. Results of this study suggest that using multiple imputation to fill in missing data prior to performing clustering analysis results in more informative clusters. Commonly used methods to fill in missing data lead to changes in cluster membership that are not suggestive of a change in the building's performance, but instead is a result of the choice of imputation method used.Practical application: The authors demonstrate, through the use of a case study, the application of a statistically sound method for filling in missing data in large buildings performance datasets. The methods used in this analysis are available through the open-source programming language R and are straight forward to implement. The approach demonstrated in this case study could aid buildings analysts with fault detection and continuous commissioning of large commercial buildings.
引用
收藏
页码:628 / 637
页数:10
相关论文
共 50 条
  • [31] Imputation and Missing Indicators for Handling Missing Longitudinal Data: Data Simulation Analysis Based on Electronic Health Record Data
    Ehrig, Molly
    Bullock, Garrett S.
    Leng, Xiaoyan Iris
    Pajewski, Nicholas M.
    Speiser, Jaime Lynn
    JMIR MEDICAL INFORMATICS, 2025, 13
  • [32] A novel clustering-based purity and distance imputation for handling medical data with missing values
    Cheng, Ching-Hsue
    Huang, Shu-Fen
    SOFT COMPUTING, 2021, 25 (17) : 11781 - 11801
  • [33] Imputation Method Based on Collaborative Filtering and Clustering for the Missing Data of the Squeeze Casting Process Parameters
    Jianxin Deng
    Zhixing Ye
    Lubao Shan
    Dongdong You
    Guangming Liu
    Integrating Materials and Manufacturing Innovation, 2022, 11 : 95 - 108
  • [34] Missing data imputation in clinical trials using recurrent neural network facilitated by clustering and oversampling
    Haliduola, Halimu N.
    Bretz, Frank
    Mansmann, Ulrich
    BIOMETRICAL JOURNAL, 2022, 64 (05) : 863 - 882
  • [35] A novel clustering-based purity and distance imputation for handling medical data with missing values
    Ching-Hsue Cheng
    Shu-Fen Huang
    Soft Computing, 2021, 25 : 11781 - 11801
  • [36] Imputation Method Based on Collaborative Filtering and Clustering for the Missing Data of the Squeeze Casting Process Parameters
    Deng, Jianxin
    Ye, Zhixing
    Shan, Lubao
    You, Dongdong
    Liu, Guangming
    INTEGRATING MATERIALS AND MANUFACTURING INNOVATION, 2022, 11 (01) : 95 - 108
  • [37] Analysis of Missing Data in Progressed Learners: The Use of Multiple Imputation Methods
    Mabungane, S.
    Ramroop, S.
    Mwambi, H.
    AFRICAN JOURNAL OF RESEARCH IN MATHEMATICS SCIENCE AND TECHNOLOGY EDUCATION, 2023, 27 (02) : 112 - 122
  • [38] Development of Imputation Methods for Missing Data in Multiple Linear Regression Analysis
    Thidarat Thongsri
    Klairung Samart
    Lobachevskii Journal of Mathematics, 2022, 43 : 3390 - 3399
  • [39] Missing Values Imputation Using Genetic Algorithm for the Analysis of Traffic Data
    Midde, Ranjit Reddy
    Srinivasa, K. G.
    Reddy, Eswara B.
    ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, ICAIECES 2017, 2018, 668 : 251 - 261
  • [40] Development of Imputation Methods for Missing Data in Multiple Linear Regression Analysis
    Thongsri, Thidarat
    Samart, Klairung
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2022, 43 (11) : 3390 - 3399