Integrating active learning and semi-supervised learning for improved data-driven HVAC fault diagnosis performance

被引:26
作者
Fan, Cheng [1 ,2 ,3 ]
Wu, Qiuting [2 ,3 ]
Zhao, Yang [4 ]
Mo, Like [1 ,2 ,3 ]
机构
[1] Shenzhen Univ, Key Lab Resilient Infrastructures Coastal Cities, Minist Educ, Shenzhen, Peoples R China
[2] Shenzhen Univ, Sino Australia Joint Res Ctr BIM & Smart Construct, Shenzhen, Peoples R China
[3] Shenzhen Univ, Coll Civil & Transportat Engn, Shenzhen, Peoples R China
[4] Zhejiang Univ, Inst Refrigerat & Cryogen, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Active learning; Semi-supervised learning; HVAC fault diagnosis; Data-driven model; Artificial intelligence; CLASSIFICATION; SYSTEMS;
D O I
10.1016/j.apenergy.2023.122356
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Data-driven methods have drawn increasing interests in HVAC fault diagnosis tasks due to their intrinsic advantages in making real-time automated decisions. To ensure the reliability of data-driven models, it is essential to prepare sufficient labeled data for predictive modeling. In practice, it can be very time-consuming and laborintensive to determine the actual operating condition or label of each data sample (e.g., Normal or Faulty), making it highly challenging to develop robust data-driven solutions through conventional supervised learning methods. To tackle such challenges, this study proposes a data analytic framework to integrate active learning and semi-supervised learning to utilize massive unlabeled data for improved fault diagnosis performance. More specifically, five active learning methods have been tested to quantify their effectiveness in discovering valuable unlabeled data for expert labeling. Semi-supervised data-driven models have been developed to enable autonomous knowledge discovery from unlabeled building operational data through self-training protocols. Data experiments have been conducted to explore the separated and integrated values of active and semi-supervised learning. The results show that active learning can effectively identify valuable data samples for fault diagnosis and thereby, reducing approximately 50% labeling costs. Cost-effective combinatorial strategies have been derived to integrate active learning and semi-supervised learning for practical applications. The research outcomes are valuable for developing advanced data-driven solutions with substantial decreases in manual costs.
引用
收藏
页数:14
相关论文
共 56 条
[1]   Semi-Supervised Machine Learning for Fault Detection and Diagnosis of a Rooftop Unit [J].
Albayati, Mohammed G. G. ;
Faraj, Jalal ;
Thompson, Amy ;
Patil, Prathamesh ;
Gorthala, Ravi ;
Rajasekaran, Sanguthevar .
BIG DATA MINING AND ANALYTICS, 2023, 6 (02) :170-184
[2]   Fault detection and diagnosis for chiller based on feature-recognition model and Kernel Discriminant Analysis [J].
Bai, Xi ;
Zhang, Muxing ;
Jin, Zhenghao ;
You, Yilin ;
Liang, Caihua .
SUSTAINABLE CITIES AND SOCIETY, 2022, 79
[3]  
Blundell C, 2015, Arxiv, DOI arXiv:1505.05424
[4]   Early detection of faults in HVAC systems using an XGBoost model with a dynamic threshold [J].
Chakraborty, Debaditya ;
Elzarka, Hazem .
ENERGY AND BUILDINGS, 2019, 185 :326-344
[5]  
Chapelle O., 2006, SEMISUPERVISED LEARN, V20, P542, DOI [10.1109/TNN.2009.2015974, DOI 10.1109/TNN.2009.2015974]
[6]   Using discrete Bayesian networks for diagnosing and isolating cross-level faults in HVAC systems [J].
Chen, Yimin ;
Wen, Jin ;
Pradhan, Ojas ;
Lo, L. James ;
Wu, Teresa .
APPLIED ENERGY, 2022, 327
[7]   A review of data-driven fault detection and diagnostics for building HVAC systems [J].
Chen, Zhelun ;
O'Neill, Zheng ;
Wen, Jin ;
Pradhan, Ojas ;
Yang, Tao ;
Lu, Xing ;
Lin, Guanjing ;
Miyata, Shohei ;
Lee, Seungjae ;
Shen, Chou ;
Chiosa, Roberto ;
Piscitelli, Marco Savino ;
Capozzoli, Alfonso ;
Hengel, Franz ;
Kuehrer, Alexander ;
Pritoni, Marco ;
Liu, Wei ;
Clauss, John ;
Chen, Yimin ;
Herr, Terry .
APPLIED ENERGY, 2023, 339
[8]  
Comstrock MC, 1999, ASHRAE research project RP-1043
[9]   Novel transformer-based self-supervised learning methods for improved HVAC fault diagnosis performance with limited labeled data [J].
Fan, Cheng ;
Lei, Yutian ;
Sun, Yongjun ;
Mo, Like .
ENERGY, 2023, 278
[10]   Leveraging graph convolutional networks for semi-supervised fault diagnosis of HVAC systems in data-scarce contexts [J].
Fan, Cheng ;
Lin, Yiwen ;
Piscitelli, Marco Savino ;
Chiosa, Roberto ;
Wang, Huilong ;
Capozzoli, Alfonso ;
Ma, Yuanyuan .
BUILDING SIMULATION, 2023, 16 (08) :1499-1517