Poster Abstract: Data Efficient HVAC Control using Gaussian Process-based Reinforcement Learning

Cited by: 0
Authors
An, Zhiyu [1]
Ding, Xianzhong [1]
Du, Wan [1]
Affiliations
[1] Univ Calif Merced, Merced, CA 95343 USA
Keywords
Epistemic uncertainty estimation; Model-based reinforcement learning; HVAC control; Model predictive control
DOI
10.1145/3625687.3628403
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Model-based Reinforcement Learning (MBRL) has been widely studied for energy-efficient control of Heating, Ventilation, and Air Conditioning (HVAC) systems. A fundamental issue with current approaches is the large amount of data required to train an accurate model of the building system dynamics. In this work, we developed a data-efficient system that achieves excellent HVAC control performance with only days of training data. We use a Gaussian Process (GP) as the dynamics model, which provides an uncertainty estimate for each prediction. To improve data efficiency, we designed a meta kernel learning technique for GP kernel selection. To incorporate uncertainty into the control decisions, we designed a model predictive control method that considers the uncertainty of every prediction. Simulation experiments show that our method achieves excellent data efficiency, yielding similar energy savings and 12.07% less human comfort violation than the state-of-the-art MBRL method, while being trained on only a seven-day dataset.
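As a rough illustration of the two ideas in the abstract (a GP dynamics model that reports per-prediction uncertainty, and a model predictive controller that takes that uncertainty into account), the following is a minimal sketch, not the authors' implementation. Everything specific in it is an assumption made for illustration: the toy zone model `true_step`, the fixed RBF kernel with hand-picked length scales (standing in for the meta kernel learning step, which the abstract does not detail), the random-shooting planner, the comfort band, and the pessimistic `beta`-sigma comfort penalty.

```python
import numpy as np

def rbf(A, B, ls):
    """Squared-exponential kernel with per-dimension length scales ls."""
    d2 = ((A[:, None, :] - B[None, :, :]) / ls) ** 2
    return np.exp(-0.5 * d2.sum(axis=2))

class GPDynamics:
    """Zero-mean GP mapping (zone temperature, heating action) to temperature change."""
    def __init__(self, X, y, ls, noise=1e-2):
        self.X, self.ls = X, np.asarray(ls, dtype=float)
        K = rbf(X, X, self.ls) + noise * np.eye(len(X))
        self.L = np.linalg.cholesky(K)
        self.alpha = np.linalg.solve(self.L.T, np.linalg.solve(self.L, y))

    def predict(self, Xq):
        """Return the posterior mean and standard deviation at query points Xq."""
        Ks = rbf(Xq, self.X, self.ls)
        mean = Ks @ self.alpha
        v = np.linalg.solve(self.L, Ks.T)
        var = 1.0 - (v ** 2).sum(axis=0)  # prior variance of this kernel is 1
        return mean, np.sqrt(np.clip(var, 0.0, None))

def true_step(temp, u):
    """Hypothetical toy zone: heat loss toward 10 degC outdoors plus heater gain."""
    return temp + 0.1 * (10.0 - temp) + 1.5 * u

def plan(gp, temp0, rng, horizon=4, n_candidates=200, beta=2.0, band=(20.0, 23.0)):
    """Random-shooting MPC: sample action sequences, roll out the GP mean,
    and penalise comfort violations of the pessimistic mean +/- beta*std bounds."""
    actions = rng.uniform(0.0, 1.0, size=(n_candidates, horizon))
    temp = np.full(n_candidates, float(temp0))
    cost = np.zeros(n_candidates)
    for t in range(horizon):
        mean, std = gp.predict(np.column_stack([temp, actions[:, t]]))
        temp = temp + mean
        low, high = temp - beta * std, temp + beta * std
        violation = np.maximum(band[0] - low, 0.0) + np.maximum(high - band[1], 0.0)
        cost += actions[:, t] + 10.0 * violation  # energy use + weighted discomfort
    best = int(np.argmin(cost))
    return actions[best, 0], cost[best]

# Small synthetic training set drawn from the toy zone model.
rng = np.random.default_rng(0)
temps = rng.uniform(16.0, 24.0, 60)
acts = rng.uniform(0.0, 1.0, 60)
X = np.column_stack([temps, acts])
y = true_step(temps, acts) - temps  # learn the temperature change

gp = GPDynamics(X, y, ls=[4.0, 1.0])
mean_near, std_near = gp.predict(np.array([[20.0, 0.5]]))  # inside the data
mean_far, std_far = gp.predict(np.array([[40.0, 0.5]]))    # far from the data
first_action, best_cost = plan(gp, temp0=19.0, rng=rng)
```

In this sketch the predictive standard deviation grows for queries far from the training data, so the planner treats poorly explored regions as riskier for comfort. Only the first action of the best sequence would be applied before re-planning at the next step, as is usual in receding-horizon control.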
Pages: 538-539
Page count: 2