Optimization control method for dedicated outdoor air system in multi-zone office buildings based on deep reinforcement learning

被引：1

作者：

Tang, Xudong ^{[1
,2
,3
]}

Zhang, Ling ^{[1
,2
,3
]}

Luo, Yongqiang ^{[4
]}

机构：

[1] Hunan Univ, Coll Civil Engn, Changsha 410082, Peoples R China

[2] Hunan Univ, Natl Ctr Int Res Collaborat Bldg Safety & Environm, Changsha 410082, Peoples R China

[3] Hunan Univ, Key Lab Bldg Safety & Energy Efficiency, Minist Educ, Changsha 410082, Peoples R China

[4] Huazhong Univ Sci & Technol, Sch Environm Sci & Engn, Wuhan 430074, Peoples R China

来源：

BUILDING SIMULATION | 2025年 / 18卷 / 04期

基金：

中国国家自然科学基金;

关键词：

multi-zone HVAC systems; energy consumption; thermal comfort; indoor air quality; multi-agent deep reinforcement learning; VENTILATION; COMFORT; MODEL;

D O I：

10.1007/s12273-025-1231-0

中图分类号：

O414.1 [热力学];

学科分类号：

摘要：

Heating, ventilation, and air conditioning (HVAC) systems consume a significant amount of energy to maintain thermal comfort and indoor air quality in buildings, which results in high operational costs. Reinforcement learning is an effective method for controlling HVAC systems. However, in large and complex HVAC systems, traditional reinforcement learning algorithms often face the challenges of slow training speed and poor convergence performance. This paper proposes a multi-objective optimization control method based on the multi-agent deep deterministic policy gradient (MADDPG) algorithm, which aims to minimize HVAC energy consumption while ensuring optimal thermal comfort and indoor air quality in each zone. Using a multi-zone office building with fan coil units and a dedicated outdoor air system as a case study, we developed an EnergyPlus-Python co-simulation platform. The proposed control method was employed during both the heating and cooling seasons to independently control the temperature setpoints and fresh airflow in different zones of the office building. The simulation results from both the heating and cooling seasons demonstrate that the MADDPG control method exhibits faster convergence during training and excellent learning capabilities, allowing it to adapt effectively to changes in environmental conditions and implement appropriate control actions. Under similar indoor thermal comfort and air quality conditions, the MADDPG control method consumes less energy than the traditional reinforcement learning method, it saves 24.1% of energy during the heating season and 8.9% during the cooling season compared to the rule-based control method. Additionally, by adjusting the reward function in the MADDPG algorithm, it is possible to flexibly balance energy consumption, thermal comfort, and air quality preferences, demonstrating the algorithm's strong applicability.

引用

页码：881 / 896

页数：16

共 31 条

[1]

An Z., 2023, P 10 ACM INT C SYSTE

[2] Advanced Reinforcement Learning Solution for Clock Skew Engineering: Modified Q-Table Update Technique for Peak Current and IR Drop Minimization [J].

Beheshti-Shirazi, Sayed Aresh ;

Nazari, Najmeh ;

Gubbi, Kevin Immanuel ;

Latibari, Banafsheh Saber ;

Rafatirad, Setareh ;

Homayoun, Houman ;

Sasan, Avesta ;

Manoj, P. D. Sai .

IEEE ACCESS, 2023, 11 :87869-87886

[3] A laboratory test of an Offline-trained Multi-Agent Reinforcement Learning Algorithm for Heating Systems [J].

Blad, C. ;

Bogh, S. ;

Kallesoe, C. ;

Raftery, Paul .

APPLIED ENERGY, 2023, 337

[4] Deep reinforcement learning to optimise indoor temperature control and heating energy consumption in buildings [J].

Brandi, Silvio ;

Piscitelli, Marco Savino ;

Martellacci, Marco ;

Capozzoli, Alfonso .

ENERGY AND BUILDINGS, 2020, 224

[5] Dynamic indoor thermal environment using Reinforcement Learning-based controls: Opportunities and challenges [J].

Chatterjee, Arnab ;

Khovalyg, Dolaana .

BUILDING AND ENVIRONMENT, 2023, 244

[6] An Online Reinforcement Learning Method for Multi-Zone Ventilation Control With Pre-Training [J].

Cui, Can ;

Li, Chunxiao ;

Li, Ming .

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (07) :7163-7172

[7] Reinforcement learning for energy conservation and comfort in buildings [J].

Dalamagkidis, K. ;

Kolokotsa, D. ;

Kalaitzakis, K. ;

Stavrakakis, G. S. .

BUILDING AND ENVIRONMENT, 2007, 42 (07) :2686-2698

[8] Towards optimal HVAC control in non-stationary building environments combining active change detection and deep reinforcement learning [J].

Deng, Xiangtian ;

Zhang, Yi ;

Qi, He .

BUILDING AND ENVIRONMENT, 2022, 211

[9] Multi-Zone HVAC Control With Model-Based Deep Reinforcement Learning [J].

Ding, Xianzhong ;

Cerpa, Alberto ;

Du, Wan .

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, :4408-4426

[10] Intelligent multi-zone residential HVAC control strategy based on deep reinforcement learning [J].

Du, Yan ;

Zandi, Helia ;

Kotevska, Olivera ;

Kurte, Kuldeep ;

Munk, Jeffery ;

Amasyali, Kadir ;

Mckee, Evan ;

Li, Fangxing .

APPLIED ENERGY, 2021, 281

← 1 2 3 4 →