An Alternative Reinforcement Learning (ARL) control strategy for data center air-cooled HVAC systems

被引：8

作者：

Lu, Ruyuan ^{[1
]}

Li, Xin ^{[1
]}

Chen, Ronghao ^{[1
]}

Lei, Aimin ^{[2
]}

Ma, Xiaoming ^{[1
]}

机构：

[1] Peking Univ, Sch Environm & Energy, Shenzhen 518055, Peoples R China

[2] Vertiv Tech Co Ltd, Shenzhen 518055, Peoples R China

来源：

ENERGY | 2024年 / 308卷

关键词：

Reinforcement learning (RL); Deep deterministic policy gradient (DDPG); Proximal policy optimization (PPO); Air-cooled HVAC systems; Data center; ENERGY; INTERNET; COMFORT;

D O I：

10.1016/j.energy.2024.132977

中图分类号：

O414.1 [热力学];

学科分类号：

摘要：

Energy efficiency of data center is of great concern globally due to their large amount of energy consumption and the foreseeable growth in the demand of digital services in the future. Advanced control strategies are needed to reduce energy consumption. However, the optimization of data center HVAC control is a challenge task due to the complexity of the thermal dynamic models of buildings and uncertainties associated with both server loads and outdoor temperature. In this paper, we propose a new control strategy called Alternately- Reinforcement Learning-control(ARL), which realizes the alternating control of RL and proportional, integral and derivative (PID) for optimizing the control of air-cooled HVAC systems in data centers. The control object of the ARL is the speed set of the compressor and the condensing fan, and the control goal is to minimize energy consumption while maintaining temperature stability. The applied RL algorithm is Deep deterministic policy gradient (DDPG) and Proximal policy optimization (PPO). We pre-train the ARL strategy in offline environment firstly and deploy it in real data center environment for online testing. The test results show that the ARL has significant advantages in maintaining temperature stability while reducing energy consumption, and the PPO algorithm performs better than the DDPG algorithm. Compared with the PID algorithm, the PPO algorithm can save energy by 5.27%, and the temperature control effect can be improved by 3.27%,which indicates the feasibility of the ARL for implementation in real data centers.

引用

页数：15

共 41 条

[21] Optimized tracking control using reinforcement learning strategy for a class of nonlinear systems [J].

Yang, Xue ;

Li, Bin .

ASIAN JOURNAL OF CONTROL, 2023, 25 (03) :2095-2104

[22] A multi-setpoint cooling control approach for air-cooled data centers using the deep Q-network algorithm [J].

Chen, Yaohua ;

Guo, Weipeng ;

Liu, Jinwen ;

Shen, Songyu ;

Lin, Jianpeng ;

Cui, Delong .

MEASUREMENT & CONTROL, 2024, 57 (06) :782-793

[23] Transfer learning for occupancy-based HVAC control: A data-driven approach using unsupervised learning of occupancy profiles and deep reinforcement learning [J].

Esrafilian-Najafabadi, Mohammad ;

Haghighat, Fariborz .

ENERGY AND BUILDINGS, 2023, 300

[24] Thermal full-field prediction of an air-cooled data center using a novel multi-scale approach based on POD and CFD coupling [J].

Dai, Yanjun ;

Zhao, Jie ;

Zhang, Xiuli ;

Bai, Fan ;

Tao, Wenquan ;

Wang, Yungang .

ENERGY AND BUILDINGS, 2024, 307

[25] Reinforcement Learning-based Data-driven Control Design for Motion Control Systems [J].

Deng, Zhengqi ;

Huo, Xin ;

Du, Qinlong ;

Liu, Qingquan .

PROCEEDINGS OF THE 36TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC 2024, 2024, :5745-5749

[26] Towards self-learning control of HVAC systems with the consideration of dynamic occupancy patterns: Application of model-free deep reinforcement learning [J].

Esrafilian-Najafabadi, Mohammad ;

Haghighat, Fariborz .

BUILDING AND ENVIRONMENT, 2022, 226

[27] Green Data Center Cooling Control via Physics-guided Safe Reinforcement Learning [J].

Wang, Ruihang ;

Cao, Zhiwei ;

Zhou, Xin ;

Wen, Yonggang ;

Tan, Rui .

ACM TRANSACTIONS ON CYBER-PHYSICAL SYSTEMS, 2024, 8 (02)

[28] Deep reinforcement learning optimal control strategy for temperature setpoint real-time reset in multi-zone building HVAC system [J].

Fang, Xi ;

Gong, Guangcai ;

Li, Guannan ;

Chun, Liang ;

Peng, Pei ;

Li, Wenqiang ;

Shi, Xing ;

Chen, Xiang .

APPLIED THERMAL ENGINEERING, 2022, 212

[29] Toward Physics-Guided Safe Deep Reinforcement Learning for Green Data Center Cooling Control [J].

Wang, Ruihang ;

Zhang, Xinyi ;

Zhou, Xin ;

Wen, Yonggang ;

Tan, Rui .

2022 13TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS 2022), 2022, :159-169

[30] Observer-Based Optimal Backstepping Security Control for Nonlinear Systems Using Reinforcement Learning Strategy [J].

Wei, Qinglai ;

Chen, Wendi ;

Tan, Xiangmin ;

Xiao, Jun ;

Dong, Qi .

IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (11) :7011-7023

← 1 2 3 4 5 →