A Novel Hybrid-Action-Based Deep Reinforcement Learning for Industrial Energy Management

Cited by: 8

Authors:

Lu, Renzhi ^{[1,2,3]}

Jiang, Zhenyu ^{[4]}

Yang, Tao ^{[5]}

Chen, Ying ^{[6]}

Wang, Dong ^{[7,8]}

Peng, Xin ^{[9]}

Affiliations:

[1] Huazhong Univ Sci & Technol, Engn Res Ctr Autonomous Intelligent Unmanned Syst, Sch Artificial Intelligence & Automat, Key Lab Image Proc & Intelligent Control, Wuhan 430074, Peoples R China

[2] Huazhong Univ Sci & Technol, Key Lab Syst Control & Informat Proc, Minist Educ, Shanghai 200240, Peoples R China

[3] Huazhong Univ Sci & Technol, Hubei Key Lab Adv Control & Intelligent Automat Co, Wuhan 430074, Peoples R China

[4] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China

[5] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China

[6] Tsinghua Univ, Elect Engn, Beijing 100084, Peoples R China

[7] Dalian Univ Technol, Key Lab Intelligent Control & Optimizat Ind Equipm, Minist Educ, Dalian 116024, Peoples R China

[8] Dalian Univ Technol, Sch Control Sci & Engn, Dalian 116024, Peoples R China

[9] East China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R China

Source:

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS | 2024 / Vol. 20 / No. 10

Funding:

National Natural Science Foundation of China;

Keywords:

Energy management; Costs; Power generation; Renewable energy sources; Optimization; Load modeling; Uncertainty; Deep reinforcement learning (DRL); energy management; hybrid actions; industrial energy system; DEMAND RESPONSE;

DOI:

10.1109/TII.2024.3424529

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

As environmental pollution becomes increasingly serious and industrial energy consumption continues to rise, an intelligent and efficient industrial energy management policy is urgently needed to reduce costs and maximize the benefits of industrial energy systems. However, modern industrial energy systems are characterized by hybrid industrial equipment actions, diverse objectives, and highly intermittent and stochastically distributed renewable energy sources, which make efficient operation and control difficult. This article presents a novel, model-free energy management policy that uses a hybrid-action deep reinforcement learning algorithm to schedule the energy use of industrial equipment operating in various modes. Specifically, the interaction between the industrial energy management center and each piece of equipment is modeled as a Markov decision process that minimizes the daily operating cost of the energy system and maximizes the revenue of the production equipment. Then, a double parameterized deep Q-network that does not require an explicit environmental model is developed to learn the hybrid action signals using actor and critic networks; its double Q-value mechanism avoids value overestimation and improves the algorithm's efficiency. In addition, the policy gradient of the proposed algorithm is derived and its convergence proof is discussed. Finally, numerical studies are conducted using real-world data to evaluate the algorithm's performance and verify its effectiveness.
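The abstract does not give implementation details, so the following is only a minimal sketch of how a parameterized deep Q-network might pick a hybrid action and form a double-Q target. It assumes a hybrid action is a discrete operating mode paired with one continuous setpoint, uses small random linear maps in place of trained actor and critic networks, and uses the Double DQN style of decoupling action selection from evaluation. All names and sizes (`STATE_DIM`, `N_MODES`, `make_actor`, `make_critic`, `select_hybrid_action`, `double_q_target`) are hypothetical and are not taken from the paper.

```python
import numpy as np

STATE_DIM, N_MODES = 4, 3  # hypothetical sizes, not from the paper


def make_actor(rng):
    """Stand-in actor: one continuous parameter in [-1, 1] per discrete mode."""
    W = rng.normal(scale=0.1, size=(N_MODES, STATE_DIM))
    return lambda s: np.tanh(W @ s)


def make_critic(rng):
    """Stand-in critic: one Q-value per mode, given the state and all parameters."""
    W = rng.normal(scale=0.1, size=(N_MODES, STATE_DIM + N_MODES))
    return lambda s, x: W @ np.concatenate([s, x])


def select_hybrid_action(actor, critic, s):
    """Greedy hybrid action: the best mode k and its continuous parameter x[k]."""
    x = actor(s)        # continuous parameters proposed for every mode
    q = critic(s, x)    # Q(s, k, x_k) for every mode k
    k = int(np.argmax(q))
    return k, float(x[k])


def double_q_target(r, s_next, done, gamma,
                    actor_target, critic_online, critic_target):
    """Double-Q target: the online critic selects the next mode and the
    target critic evaluates it, which limits value overestimation."""
    x_next = actor_target(s_next)
    k_star = int(np.argmax(critic_online(s_next, x_next)))
    q_eval = float(critic_target(s_next, x_next)[k_star])
    return r + (0.0 if done else gamma * q_eval)
```

Because the evaluated value is taken at the mode chosen by a different network, the resulting target can never exceed the plain max-based target computed from the target critic alone, which is the usual intuition for why a double Q mechanism curbs overestimation.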


Pages: 12461-12475

Page count: 15

