Physics-Shielded Multi-Agent Deep Reinforcement Learning for Safe Active Voltage Control With Photovoltaic/Battery Energy Storage Systems

被引：20

作者：

Chen, Pengcheng ^{[1
]}

Liu, Shichao ^{[1
]}

Wang, Xiaozhe ^{[2
]}

Kamwa, Innocent ^{[3
]}

机构：

[1] Carleton Univ, Dept Elect, Ottawa, ON K1S 5B6, Canada

[2] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 0G4, Canada

[3] Laval Univ, Elect & Comp Sci Engn Dept, Quebec City, PQ G1V 0A6, Canada

来源：

IEEE TRANSACTIONS ON SMART GRID | 2023年 / 14卷 / 04期

基金：

加拿大自然科学与工程研究理事会;

关键词：

Active voltage control; distribution systems; multi-agent deep reinforcement learning; shield; battery energy storage systems; DISTRIBUTION NETWORKS; HIGH PENETRATION; VOLT/VAR CONTROL;

D O I：

10.1109/TSG.2022.3228636

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

While many multi-agent deep reinforcement learning (MADRL) algorithms have been implemented for active voltage control (AVC) in power distribution systems, the safety of electrical components involved in the operation of these algorithms are mostly ignored. In this work, a safe MADRL control scheme is proposed to regulate the reactive and active power control of photovoltaics (PVs) to alleviate power congestion and improve voltage quality by coordinating battery energy storage systems (BESSs) and static var compensators (SVCs). Uniquely, the learning algorithm designed in this paper can limit the action of the agent when approaching a dangerous state to ensure the safety of BESSs during the training process, which is realized by developing a multi-agent twin delayed deep deterministic (MATD3) policy gradient algorithm with a physics-based shielding mechanism. Specifically, actions that lead to dangerous states, the state-of-charge (SoC) of BESSs is fully loaded or drained, are replaced by the shielding mechanism with safe actions while maintaining system stability. Furthermore, each PVs node in the power distribution network is treated as an agent under the fact of reactive and active power sensitivities to voltage in the MATD3 algorithm, which is beneficial for improving scalability. Training, testing and comparative results on IEEE 33-bus and 141-bus with real-world data are provided to demonstrate the effectiveness and superiority of the proposed algorithm.

引用

页码：2656 / 2667

页数：12

共 35 条

[1] Ensemble Joint Sparse Low-Rank Matrix Decomposition for Thermography Diagnosis System [J].

Ahmed, Junaid ;

Gao, Bin ;

Woo, Wai Lok ;

Zhu, Yuyu .

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (03) :2648-2658

[2] Adaptive Real Power Capping Method for Fair Overvoltage Regulation of Distribution Networks With High Penetration of PV Systems [J].

Alyami, Saeed ;

Wang, Yang ;

Wang, Caisheng ;

Zhao, Junhui ;

Zhao, Bo .

IEEE TRANSACTIONS ON SMART GRID, 2014, 5 (06) :2729-2738

[3] Development of New Identification Method for Global Group of Controls for Online Coordinated Voltage Control in Active Distribution Networks [J].

Alzaareer, Khaled ;

Saad, Maarouf ;

Mehrjerdi, Hasan ;

Asber, Dalal ;

Lefebvre, Serge .

IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (05) :3921-3931

[4] Deep Reinforcement Learning Enabled Physical-Model-Free Two-Timescale Voltage Control Method for Active Distribution Systems [J].

Cao, Di ;

Zhao, Junbo ;

Hu, Weihao ;

Yu, Nanpeng ;

Ding, Fei ;

Huang, Qi ;

Chen, Zhe .

IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (01) :149-165

[5] Data-Driven Multi-Agent Deep Reinforcement Learning for Distribution System Decentralized Voltage Control With High Penetration of PVs [J].

Cao, Di ;

Zhao, Junbo ;

Hu, Weihao ;

Ding, Fei ;

Huang, Qi ;

Chen, Zhe ;

Blaabjerg, Frede .

IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (05) :4137-4150

[6] A Multi-Agent Deep Reinforcement Learning Based Voltage Regulation Using Coordinated PV Inverters [J].

Cao, Di ;

Hu, Weihao ;

Zhao, Junbo ;

Huang, Qi ;

Chen, Zhe ;

Blaabjerg, Frede .

IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (05) :4120-4123

[7] Network Partition and Voltage Coordination Control for Distribution Networks With High Penetration of Distributed PV Units [J].

Chai, Yuanyuan ;

Guo, Li ;

Wang, Chengshan ;

Zhao, Zongzheng ;

Du, Xiaofeng ;

Pan, Jing .

IEEE TRANSACTIONS ON POWER SYSTEMS, 2018, 33 (03) :3396-3407

[8] Deep-Reinforcement-Learning-Based Autonomous Voltage Control for Power Grid Operations [J].

Duan, Jiajun ;

Shi, Di ;

Diao, Ruisheng ;

Li, Haifeng ;

Wang, Zhiwei ;

Zhang, Bei ;

Bian, Desong ;

Yi, Zhehan .

IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (01) :814-817

[9]

Elsayed-Aly I., 2021, arXiv

[10]

Fujimoto S, 2018, PR MACH LEARN RES, V80

← 1 2 3 4 →