Online Multi-Agent Reinforcement Learning for Decentralized Inverter-Based Volt-VAR Control

被引:114
作者
Liu, Haotian [1 ]
Wu, Wenchuan [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, State Key Lab Power Syst, Beijing 100084, Peoples R China
基金
美国国家科学基金会;
关键词
Voltage control; Training; Markov processes; Reactive power; Optimization; Games; Decentralized control; multi-agent reinforcement learning; reactive power; distributed control; CONTROL SCHEME; SYSTEMS;
D O I
10.1109/TSG.2021.3060027
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The distributed Volt/Var control (VVC) methods have been widely studied for active distribution networks(ADNs), which is based on perfect model and real-time P2P communication. However, the model is always incomplete with significant parameter errors and such P2P communication system is hard to maintain. In this article, we propose an online multi-agent reinforcement learning and decentralized control framework (OLDC) for VVC. In this framework, the VVC problem is formulated as a constrained Markov game and we propose a novel multi-agent constrained soft actor-critic (MACSAC) reinforcement learning algorithm. MACSAC is used to train the control agents online, so the accurate ADN model is no longer needed. Then, the trained agents can realize decentralized optimization using local measurements without real-time P2P communication. The OLDC with MACSAC has shown extraordinary flexibility, efficiency and robustness to various computing and communication conditions. Numerical simulations on IEEE test cases not only demonstrate that the proposed MACSAC outperforms the state-of-art learning algorithms, but also support the superiority of our OLDC framework in the online application.
引用
收藏
页码:2980 / 2990
页数:11
相关论文
共 39 条
[1]  
Achiam J, 2017, PR MACH LEARN RES, V70
[2]   CasADi: a software framework for nonlinear optimization and optimal control [J].
Andersson, Joel A. E. ;
Gillis, Joris ;
Horn, Greg ;
Rawlings, James B. ;
Diehl, Moritz .
MATHEMATICAL PROGRAMMING COMPUTATION, 2019, 11 (01) :1-36
[3]  
[Anonymous], 2012, P 2012 N AM POWER S, DOI DOI 10.1109/NAPS.2012.6336354
[4]   A Decentralized Robust Mixed H2/H∞ Voltage Control Scheme to Improve Small/Large-Signal Stability and FRT Capability of Islanded Multi-DER Microgrid Considering Load Disturbances [J].
Baghaee, Hamid Reza ;
Mirsalim, Mojtaba ;
Gharehpetian, Gevork B. ;
Talebi, Heidar Ali .
IEEE SYSTEMS JOURNAL, 2018, 12 (03) :2610-2621
[5]   Unbalanced harmonic power sharing and voltage compensation of microgrids using radial basis function neural network-based harmonic power-flow calculations for distributed and decentralised control structures [J].
Baghaee, Hamid Reza ;
Mirsalim, Mojtaba ;
Gharehpetian, Gevork B. ;
Talebi, Heidar Ali .
IET GENERATION TRANSMISSION & DISTRIBUTION, 2018, 12 (07) :1518-1530
[6]   Power Calculation Using RBF Neural Networks to Improve Power Sharing of Hierarchical Control Scheme in Multi-DER Microgrids [J].
Baghaee, Hamid Reza ;
Mirsalim, Mojtaba ;
Gharehpetian, Gevork B. .
IEEE JOURNAL OF EMERGING AND SELECTED TOPICS IN POWER ELECTRONICS, 2016, 4 (04) :1217-1225
[7]   NETWORK RECONFIGURATION IN DISTRIBUTION-SYSTEMS FOR LOSS REDUCTION AND LOAD BALANCING [J].
BARAN, ME ;
WU, FF .
IEEE TRANSACTIONS ON POWER DELIVERY, 1989, 4 (02) :1401-1407
[8]  
Brockman Greg, 2016, arXiv
[9]  
Dimeas AL, 2010, IEEE POW ENER SOC GE
[10]  
Eitan A., 1999, STOCHASTIC MODELING