Physics-Guided Multi-Agent Adversarial Reinforcement Learning for Robust Active Voltage Control With Peer-to-Peer (P2P) Energy Trading

被引:6
作者
Chen, Pengcheng [1 ]
Liu, Shichao [1 ]
Wang, Xiaozhe [2 ]
Kamwa, Innocent [3 ]
机构
[1] Carleton Univ, Dept Elect, Ottawa, ON K1S5B6, Canada
[2] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 0G4, Canada
[3] Laval Univ, Elect & Comp Sci Engn Dept, Quebec City, PQ G1V 0A6, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Voltage control; Peer-to-peer computing; Games; Fluctuations; Deep reinforcement learning; Regulation; Reactive power; Distribution network; voltage regulation; P2P energy trading; multi-agent deep reinforcement learning; DISTRIBUTION NETWORKS; OPTIMIZATION; OPERATION;
D O I
10.1109/TPWRS.2024.3369591
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The utilization of peer-to-peer (P2P) energy trading in the active distribution network can facilitate profit sharing among numerous prosumers while effectively accommodating the integration of distributed renewable energy. However, the market-oriented P2P energy trading involved self-interested prosumers could unavoidably result in voltage violations of buses. To tackle this challenge, this paper proposes a physics-guided multi-agent deep reinforcement learning (MADRL) integrated with adversarial learning for both day-ahead trading and intra-day voltage regulation. In the day-ahead energy trading, an energy cost minimization framework is built and constrained with distributed generators and battery energy storage systems (BESSs) for the repeated rounds of multilateral negotiations among prosumers to reach an optimal trading solution without considering physical network limitations. Then, in the intra-day voltage regulation, the physics-guided multi-agent adversarial twin delayed deep deterministic (PG-MA2TD3) policy gradient algorithm is designed to overcome the voltage fluctuation problem and minimize the line loss via adjusting the active power from BESSs and reactive power from photovoltaic (PV) inverters. Moreover, the Jacobian matrix is exploited to measure the impact of neighbor bus active power variations on local voltage due to neighborhood trading in P2P transaction and a multi-agent adversarial learning (MAAL) approach is implemented to obtain an adaptive descend gradient corresponding to the action of the adjacent agents in Q function for increasing the robustness of trained policy. It is verified that the proposed method provides better robustness and the largest steady-state reward with comparison to various state-of-the-art methods on the IEEE 33-bus system with three-year data in Portuguese power system.
引用
收藏
页码:7089 / 7101
页数:13
相关论文
共 36 条
[1]   Energy Peer-to-Peer Trading in Virtual Microgrids in Smart Grids: A Game-Theoretic Approach [J].
Anoh, Kelvin ;
Maharjan, Sabita ;
Ikpehai, Augustine ;
Zhang, Yan ;
Adebisi, Bamidele .
IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (02) :1264-1275
[2]  
[Anonymous], 2018, 15472018 IEEE, DOI DOI 10.1109/IEEESTD.2018.8332112
[3]   A Cooperative P2P Trading Framework: Developed and Validated Through Hardware-in-Loop [J].
Azim, M. Imran ;
Alam, Mollah R. R. ;
Tushar, Wayes ;
Saha, Tapan K. K. ;
Yuen, Chau .
IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (04) :2999-3015
[4]   Coalition Graph Game-Based P2P Energy Trading With Local Voltage Management [J].
Azim, M. Imran ;
Tushar, Wayes ;
Saha, Tapan Kumar .
IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (05) :4389-4402
[5]   Distributed optimization and statistical learning via the alternating direction method of multipliers [J].
Boyd S. ;
Parikh N. ;
Chu E. ;
Peleato B. ;
Eckstein J. .
Foundations and Trends in Machine Learning, 2010, 3 (01) :1-122
[6]   Network Partition and Voltage Coordination Control for Distribution Networks With High Penetration of Distributed PV Units [J].
Chai, Yuanyuan ;
Guo, Li ;
Wang, Chengshan ;
Zhao, Zongzheng ;
Du, Xiaofeng ;
Pan, Jing .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2018, 33 (03) :3396-3407
[7]   Privacy-Preserving Distributed Energy Transaction in Active Distribution Networks [J].
Chang, Xinyue ;
Xu, Yinliang ;
Sun, Hongbin ;
Wu, Qiuwei .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2023, 38 (04) :3413-3426
[8]   A Byzantine-Resilient Distributed Peer-to-Peer Energy Management Approach [J].
Chang, Xinyue ;
Xu, Yinliang ;
Guo, Qinglai ;
Sun, Hongbin ;
Chan, Wai Kin .
IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (01) :623-634
[9]   Vertex scenario-based robust peer-to-peer transactive energy trading in distribution networks [J].
Chang, Xinyue ;
Xu, Yinliang ;
Sun, Hongbin .
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 138
[10]   Forecasting of photovoltaic power generation and model optimization: A review [J].
Das, Utpal Kumar ;
Tey, Kok Soon ;
Seyedmahmoudian, Mehdi ;
Mekhilef, Saad ;
Idris, Moh Yamani Idna ;
Van Deventer, Willem ;
Horan, Bend ;
Stojcevski, Alex .
RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2018, 81 :912-928