Distributed Optimization for Distribution Grids With Stochastic DER Using Multi-Agent Deep Reinforcement Learning

被引:17
作者
Al-Saffar, Mohammed [1 ]
Musilek, Petr [1 ,2 ]
机构
[1] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2V4, Canada
[2] Univ Hradec Kralove, Dept Cybernet, Hradec Kralove 50003, Czech Republic
基金
加拿大自然科学与工程研究理事会;
关键词
Optimization; Microgrids; Heuristic algorithms; Stochastic processes; Power systems; Real-time systems; Convex functions; Distributed architecture; distributed optimization; Monte Carlo tree search; multi-agent deep reinforcement learning; optimal power flow; MODEL-PREDICTIVE CONTROL; MICROGRIDS; CLASSIFICATION; ALGORITHMS; SYSTEMS; TIME; ADMM;
D O I
10.1109/ACCESS.2021.3075247
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article develops a special decomposition methodology for the traditional optimal power flow which facilitates optimal integration of stochastic distributed energy resources in power distribution systems. The resulting distributed optimal power flow algorithm reduces the computational complexity of the conventional linear programming approach while avoiding the challenges associated with the stochastic nature of the energy resources and loads. It does so using machine learning algorithms employed for two crucial tasks. First, two proposed algorithms, Dynamic Distributed Multi-Microgrid and Monte Carlo Tree Search based Reinforcement Learning, constitute dynamic microgrids of network nodes to confirm the electric power transaction optimality. Second, the optimal distributed energy resources are obtained by the proposed deep reinforcement learning method named Multi Leader-Follower Actors under Centralized Critic. It accelerates conventional linear programming approach by considering a reduced set of resources and their constraints. The proposed method is demonstrated through a real-time balancing electricity market constructed over the IEEE 123-bus system and enhanced using price signals based on distribution locational marginal prices. This application clearly shows the ability of the new approach to effectively coordinate multiple distribution system entities while maintaining system security constraints.
引用
收藏
页码:63059 / 63072
页数:14
相关论文
共 53 条
[1]  
Achlerkar P. D., 2018, P 8 IEEE IND INT C P, P1
[2]   Optimal WDG planning in active distribution networks based on possibilistic-probabilistic PEVs load modelling [J].
Ahmadian, Ali ;
Sedghi, Mahdi ;
Elkamel, Ali ;
Aliakbar-Golkar, Masoud ;
Fowler, Michael .
IET GENERATION TRANSMISSION & DISTRIBUTION, 2017, 11 (04) :865-875
[3]  
Al-saffar M., 2019, 2019 IEEE CAN C ELEC, P1, DOI DOI 10.1109/CCECE.2019.8861957
[4]   Reinforcement Learning-Based Distributed BESS Management for Mitigating Overvoltage Issues in Systems With High PV Penetration [J].
Al-Saffar, Mohammed ;
Musilek, Petr .
IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (04) :2980-2994
[5]   Demand Response Strategy Based on Reinforcement Learning and Fuzzy Reasoning for Home Energy Management [J].
Alfaverh, Fayiz ;
Denai, M. ;
Sun, Yichuang .
IEEE ACCESS, 2020, 8 :39310-39321
[6]  
Blumsack S., 2009, 2009 IEEE POW EN SOC, P1, DOI DOI 10.1109/PES.2009.5275353
[7]   A Survey of Monte Carlo Tree Search Methods [J].
Browne, Cameron B. ;
Powley, Edward ;
Whitehouse, Daniel ;
Lucas, Simon M. ;
Cowling, Peter I. ;
Rohlfshagen, Philipp ;
Tavener, Stephen ;
Perez, Diego ;
Samothrakis, Spyridon ;
Colton, Simon .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43
[8]   On linear-time deterministic algorithms for optimization problems in fixed dimension [J].
Chazelle, B ;
Matousek, J .
JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 1996, 21 (03) :579-597
[9]   Multi-Attribute Partitioning of Power Networks Based on Electrical Distance [J].
Cotilla-Sanchez, Eduardo ;
Hines, Paul D. H. ;
Barrows, Clayton ;
Blumsack, Seth ;
Patel, Mahendra .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2013, 28 (04) :4979-4987
[10]  
Foerster JN, 2018, AAAI CONF ARTIF INTE, P2974