Multi-Agent Deep Reinforcement Learning-Based Distributed Optimal Generation Control of DC Microgrids

被引:24
作者
Fan, Zhen [1 ]
Zhang, Wei [2 ]
Liu, Wenxin [1 ]
机构
[1] Lehigh Univ, Dept Elect & Comp Engn, Smart Microgrid & Renewable Technol Res Lab, Bethlehem, PA 18015 USA
[2] Operat Technol ETAP, Res & Dev, Irvine, CA 92618 USA
关键词
Multi-agent deep reinforcement learning; DC microgrids; data-efficient; optimal control; distributed system; VOLTAGE CONTROL; SYSTEM; TIME;
D O I
10.1109/TSG.2023.3237200
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Optimal generation allocation is customarily applied to the DC microgrids to enable optimal operation. Conventionally, the optimization is implemented periodically to obtain the optimal bus voltage or output power references for given operating conditions, which will unavoidably deviate from the actual ones. Since slight disturbances such as load changes will trigger real-time control adjustments, hence, the overall cost will increase due to the disconnect between optimization and real-time control. To overcome this issue, it is preferable to directly apply the optimal control method to render an optimal time path of control actions in real-time. This paper has studied the optimal generation control problem as a constrained non-convex problem with non-linearity. DRL has been successfully applied to solve such problems without mathematically modeling the actual system; the links between states and actions are discovered via ongoing environmental interactions, decreasing the reliance on system parameter information. This paper also showed that TD3-based optimal control could be applied to DC microgrids using a monotonical policy gradient search approach. Furthermore, DRL's distributed training and execution framework is designed to realize real-time distributed control. The data sampling, storage, and experience buffer initialization strategy are customized to improve learning efficiency. The case study demonstrated its effectiveness.
引用
收藏
页码:3337 / 3351
页数:15
相关论文
共 35 条
[1]   The Load as an Energy Asset in a Distributed DC SmartGrid Architecture [J].
Balog, Robert S. ;
Weaver, Wayne W. ;
Krein, Philip T. .
IEEE TRANSACTIONS ON SMART GRID, 2012, 3 (01) :253-260
[2]   Double Deep Q-Learning-Based Distributed Operation of Battery Energy Storage System Considering Uncertainties [J].
Bui, Van-Hai ;
Hussain, Akhtar ;
Kim, Hak-Man .
IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (01) :457-469
[3]   Autonomous DC Voltage Control of a DC Microgrid With Multiple Slack Terminals [J].
Chen, Dong ;
Xu, Lie .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2012, 27 (04) :1897-1905
[4]   Design and Implementation of Energy Management System With Fuzzy Control for DC Microgrid Systems [J].
Chen, Yu-Kai ;
Wu, Yung-Chun ;
Song, Chau-Chung ;
Chen, Yu-Syun .
IEEE TRANSACTIONS ON POWER ELECTRONICS, 2013, 28 (04) :1563-1570
[5]   Reinforcement learning in continuous time and space [J].
Doya, K .
NEURAL COMPUTATION, 2000, 12 (01) :219-245
[6]   Distributed Control of DC Microgrids for Optimal Coordination of Conventional and Renewable Generators [J].
Fan, Zhen ;
Fan, Bo ;
Liu, Wenxin .
IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (06) :4607-4615
[7]   Operation Loss Minimization Targeted Distributed Optimal Control of DC Microgrids [J].
Fan, Zhen ;
Fan, Bo ;
Peng, Jiangkai ;
Liu, Wenxin .
IEEE SYSTEMS JOURNAL, 2021, 15 (04) :5186-5196
[8]  
Fujimoto S, 2018, PR MACH LEARN RES, V80
[9]  
He H., 2012, Reinforcement Learning and Approximate Dynamic Programming for Feedback Control, P78
[10]  
Kamien M., 1991, Dynamic Optimization: The Calculus of Variation and Optimal Control in Economics and Management