Two-Stage Deep Reinforcement Learning for Inverter-Based Volt-VAR Control in Active Distribution Networks

被引:102
作者
Liu, Haotian [1 ]
Wu, Wenchuan [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, State Key Lab Power Syst, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Reactive power; Training; Voltage control; Reinforcement learning; Safety; Markov processes; Games; transfer learning; deep reinforcement learning; reactive power; DISTRIBUTION-SYSTEMS; OPTIMIZATION; GAME;
D O I
10.1109/TSG.2020.3041620
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Model-based Vol/VAR optimization method is widely used to eliminate voltage violations and reduce network losses. However, the parameters of active distribution networks(ADNs) are not onsite identified, so significant errors may be involved in the model and make the model-based method infeasible. To cope with this critical issue, we propose a novel two-stage deep reinforcement learning (DRL) method to improve the voltage profile by regulating inverter-based energy resources, which consists of offline stage and online stage. In the offline stage, a highly efficient adversarial reinforcement learning algorithm is developed to train an offline agent robust to the model mismatch. In the sequential online stage, we transfer the offline agent safely as the online agent to perform continuous learning and controlling online with significantly improved safety and efficiency. Numerical simulations on IEEE test networks not only demonstrate that the proposed adversarial reinforcement learning algorithm outperforms the state-of-art algorithm, but also show that our proposed two-stage method achieves much better performance than the existing DRL based methods in the online application.
引用
收藏
页码:2037 / 2047
页数:11
相关论文
共 45 条
[1]   CasADi: a software framework for nonlinear optimization and optimal control [J].
Andersson, Joel A. E. ;
Gillis, Joris ;
Horn, Greg ;
Rawlings, James B. ;
Diehl, Moritz .
MATHEMATICAL PROGRAMMING COMPUTATION, 2019, 11 (01) :1-36
[2]   Model-Free Optimal Control of VAR Resources in Distribution Systems: An Extremum Seeking Approach [J].
Arnold, Daniel B. ;
Negrete-Pincetic, Matias ;
Sankur, Michael D. ;
Auslander, David M. ;
Callaway, Duncan S. .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2016, 31 (05) :3583-3593
[3]  
Auchariyamet S., 2010, P 45 INT U POW ENG C, P1
[4]   NETWORK RECONFIGURATION IN DISTRIBUTION-SYSTEMS FOR LOSS REDUCTION AND LOAD BALANCING [J].
BARAN, ME ;
WU, FF .
IEEE TRANSACTIONS ON POWER DELIVERY, 1989, 4 (02) :1401-1407
[6]  
Brockman G., 2016, Openai gym
[7]   Optimal placement of capacitors in radial distribution system using a Fuzzy-GA method [J].
Das, D. .
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2008, 30 (6-7) :361-367
[8]   Voltage/VAR Control in Distribution Networks via Reactive Power Injection Through Distributed Generators [J].
Deshmukh, Siddharth ;
Natarajan, Balasubramaniam ;
Pahwa, Anil .
IEEE TRANSACTIONS ON SMART GRID, 2012, 3 (03) :1226-1234
[9]   Deep-Reinforcement-Learning-Based Autonomous Voltage Control for Power Grid Operations [J].
Duan, Jiajun ;
Shi, Di ;
Diao, Ruisheng ;
Li, Haifeng ;
Wang, Zhiwei ;
Zhang, Bei ;
Bian, Desong ;
Yi, Zhehan .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (01) :814-817
[10]  
Duan Y., 2016, INT C MACH LEARN, V48, P1329, DOI [10.5555/3045390.3045531, DOI 10.1109/CVPR.2014.180]