Model-Free Economic Dispatch for Virtual Power Plants: An Adversarial Safe Reinforcement Learning Approach

被引:14
作者
Yi, Zhongkai [1 ]
Xu, Ying [1 ]
Wu, Chenyu [2 ]
机构
[1] Harbin Inst Technol, Sch Elect Engn & Automat, Harbin 150001, Peoples R China
[2] Southeast Univ, Sch Elect Engn, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Economic dispatch; virtual power plant; safe reinforcement learning; adversarial learning; FLEXIBILITY; SYSTEMS;
D O I
10.1109/TPWRS.2023.3289334
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To address the model inaccuracy and uncertainty of virtual power plants (VPPs), a model-free economic dispatch approach for multiple VPPs is studied in this article, which does not rely on an accurate environmental model. An adversarial safe reinforcement learning approach is proposed, which promotes the safety of the actions and makes the model robust to deviations between the training and testing environments. Moreover, a two-stage reinforcement learning framework is formulated based on the proposed algorithm. The dispatch policy is pretrained in the simulator and then fine-tuned in the real-world environment. The numerical simulations illustrate that the proposed approach is adaptive to the deviation between the training and testing environments, and it provides higher robustness to the noise of the network parameters and uncertainty of the VPPs' power outputs. The scalability and superiority of the proposed approach are verified by comparing it with existing methods.
引用
收藏
页码:3153 / 3168
页数:16
相关论文
共 45 条
[1]   Secondary Frequency Control of Microgrids: An Online Reinforcement Learning Approach [J].
Adibi, Mahya ;
van der Woude, Jacob .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (09) :4824-4831
[2]   Economic-Emission Dispatch Problem in Power Systems With Carbon Capture Power Plants [J].
Akbari-Dibavar, Alireza ;
Mohammadi-ivatloo, Behnam ;
Zare, Kazem ;
Khalili, Tohid ;
Bidram, Ali .
IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2021, 57 (04) :3341-3351
[3]   Reinforcement Learning-Based Distributed BESS Management for Mitigating Overvoltage Issues in Systems With High PV Penetration [J].
Al-Saffar, Mohammed ;
Musilek, Petr .
IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (04) :2980-2994
[4]  
[Anonymous], 2017, beta-vae: Learning basic visual concepts with a constrained, P1
[5]   Strong NP-hardness of AC power flows feasibility [J].
Bienstock, Daniel ;
Verma, Abhinav .
OPERATIONS RESEARCH LETTERS, 2019, 47 (06) :494-501
[6]  
Chen JY, 2019, IEEE INT C INTELL TR, P2765, DOI [10.1109/itsc.2019.8917306, 10.1109/ITSC.2019.8917306]
[7]   Reinforcement Learning for Selective Key Applications in Power Systems: Recent Advances and Future Challenges [J].
Chen, Xin ;
Qu, Guannan ;
Tang, Yujie ;
Low, Steven ;
Li, Na .
IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (04) :2935-2958
[8]  
Cobbe K, 2019, PR MACH LEARN RES, V97
[9]  
Dalal G, 2018, Arxiv, DOI arXiv:1801.08757
[10]   Safely Learning to Control the Constrained Linear Quadratic Regulator [J].
Dean, Sarah ;
Tu, Stephen ;
Matni, Nikolai ;
Recht, Benjamin .
2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, :5582-5588