Virtual-Action-Based Coordinated Reinforcement Learning for Distributed Economic Dispatch

Cited by: 33
Authors
Li, Dewen [1]
Yu, Liying [1]
Li, Ning [1]
Lewis, Frank [2]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
[2] Univ Texas Arlington, UTA Res Inst, Ft Worth, TX 76118 USA
Funding
National Natural Science Foundation of China
Keywords
Generators; Heuristic algorithms; Power system dynamics; Cost function; Wind power generation; Upper bound; Research and development; Distributed reinforcement learning; economic dispatch; multi-agent system; singularly perturbed system; ALGORITHM;
DOI
10.1109/TPWRS.2021.3070161
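Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract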
A unified distributed reinforcement learning (RL) solution is offered for both static and dynamic economic dispatch problems (EDPs). Each agent is assigned a fixed, discrete, virtual action set, and a projection method generates feasible actual actions that satisfy the constraints. A distributed algorithm based on a singularly perturbed system solves the projection problem. A distributed form of Hysteretic Q-learning achieves coordination among agents: the Q-values are built on the virtual actions, while the rewards are produced by the projected actual actions. The proposed algorithm handles continuous action spaces and power loads without using function approximation. Theoretical analysis and comparative simulation studies verify the algorithm's convergence and optimality.
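The interplay between virtual actions, projection, and Hysteretic Q-learning described in the abstract can be illustrated with a rough sketch. Everything below is an assumption made for illustration and is not taken from the paper: the three-unit system, the quadratic cost coefficients, the shared reward, the learning rates, and the function names are hypothetical. In particular, the projection here is a plain centralized bisection on a Lagrange multiplier, swapped in for the paper's distributed singularly-perturbed-system algorithm, and the Q-update is the stateless form of Hysteretic Q-learning, which would cover only a static EDP.

```python
import numpy as np


def project_to_balance(v, p_min, p_max, demand, tol=1e-9):
    """Euclidean projection of v onto {p : sum(p) = demand, p_min <= p <= p_max}.

    Centralized bisection on the multiplier of the balance constraint
    (an illustrative stand-in, not the paper's distributed method).
    """
    v, p_min, p_max = (np.asarray(x, dtype=float) for x in (v, p_min, p_max))
    if not p_min.sum() <= demand <= p_max.sum():
        raise ValueError("demand is outside the total generation limits")
    lo = np.min(p_min - v)  # shift at which every unit sits at its lower limit
    hi = np.max(p_max - v)  # shift at which every unit sits at its upper limit
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if np.clip(v + mid, p_min, p_max).sum() < demand:
            lo = mid
        else:
            hi = mid
    return np.clip(v + 0.5 * (lo + hi), p_min, p_max)


def hysteretic_update(q, action, reward, alpha=0.1, beta=0.01):
    """Stateless hysteretic step: rate alpha for non-negative errors, beta otherwise."""
    delta = reward - q[action]
    q[action] += (alpha if delta >= 0 else beta) * delta
    return q


def run_static_dispatch(episodes=2000, eps=0.1, seed=0):
    """Toy static dispatch with hypothetical costs c_i(p) = a_i * p**2 + b_i * p."""
    rng = np.random.default_rng(seed)
    a = np.array([0.010, 0.020, 0.015])
    b = np.array([2.0, 1.5, 1.8])
    p_min, p_max, demand = np.zeros(3), np.full(3, 100.0), 150.0
    virtual_actions = np.linspace(0.0, 100.0, 11)  # fixed discrete set per agent
    q = np.zeros((3, virtual_actions.size))        # one Q-table row per agent
    for _ in range(episodes):
        greedy = q.argmax(axis=1)
        explore = rng.random(3) < eps
        random_pick = rng.integers(0, virtual_actions.size, 3)
        choice = np.where(explore, random_pick, greedy)
        # Virtual actions index the Q-values; projected actions produce the reward.
        p = project_to_balance(virtual_actions[choice], p_min, p_max, demand)
        reward = -float(np.sum(a * p**2 + b * p))
        for i in range(3):
            hysteretic_update(q[i], choice[i], reward)
    best = virtual_actions[q.argmax(axis=1)]
    return project_to_balance(best, p_min, p_max, demand)
```

Because the projection maps any combination of discrete virtual actions to a feasible continuous dispatch, the agents in this sketch can reach continuous power outputs while keeping small tabular Q-values, which is the idea the abstract attributes to the virtual-action design.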
Pages: 5143-5152
Page count: 10