Distributed multi-step Q(λ) learning for Optimal Power Flow of large-scale power grids

被引：34

作者：

Yu, T. ^{[2
]}

Liu, J. ^{[2
]}

Chan, K. W. ^{[1
]}

Wang, J. J. ^{[2
]}

机构：

[1] Hong Kong Polytech Univ, Hong Kong, Hong Kong, Peoples R China

[2] S China Univ Technol, Guangzhou 510640, Guangdong, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS | 2012年 / 42卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Optimal Power Flow (OPF); Q(lambda) learning; Multi-objective optimization; Distributed Reinforcement Learning (DRL); OPTIMIZATION; ALGORITHMS;

D O I：

10.1016/j.ijepes.2012.04.062

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper presents a novel distributed multi-step Q(lambda) learning algorithm (DQ(lambda)L) based on multi-agent system for solving large-scale multi-objective OPF problem. It does not require any manipulation to the conventional mathematical Optimal Power Flow (OPF) model. Large-scale power system is first partitioned to subsystems and each subsystem is managed by an agent. Each agent adopts the standard multi-step Q(lambda) learning algorithm to pursue its own objectives independently and approaches to the global optimal through cooperation and coordination among agents. The proposed DQ(lambda)L has been thoroughly studied and tested on the IEEE 9-bus and 118-bus systems. Case studies demonstrated that DQ(lambda)L is a feasible and effective for solving multi-objective OPF problem in large-scale complex power grid. (C) 2012 Elsevier Ltd. All rights reserved.

引用

页码：614 / 620

页数：7

共 25 条

[1] Real and reactive power loss allocation in pool-based electricity markets
Alturki, Y. A.
Lo, K. L.
[J]. INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2010, 32 (04) : 262 - 270
[2] DAILY GENERATION SCHEDULING OPTIMIZATION WITH TRANSMISSION CONSTRAINTS - A NEW CLASS OF ALGORITHMS
BATUT, J
RENAUD, A
[J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 1992, 7 (03) : 982 - 989
[3] Cost/worth assessment of reliability improvement in distribution networks by means of artificial intelligence
Bouhouras, Aggelos S.
Labridis, Dimitris P.
Bakirtzis, Anastasios G.
[J]. INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2010, 32 (05) : 530 - 538
[4] Mean field theory for optimal power flow
Chen, LN
Suzuki, H
Katou, K
[J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 1997, 12 (04) : 1481 - 1486
[5] El-Sharkawi M, 2008, MODERN HEURISTIC OPT
[6] Fan Bo, 2008, INT WORKSH ED TECHN, V1, P667
[7] Gerhard W, 1995, ROBOT AUTON SYST, V15, P135
[8] A primal-dual interior point method for optimal power flow dispatching
Jabr, RA
Coonick, AH
Cory, BJ
[J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2002, 17 (03) : 654 - 662
[9] Reinforcement Learning approaches to Economic Dispatch problem
Jasmin, E. A.
Ahamed, T. P. Imthias
Raj, V. P. Jagathy
[J]. INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2011, 33 (04) : 836 - 845
[10] Improved evolutionary programming with dynamic mutation and metropolis criteria for multi-objective reactive power optimisation
Jiang, C
Wang, C
[J]. IEE PROCEEDINGS-GENERATION TRANSMISSION AND DISTRIBUTION, 2005, 152 (02) : 291 - 294

← 1 2 3 →