Approximate Value Iteration in the Reinforcement Learning Context. Application to Electrical Power System Control

Cited by: 21

Authors:

Ernst, Damien [1]

Glavic, Mevludin [1]

Geurts, Pierre [1]

Wehenkel, Louis [1]

Affiliations:

[1] Univ Liege, Elect Engn & Comp Sci Dept, Sart Tilman B28, B-4000 Liege, Belgium

Source:

INTERNATIONAL JOURNAL OF EMERGING ELECTRIC POWER SYSTEMS | 2005 / Vol. 3 / Issue 01

Keywords:

reinforcement learning; power system control; electrical power oscillations damping; TCSC control; approximate value iteration;

DOI:

10.2202/1553-779X.1066

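Chinese Library Classification:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes: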

0808 ; 0809 ;

Abstract:

In this paper we explain how to design intelligent agents that process the information acquired from interaction with a system to learn a good control policy, and we show how the methodology can be applied to control devices intended to damp electrical power oscillations. The control problem is formalized as a discrete-time optimal control problem, and the information acquired from interaction with the system is a set of samples, where each sample is composed of four elements: a state, the action taken while in this state, the instantaneous reward observed, and the successor state of the system. To process this information we consider reinforcement learning algorithms that determine an approximation of the so-called Q-function by mimicking the behavior of the value iteration algorithm. Simulations are first carried out on a benchmark power system modeled with two state variables. We then present a more complex case study on a four-machine power system, where the reinforcement learning algorithm controls a Thyristor Controlled Series Capacitor (TCSC) intended to damp power system oscillations.

Pages: 1-35

Page count: 36
