Applying reinforcement learning and tree search to the unit commitment problem

被引:39
作者
de Mars, Patrick [1 ]
O'Sullivan, Aidan [1 ]
机构
[1] UCL Energy Inst, London, England
基金
英国工程与自然科学研究理事会;
关键词
Unit commitment; Reinforcement learning; Tree search; Deep learning; Power systems; OPTIMIZATION; ALGORITHM;
D O I
10.1016/j.apenergy.2021.117519
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Recent advances in artificial intelligence have demonstrated the capability of reinforcement learning (RL) methods to outperform the state of the art in decision-making problems under uncertainty. Day-ahead unit commitment (UC), scheduling power generation based on forecasts, is a complex power systems task that is becoming more challenging in light of increasing uncertainty. While RL is a promising framework for solving the UC problem, the space of possible actions from a given state is exponential in the number of generators and it is infeasible to apply existing RL methods in power systems larger than a few generators. Here we present a novel RL algorithm, guided tree search, which does not suffer from an exponential explosion in the action space with increasing number of generators. The method augments a tree search algorithm with a policy that intelligently reduces the branching factor. Using data from the GB power system, we demonstrate that guided tree search outperforms an unguided method in terms of computational complexity, while producing solutions that show no performance loss in terms of operating costs. We compare solutions against mixed-integer linear programming (MILP) and find that guided tree search outperforms a solution using reserve constraints, the current industry approach. The RL solutions exhibit complex behaviours that differ qualitatively from MILP, demonstrating its potential as a decision support tool for human operators.
引用
收藏
页数:9
相关论文
共 25 条
[1]  
[Anonymous], 2013, Power Generation, Operation and Control
[2]   Adaptive Robust Optimization for the Security Constrained Unit Commitment Problem [J].
Bertsimas, Dimitris ;
Litvinov, Eugene ;
Sun, Xu Andy ;
Zhao, Jinye ;
Zheng, Tongxin .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2013, 28 (01) :52-63
[3]   A computationally efficient mixed-integer linear formulation for the thermal unit commitment problem [J].
Carrion, Miguel ;
Arroyo, Jose M. .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2006, 21 (03) :1371-1378
[4]  
Dalal G., 2016, INT C MACH LEARN, P2197
[5]  
Dalal G, 2015, 2015 IEEE EINDHOVEN POWERTECH
[6]  
Henderson P, 2018, AAAI CONF ARTIF INTE, P3207
[7]   Using Standard Deviation as a Measure of Increased Operational Reserve Requirement for Wind Power [J].
Holttinen, Hannele ;
Milligan, Michael ;
Kirby, Brendan ;
Acker, Tom ;
Neimane, Viktoria ;
Molinski, Tom .
WIND ENGINEERING, 2008, 32 (04) :355-377
[8]  
Jasmin E, 2016, 2016 IEEE INT C POW, P1, DOI [10.1109/PEDES.2016.7914428., DOI 10.1109/PEDES.2016.7914428]
[9]  
Jasmin E. A., 2009, Proceedings of the 2009 International Conference on Advances in Computing, Control, & Telecommunication Technologies (ACT 2009), P324, DOI 10.1109/ACT.2009.87
[10]   A genetic algorithm solution to the unit commitment problem [J].
Kazarlis, SA ;
Bakirtzis, AG ;
Petridis, V .
IEEE TRANSACTIONS ON POWER SYSTEMS, 1996, 11 (01) :83-90