Reinforcement learning and A* search for the unit commitment problem

被引:13
作者
de Mars, Patrick [1 ]
OSullivan, Aidan [1 ]
机构
[1] UCL Energy Inst, London, England
基金
英国工程与自然科学研究理事会;
关键词
Unit commitment; Reinforcement learning; Tree search; Power systems; SHORTEST PATHS; OPTIMIZATION; ALGORITHM; SHOGI; CHESS; GO;
D O I
10.1016/j.egyai.2022.100179
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous research has combined model-free reinforcement learning with model-based tree search methods to solve the unit commitment problem with stochastic demand and renewables generation. This approach was limited to shallow search depths and suffered from significant variability in run time across problem instances with varying complexity. To mitigate these issues, we extend this methodology to more advanced search algorithms based on A* search. First, we develop a problem-specific heuristic based on priority list unit commitment methods and apply this in Guided A* search, reducing run time by up to 94% with negligible impact on operating costs. In addition, we address the run time variability issue by employing a novel anytime algorithm, Guided IDA*, replacing the fixed search depth parameter with a time budget constraint. We show that Guided IDA* mitigates the run time variability of previous guided tree search algorithms and enables further operating cost reductions of up to 1%.
引用
收藏
页数:10
相关论文
共 54 条
[1]  
About Us, National Grid Demand Data
[2]  
Balancing Mechanism Reporting Service, About Us
[3]  
Baldwin C.J., 1959, Transactions of the American Institute of Electrical Engineers. Part III: Power Apparatus and Systems, V78, P1272
[4]   Adaptive Robust Optimization for the Security Constrained Unit Commitment Problem [J].
Bertsimas, Dimitris ;
Litvinov, Eugene ;
Sun, Xu Andy ;
Zhao, Jinye ;
Zheng, Tongxin .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2013, 28 (01) :52-63
[5]   Stochastic security for operations planning with significant wind power generation [J].
Bouffard, Francois ;
Galiana, Francisco D. .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2008, 23 (02) :306-316
[6]   A computationally efficient mixed-integer linear formulation for the thermal unit commitment problem [J].
Carrion, Miguel ;
Arroyo, Jose M. .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2006, 21 (03) :1371-1378
[7]   Unit commitment strategy of thermal generators by using advanced fuzzy controlled binary particle swarm optimization algorithm [J].
Chakraborty, Shantanu ;
Ito, Takayuki ;
Senjyu, Tomonobu ;
Saber, Ahmed Yousuf .
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2012, 43 (01) :1072-1080
[8]  
Dalal G, 2016, PR MACH LEARN RES, V48
[9]  
Dalal G, 2015, 2015 IEEE EINDHOVEN POWERTECH
[10]   Applying reinforcement learning and tree search to the unit commitment problem [J].
de Mars, Patrick ;
O'Sullivan, Aidan .
APPLIED ENERGY, 2021, 302