An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems

Cited by: 3

Authors:

Qin, Jingtao [1]

Gao, Yuanqi [1]

Bragin, Mikhail [1]

Yu, Nanpeng [1]

Affiliations:

[1] Univ Calif Riverside, Dept Elect & Comp Engn, Riverside, CA 92521 USA

Source:

IEEE ACCESS | 2023 / Vol. 11

Keywords:

Optimization; Costs; Machine learning algorithms; Heuristic algorithms; Ions; Deep learning; Uncertainty; Reinforcement learning; Deep reinforcement learning; multi-step return; optimization methods; unit commitment; FORMULATION;

DOI:

10.1109/ACCESS.2023.3313998

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Unit commitment (UC) is a fundamental problem in the day-ahead electricity market, and it is critical to solve UC problems efficiently. Mathematical optimization techniques like dynamic programming, Lagrangian relaxation, and mixed-integer quadratic programming (MIQP) are commonly adopted for UC problems. However, the calculation time of these methods increases at an exponential rate with the number of generators and energy resources, which is still the main bottleneck in the industry. Recent advances in artificial intelligence have demonstrated the capability of reinforcement learning (RL) to solve UC problems. Unfortunately, the existing research on solving UC problems with RL suffers from the curse of dimensionality when the size of UC problems grows. To deal with these problems, we propose an optimization method-assisted ensemble deep reinforcement learning algorithm, where UC problems are formulated as a Markov Decision Process (MDP) and solved by multi-step deep Q-learning in an ensemble framework. The proposed algorithm establishes a candidate action set by solving tailored optimization problems to ensure relatively high performance and the satisfaction of operational constraints. Numerical studies on three test systems show that our algorithm outperforms the baseline RL algorithm in terms of computation efficiency and operation cost. By employing the output of our proposed algorithm as a warm start, the MIQP technique can achieve further reductions in operational costs. Furthermore, the proposed algorithm shows strong generalization capacity under unforeseen operational conditions.
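The abstract mentions multi-step deep Q-learning over a candidate action set. As a rough illustration only, the sketch below computes an n-step bootstrapped target in which the maximization is restricted to a set of candidate actions. The function name `n_step_target`, its arguments, the discount value, and the use of negative operating cost as the reward are all assumptions made for this example; they are not taken from the paper.

```python
from typing import Sequence


def n_step_target(
    rewards: Sequence[float],
    next_q_values: Sequence[float],
    candidate_actions: Sequence[int],
    gamma: float = 0.99,
) -> float:
    """Hypothetical n-step Q-learning target.

    rewards: the n rewards observed along the trajectory (assumed here to be
        negative operating costs, one per step).
    next_q_values: Q-value estimates for every action in the state reached
        after n steps.
    candidate_actions: indices of the actions allowed in that state (assumed
        to come from some external feasibility filter); empty means terminal.
    """
    # Bootstrap only from candidate actions; a terminal state contributes 0.
    if candidate_actions:
        target = max(next_q_values[a] for a in candidate_actions)
    else:
        target = 0.0
    # Fold rewards in from last to first: G = r_0 + gamma * (r_1 + gamma * ...).
    for reward in reversed(rewards):
        target = reward + gamma * target
    return target
```

Restricting the maximum to candidate actions means that an action excluded from the set never influences the target, however large its estimated Q-value.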


Pages: 100125-100136

Page count: 12

