Optimal adaptive control for solid oxide fuel cell with operating constraints via large-scale deep reinforcement learning

被引：9

作者：

Li, Jiawen ^{[1
]}

Yu, Tao ^{[1
]}

机构：

[1] South China Univ Technol, Coll Elect Power, Guangzhou 510640, Peoples R China

来源：

CONTROL ENGINEERING PRACTICE | 2021年 / 117卷

基金：

中国国家自然科学基金;

关键词：

Large-scale agent deep reinforcement learning; Fittest survival strategy large-scale twin; delayed deep deterministic policy gradient; (FSSL-TD3); Solid oxide fuel cell; Fuel flow; Fuel utilization; PREDICTIVE CONTROL; GENERATION CONTROL; CONTROL STRATEGY; MODEL;

D O I：

10.1016/j.conengprac.2021.104951

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Since a solid oxide fuel cell (SOFC) is a complicated nonlinear, time-varying and constrained system, it is difficult to control the fuel flow to stabilize the output voltage while considering fuel utilization operating constraints. To overcome this problem, an adaptive fractional-order proportional integral derivative (FOPID) controller, taking advantage of the adaptability and model-free features of large-scale deep reinforcement learning, is proposed in this paper. Furthermore, a fittest survival strategy large-scale twin delayed deep deterministic policy gradient (FSSL-TD3) algorithm is designed as the tuner of this controller. In this algorithm, the exploration efficacy is improved by way of the fittest survival strategy and imitation learning. Other techniques are also applied to this algorithm in order to improve the robustness of FOPID controller. In addition, by formulating the reward function of the FSSL-TD3 algorithm, the fuel utilization of the SOFC can always be kept in a safe range, which is not possible for conventional control algorithms. The simulation results in this paper show that the output voltage of SOFCs can be controlled effectively by this controller while fuel utilization is retained within a reasonable range.

引用

页数：14

共 37 条

[1] Fractional Order Fuzzy PID Control of Automotive PEM Fuel Cell Air Feed System Using Neural Network Optimization Algorithm
AbouOmar, Mahmoud S.
Zhang, Hua-Jun
Su, Yi-Xin
[J]. ENERGIES, 2019, 12 (08)
[2] Maximum power point tracking of a proton exchange membrane fuel cell system using PSO-PID controller
Ahmadi, S.
Abdi, Sh.
Kakavand, M.
[J]. INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2017, 42 (32) : 20430 - 20443
[3] Distributed generation system control strategies with PV and fuel cell in microgrid operation
Bai, Wenlei
Abedi, M. Reza
Lee, Kwang Y.
[J]. CONTROL ENGINEERING PRACTICE, 2016, 53 : 184 - 193
[4] Bavarian M, 2013, P AMER CONTR CONF, P5356
[5] Thermal Management-Oriented Multivariable Robust Control of a kW-Scale Solid Oxide Fuel Cell Stand-Alone System
Cao, Hongliang
Li, Xi
[J]. IEEE TRANSACTIONS ON ENERGY CONVERSION, 2016, 31 (02) : 603 - 612
[6] Deep-Reinforcement-Learning-Based Autonomous Voltage Control for Power Grid Operations
Duan, Jiajun
Shi, Di
Diao, Ruisheng
Li, Haifeng
Wang, Zhiwei
Zhang, Bei
Bian, Desong
Yi, Zhehan
[J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (01) : 814 - 817
[7] Fujimoto S, 2018, PR MACH LEARN RES, V80
[8] Theory and application of a novel fuzzy PID controller using a simplified Takagi-Sugeno rule scheme
Hao, Y
[J]. INFORMATION SCIENCES, 2000, 123 (3-4) : 281 - 293
[9] Horalek R, 2015, 2015 IEEE INTERNATIONAL WORKSHOP OF ELECTRONICS, CONTROL, MEASUREMENT, SIGNALS AND THEIR APPLICATION TO MECHATRONICS (ECMSM)
[10] Horgan D, 2018, 6 INT C LEARNING REP

← 1 2 3 4 →