Optimal adaptive control for solid oxide fuel cell with operating constraints via large-scale deep reinforcement learning

被引:9
作者
Li, Jiawen [1 ]
Yu, Tao [1 ]
机构
[1] South China Univ Technol, Coll Elect Power, Guangzhou 510640, Peoples R China
基金
中国国家自然科学基金;
关键词
Large-scale agent deep reinforcement learning; Fittest survival strategy large-scale twin; delayed deep deterministic policy gradient; (FSSL-TD3); Solid oxide fuel cell; Fuel flow; Fuel utilization; PREDICTIVE CONTROL; GENERATION CONTROL; CONTROL STRATEGY; MODEL;
D O I
10.1016/j.conengprac.2021.104951
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Since a solid oxide fuel cell (SOFC) is a complicated nonlinear, time-varying and constrained system, it is difficult to control the fuel flow to stabilize the output voltage while considering fuel utilization operating constraints. To overcome this problem, an adaptive fractional-order proportional integral derivative (FOPID) controller, taking advantage of the adaptability and model-free features of large-scale deep reinforcement learning, is proposed in this paper. Furthermore, a fittest survival strategy large-scale twin delayed deep deterministic policy gradient (FSSL-TD3) algorithm is designed as the tuner of this controller. In this algorithm, the exploration efficacy is improved by way of the fittest survival strategy and imitation learning. Other techniques are also applied to this algorithm in order to improve the robustness of FOPID controller. In addition, by formulating the reward function of the FSSL-TD3 algorithm, the fuel utilization of the SOFC can always be kept in a safe range, which is not possible for conventional control algorithms. The simulation results in this paper show that the output voltage of SOFCs can be controlled effectively by this controller while fuel utilization is retained within a reasonable range.
引用
收藏
页数:14
相关论文
共 37 条
  • [1] Fractional Order Fuzzy PID Control of Automotive PEM Fuel Cell Air Feed System Using Neural Network Optimization Algorithm
    AbouOmar, Mahmoud S.
    Zhang, Hua-Jun
    Su, Yi-Xin
    [J]. ENERGIES, 2019, 12 (08)
  • [2] Maximum power point tracking of a proton exchange membrane fuel cell system using PSO-PID controller
    Ahmadi, S.
    Abdi, Sh.
    Kakavand, M.
    [J]. INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2017, 42 (32) : 20430 - 20443
  • [3] Distributed generation system control strategies with PV and fuel cell in microgrid operation
    Bai, Wenlei
    Abedi, M. Reza
    Lee, Kwang Y.
    [J]. CONTROL ENGINEERING PRACTICE, 2016, 53 : 184 - 193
  • [4] Bavarian M, 2013, P AMER CONTR CONF, P5356
  • [5] Thermal Management-Oriented Multivariable Robust Control of a kW-Scale Solid Oxide Fuel Cell Stand-Alone System
    Cao, Hongliang
    Li, Xi
    [J]. IEEE TRANSACTIONS ON ENERGY CONVERSION, 2016, 31 (02) : 603 - 612
  • [6] Deep-Reinforcement-Learning-Based Autonomous Voltage Control for Power Grid Operations
    Duan, Jiajun
    Shi, Di
    Diao, Ruisheng
    Li, Haifeng
    Wang, Zhiwei
    Zhang, Bei
    Bian, Desong
    Yi, Zhehan
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (01) : 814 - 817
  • [7] Fujimoto S, 2018, PR MACH LEARN RES, V80
  • [8] Theory and application of a novel fuzzy PID controller using a simplified Takagi-Sugeno rule scheme
    Hao, Y
    [J]. INFORMATION SCIENCES, 2000, 123 (3-4) : 281 - 293
  • [9] Horalek R, 2015, 2015 IEEE INTERNATIONAL WORKSHOP OF ELECTRONICS, CONTROL, MEASUREMENT, SIGNALS AND THEIR APPLICATION TO MECHATRONICS (ECMSM)
  • [10] Horgan D, 2018, 6 INT C LEARNING REP