A Novel Integral Reinforcement Learning-Based Control Method Assisted by Twin Delayed Deep Deterministic Policy Gradient for Solid Oxide Fuel Cell in DC Microgrid

被引:7
作者
Liu, Yulin [1 ]
Qie, Tianhao [1 ]
Yu, Yang [4 ]
Wang, Yuxuan [1 ]
Chau, Tat Kei [1 ]
Zhang, Xinan [1 ]
Manandhar, Ujjal [2 ]
Li, Sinan [3 ]
Iu, Herbert H. C. [1 ]
Fernando, Tyrone [1 ]
机构
[1] Univ Western Australia, Sch Engn, Crawley, WA 6009, Australia
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[3] Univ Sydney, Sch Elect & Informat Engn, Sydney 2006, Australia
[4] Halliburton Ltd, Ctr Excellence Adv Control, Singapore 639940, Singapore
关键词
Solid oxide fuel cell; DC microgrid; integral reinforcement learning; hardware-in-the-loop; twin delayed deep deterministic policy gradient; POWER-PLANT; H-INFINITY; MODEL;
D O I
10.1109/TSTE.2022.3224179
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper proposes a new online integral reinforcement learning (IRL)-based control algorithm for the solid oxide fuel cell (SOFC) to overcome the long-lasting problems of model dependency and sensitivity to offline training dataset in the existing SOFC control approaches. The proposed method automatically updates the optimal control gains through the online neural network training. Unlike the other online learning-based control methods that rely on the assumption of initial stabilizing control or trial-and-error based initial control policy search, the proposed method employs the offline twin delayed deep deterministic policy gradient (TD3) algorithm to systematically determine the initial stabilizing control policy. Compared to the conventional IRL-based control, the proposed method contributes to greatly reduce the computational burden without compromising the control performance. The excellent performance of the proposed method is verified by hardware-in-the-loop experiments.
引用
收藏
页码:688 / 703
页数:16
相关论文
共 27 条
  • [1] Voltage control of solid oxide fuel cell power plant based on intelligent proportional integral-adaptive sliding mode control with anti-windup compensator
    Abbaker, A. M. Omer
    Wang, Haoping
    Tian, Yang
    [J]. TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2020, 42 (01) : 116 - 130
  • [2] Policy iterations on the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation
    Abu-Khalaf, Murad
    Lewis, Frank L.
    Huang, Jie
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2006, 51 (12) : 1989 - 1995
  • [3] Deep Reinforcement Learning A brief survey
    Arulkumaran, Kai
    Deisenroth, Marc Peter
    Brundage, Miles
    Bharath, Anil Anthony
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
  • [4] Design and Implementation of the Off-Line Robust Model Predictive Control for Solid Oxide Fuel Cells
    Chatrattanawet, Narissara
    Kheawhom, Soorathep
    Chen, Yong-Song
    Arpornwichanop, Amornchai
    [J]. PROCESSES, 2019, 7 (12)
  • [5] Optimal control strategy for solid oxide fuel cell-based hybrid energy system using deep reinforcement learning
    Chen, Tao
    Gao, Ciwei
    Song, Yutong
    [J]. IET RENEWABLE POWER GENERATION, 2022, 16 (05) : 912 - 921
  • [6] Adaptive Neural Network-Based Control of a Hybrid AC/DC Microgrid
    Chettibi, Nadjwa
    Mellit, Adel
    Sulligoi, Giorgio
    Pavan, Alessandro Massi
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2018, 9 (03) : 1667 - 1679
  • [7] Designing Hydrogen and Oxygen Flow Rate Control on a Solid Oxide Fuel Cell Simulator Using the Fuzzy Logic Control Method
    Darjat
    Sulistyo
    Triwiyatno, Aris
    Sudjadi
    Kurniahadi, Andra
    [J]. PROCESSES, 2020, 8 (02)
  • [8] Fujimoto S, 2018, PR MACH LEARN RES, V80
  • [9] Adaptive Optimal Control for a Class of Nonlinear Systems: The Online Policy Iteration Approach
    He, Shuping
    Fang, Haiyang
    Zhang, Maoguang
    Liu, Fei
    Ding, Zhengtao
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) : 549 - 558
  • [10] A novel data-driven controller for solid oxide fuel cell via deep reinforcement learning
    Li, Jiawen
    Yu, Tao
    [J]. JOURNAL OF CLEANER PRODUCTION, 2021, 321