A Novel Integral Reinforcement Learning-Based Control Method Assisted by Twin Delayed Deep Deterministic Policy Gradient for Solid Oxide Fuel Cell in DC Microgrid

被引：7

作者：

Liu, Yulin ^{[1
]}

Qie, Tianhao ^{[1
]}

Yu, Yang ^{[4
]}

Wang, Yuxuan ^{[1
]}

Chau, Tat Kei ^{[1
]}

Zhang, Xinan ^{[1
]}

Manandhar, Ujjal ^{[2
]}

Li, Sinan ^{[3
]}

Iu, Herbert H. C. ^{[1
]}

Fernando, Tyrone ^{[1
]}

机构：

[1] Univ Western Australia, Sch Engn, Crawley, WA 6009, Australia

[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore

[3] Univ Sydney, Sch Elect & Informat Engn, Sydney 2006, Australia

[4] Halliburton Ltd, Ctr Excellence Adv Control, Singapore 639940, Singapore

来源：

IEEE TRANSACTIONS ON SUSTAINABLE ENERGY | 2023年 / 14卷 / 01期

关键词：

Solid oxide fuel cell; DC microgrid; integral reinforcement learning; hardware-in-the-loop; twin delayed deep deterministic policy gradient; POWER-PLANT; H-INFINITY; MODEL;

D O I：

10.1109/TSTE.2022.3224179

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

This paper proposes a new online integral reinforcement learning (IRL)-based control algorithm for the solid oxide fuel cell (SOFC) to overcome the long-lasting problems of model dependency and sensitivity to offline training dataset in the existing SOFC control approaches. The proposed method automatically updates the optimal control gains through the online neural network training. Unlike the other online learning-based control methods that rely on the assumption of initial stabilizing control or trial-and-error based initial control policy search, the proposed method employs the offline twin delayed deep deterministic policy gradient (TD3) algorithm to systematically determine the initial stabilizing control policy. Compared to the conventional IRL-based control, the proposed method contributes to greatly reduce the computational burden without compromising the control performance. The excellent performance of the proposed method is verified by hardware-in-the-loop experiments.

引用

页码：688 / 703

页数：16

共 27 条

[1] Voltage control of solid oxide fuel cell power plant based on intelligent proportional integral-adaptive sliding mode control with anti-windup compensator
Abbaker, A. M. Omer
Wang, Haoping
Tian, Yang
[J]. TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2020, 42 (01) : 116 - 130
[2] Policy iterations on the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation
Abu-Khalaf, Murad
Lewis, Frank L.
Huang, Jie
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2006, 51 (12) : 1989 - 1995
[3] Deep Reinforcement Learning A brief survey
Arulkumaran, Kai
Deisenroth, Marc Peter
Brundage, Miles
Bharath, Anil Anthony
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
[4] Design and Implementation of the Off-Line Robust Model Predictive Control for Solid Oxide Fuel Cells
Chatrattanawet, Narissara
Kheawhom, Soorathep
Chen, Yong-Song
Arpornwichanop, Amornchai
[J]. PROCESSES, 2019, 7 (12)
[5] Optimal control strategy for solid oxide fuel cell-based hybrid energy system using deep reinforcement learning
Chen, Tao
Gao, Ciwei
Song, Yutong
[J]. IET RENEWABLE POWER GENERATION, 2022, 16 (05) : 912 - 921
[6] Adaptive Neural Network-Based Control of a Hybrid AC/DC Microgrid
Chettibi, Nadjwa
Mellit, Adel
Sulligoi, Giorgio
Pavan, Alessandro Massi
[J]. IEEE TRANSACTIONS ON SMART GRID, 2018, 9 (03) : 1667 - 1679
[7] Designing Hydrogen and Oxygen Flow Rate Control on a Solid Oxide Fuel Cell Simulator Using the Fuzzy Logic Control Method
Darjat
Sulistyo
Triwiyatno, Aris
Sudjadi
Kurniahadi, Andra
[J]. PROCESSES, 2020, 8 (02)
[8] Fujimoto S, 2018, PR MACH LEARN RES, V80
[9] Adaptive Optimal Control for a Class of Nonlinear Systems: The Online Policy Iteration Approach
He, Shuping
Fang, Haiyang
Zhang, Maoguang
Liu, Fei
Ding, Zhengtao
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) : 549 - 558
[10] A novel data-driven controller for solid oxide fuel cell via deep reinforcement learning
Li, Jiawen
Yu, Tao
[J]. JOURNAL OF CLEANER PRODUCTION, 2021, 321

← 1 2 3 →