Real-time power optimization based on Q-learning algorithm for direct methanol fuel cell system

被引：1

作者：

Chi, Xuncheng ^{[1
]}

Chen, Fengxiang ^{[1
]}

Zhai, Shuang ^{[2
]}

Hu, Zhe ^{[2
]}

Zhou, Su ^{[3
]}

Wei, Wei ^{[4
]}

机构：

[1] Tongji Univ, Sch Automot Studies, Shanghai, Peoples R China

[2] Shanghai Refire Technol Co Ltd, Shanghai, Peoples R China

[3] Shanghai Zhongqiao Vocat & Tech Univ, Shanghai, Peoples R China

[4] CAS &M Zhangjiagang New Energy Technol Co Ltd, Zhangjiagang, Peoples R China

来源：

INTERNATIONAL JOURNAL OF HYDROGEN ENERGY | 2024年 / 89卷

基金：

中国国家自然科学基金;

关键词：

Direct methanol fuel cell (DMFC) system; Real-time power optimization; Methanol supply control; Reinforcement learning; Q -learning algorithm; MASS-TRANSPORT MODEL; NUMERICAL-MODEL; PERFORMANCE; DMFC;

D O I：

10.1016/j.ijhydene.2024.09.084

中图分类号：

O64 [物理化学（理论化学）、化学物理学];

学科分类号：

070304 ; 081704 ;

摘要：

Efficient real-time power optimization of direct methanol fuel cell (DMFC) system is crucial for enhancing its performance and reliability. The power of DMFC system is mainly affected by stack temperature and circulating methanol concentration. However, the methanol concentration cannot be directly measured using reliable sensors, which poses a challenge for the real-time power optimization. To address this issue, this paper investigates the operating mechanism of DMFC system and establishes a system power model. Based on the established model, reinforcement learning using Q-learning algorithm is proposed to control methanol supply to optimize DMFC system power under varying operating conditions. This algorithm is simple, easy to implement, and does not rely on methanol concentration measurements. To validate the effectiveness of the proposed algorithm, simulation comparisons between the proposed method and the traditional perturbation and observation (P&O) algorithm are implemented under different operating conditions. The results show that proposed power optimization based on Q-learning algorithm improves net power by 1% and eliminates the fluctuation of methanol supply caused by P&O. For practical implementation considerations and real-time requirements of the algorithm, hardware-in-the-loop (HIL) experiments are conducted. The experiment results demonstrate that the proposed methods optimize net power under different operating conditions. Additionally, in terms of model accuracy, the experimental results are well matched with the simulation. Moreover, under varying load condition, compared with P&O, proposed power optimization based on Q-learning algorithm reduces root mean square error (RMSE) from 7.271% to 2.996% and mean absolute error (MAE) from 5.036% to 0.331%.

引用

页码：1241 / 1253

页数：13

共 50 条

[21] Anisotropic Q-learning and waiting estimation based real-time routing for automated guided vehicles at container terminals
Zhou, Pengfei
Lin, Li
Kim, Kap Hwan
JOURNAL OF HEURISTICS, 2023, 29 (2-3) : 207 - 228
[22] Anisotropic Q-learning and waiting estimation based real-time routing for automated guided vehicles at container terminals
Pengfei Zhou
Li Lin
Kap Hwan Kim
Journal of Heuristics, 2023, 29 : 207 - 228
[23] Discrete-Time Optimal Control Scheme Based on Q-Learning Algorithm
Wei, Qinglai
Liu, Derong
Song, Ruizhuo
2016 SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2016, : 125 - 130
[24] Heat and power management of a direct-methanol-fuel-cell (DMFC) system
Dohle, H
Mergel, J
Stolten, D
JOURNAL OF POWER SOURCES, 2002, 111 (02) : 268 - 282
[25] Neural network-based adaptive control and energy management system of a direct methanol fuel cell in a hybrid renewable power system
Jienkulsawad, Prathak
Eamsiri, Kornkamol
Chen, Yong-Song
Arpornwichanop, Amornchai
SUSTAINABLE CITIES AND SOCIETY, 2022, 87
[26] Q-learning algorithm based method for enhancing resiliency of integrated energy system
Wu X.
Tang Z.
Xu Q.
Zhou Y.
Dianli Zidonghua Shebei/Electric Power Automation Equipment, 2020, 40 (04): : 146 - 152
[27] Q-learning based energy management strategy for a hybrid multi-stack fuel cell system considering degradation
Ghaderi, Razieh
Kandidayeni, Mohsen
Boulon, Loic
Trovao, Joao P.
ENERGY CONVERSION AND MANAGEMENT, 2023, 293
[28] Q-Learning based Maximum Power Extraction for Wind Energy Conversion System With Variable Wind Speed
Kushwaha, Ashish
Gopal, Madan
Singh, Bhim
IEEE TRANSACTIONS ON ENERGY CONVERSION, 2020, 35 (03) : 1160 - 1170
[29] Multi-AGV route planning in automated warehouse system based on shortest-time Q-learning algorithm
Zhang, Zheng
Chen, Juan
Zhao, Wenbing
ASIAN JOURNAL OF CONTROL, 2024, 26 (02) : 683 - 702
[30] Real-Time Optimization Energy Management Strategy for Fuel Cell Hybrid Ships Considering Power Sources Degradation
Zhang, Zehui
Guan, Cong
Liu, Zhiyong
IEEE ACCESS, 2020, 8 : 87046 - 87059

← 1 2 3 4 5 →