Real-time power optimization based on Q-learning algorithm for direct methanol fuel cell system

被引:1
|
作者
Chi, Xuncheng [1 ]
Chen, Fengxiang [1 ]
Zhai, Shuang [2 ]
Hu, Zhe [2 ]
Zhou, Su [3 ]
Wei, Wei [4 ]
机构
[1] Tongji Univ, Sch Automot Studies, Shanghai, Peoples R China
[2] Shanghai Refire Technol Co Ltd, Shanghai, Peoples R China
[3] Shanghai Zhongqiao Vocat & Tech Univ, Shanghai, Peoples R China
[4] CAS &M Zhangjiagang New Energy Technol Co Ltd, Zhangjiagang, Peoples R China
基金
中国国家自然科学基金;
关键词
Direct methanol fuel cell (DMFC) system; Real-time power optimization; Methanol supply control; Reinforcement learning; Q -learning algorithm; MASS-TRANSPORT MODEL; NUMERICAL-MODEL; PERFORMANCE; DMFC;
D O I
10.1016/j.ijhydene.2024.09.084
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Efficient real-time power optimization of direct methanol fuel cell (DMFC) system is crucial for enhancing its performance and reliability. The power of DMFC system is mainly affected by stack temperature and circulating methanol concentration. However, the methanol concentration cannot be directly measured using reliable sensors, which poses a challenge for the real-time power optimization. To address this issue, this paper investigates the operating mechanism of DMFC system and establishes a system power model. Based on the established model, reinforcement learning using Q-learning algorithm is proposed to control methanol supply to optimize DMFC system power under varying operating conditions. This algorithm is simple, easy to implement, and does not rely on methanol concentration measurements. To validate the effectiveness of the proposed algorithm, simulation comparisons between the proposed method and the traditional perturbation and observation (P&O) algorithm are implemented under different operating conditions. The results show that proposed power optimization based on Q-learning algorithm improves net power by 1% and eliminates the fluctuation of methanol supply caused by P&O. For practical implementation considerations and real-time requirements of the algorithm, hardware-in-the-loop (HIL) experiments are conducted. The experiment results demonstrate that the proposed methods optimize net power under different operating conditions. Additionally, in terms of model accuracy, the experimental results are well matched with the simulation. Moreover, under varying load condition, compared with P&O, proposed power optimization based on Q-learning algorithm reduces root mean square error (RMSE) from 7.271% to 2.996% and mean absolute error (MAE) from 5.036% to 0.331%.
引用
收藏
页码:1241 / 1253
页数:13
相关论文
共 50 条
  • [21] Anisotropic Q-learning and waiting estimation based real-time routing for automated guided vehicles at container terminals
    Zhou, Pengfei
    Lin, Li
    Kim, Kap Hwan
    JOURNAL OF HEURISTICS, 2023, 29 (2-3) : 207 - 228
  • [22] Anisotropic Q-learning and waiting estimation based real-time routing for automated guided vehicles at container terminals
    Pengfei Zhou
    Li Lin
    Kap Hwan Kim
    Journal of Heuristics, 2023, 29 : 207 - 228
  • [23] Discrete-Time Optimal Control Scheme Based on Q-Learning Algorithm
    Wei, Qinglai
    Liu, Derong
    Song, Ruizhuo
    2016 SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2016, : 125 - 130
  • [24] Heat and power management of a direct-methanol-fuel-cell (DMFC) system
    Dohle, H
    Mergel, J
    Stolten, D
    JOURNAL OF POWER SOURCES, 2002, 111 (02) : 268 - 282
  • [25] Neural network-based adaptive control and energy management system of a direct methanol fuel cell in a hybrid renewable power system
    Jienkulsawad, Prathak
    Eamsiri, Kornkamol
    Chen, Yong-Song
    Arpornwichanop, Amornchai
    SUSTAINABLE CITIES AND SOCIETY, 2022, 87
  • [26] Q-learning algorithm based method for enhancing resiliency of integrated energy system
    Wu X.
    Tang Z.
    Xu Q.
    Zhou Y.
    Dianli Zidonghua Shebei/Electric Power Automation Equipment, 2020, 40 (04): : 146 - 152
  • [27] Q-learning based energy management strategy for a hybrid multi-stack fuel cell system considering degradation
    Ghaderi, Razieh
    Kandidayeni, Mohsen
    Boulon, Loic
    Trovao, Joao P.
    ENERGY CONVERSION AND MANAGEMENT, 2023, 293
  • [28] Q-Learning based Maximum Power Extraction for Wind Energy Conversion System With Variable Wind Speed
    Kushwaha, Ashish
    Gopal, Madan
    Singh, Bhim
    IEEE TRANSACTIONS ON ENERGY CONVERSION, 2020, 35 (03) : 1160 - 1170
  • [29] Multi-AGV route planning in automated warehouse system based on shortest-time Q-learning algorithm
    Zhang, Zheng
    Chen, Juan
    Zhao, Wenbing
    ASIAN JOURNAL OF CONTROL, 2024, 26 (02) : 683 - 702
  • [30] Real-Time Optimization Energy Management Strategy for Fuel Cell Hybrid Ships Considering Power Sources Degradation
    Zhang, Zehui
    Guan, Cong
    Liu, Zhiyong
    IEEE ACCESS, 2020, 8 : 87046 - 87059