Energy management strategies based on soft actor critic reinforcement learning with a proper reward function design based on battery state of charge constraints

被引:3
作者
Baeumler, Antoine [1 ]
Benterki, Abdelmoudjib [1 ]
Meng, Jianwen [1 ]
Azib, Toufik [1 ]
Boukhnifer, Moussa [2 ]
机构
[1] ESTACA Lab Paris Saclay, ESTACA, F-78180 Montigny Le Bretonneux, France
[2] Univ Lorraine, LCOMS, F-57000 Metz, France
关键词
Energy management strategy; Reward function; Reinforcement learning; Fuel cell hybrid vehicle; VEHICLES;
D O I
10.1016/j.est.2024.111797
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
The environmental concerns have pushed our society to find solutions to reduce the emission of transportation systems. One promising solution is the fuel cell hybrid electric vehicle (FCHEV), with a proton exchange membrane fuel cell (PEMFC) and a lithium-ion battery. The presence of two different energy sources requires a smart energy management strategy (EMS) which must not only optimize energy consumption (conventional approaches), but also mitigate component's degradation (New approaches : Health-conscious strategies). For that, a wide range of techniques has been proposed in the literature, and the most recent techniques are based on machine learning. Reinforcement learning (RL) is usually the most chosen strategy, but as in the optimization techniques, a strong design of the objective function (called reward function in RL) is required. The main challenge in the reward function is to encourage a proper battery state of charge (SOC) management, while reducing the energy consumption and avoiding the factors that accelerate the FC aging. The current advancement in the literature shows satisfactory results, but the reward function design have great improvement potential in the chosen approach. Indeed, most approaches in RL for SOC management with continuous actions are based on the reference SOC principle, which reduces the EMS ability to optimize the other part of the reward function. This paper reveals the challenges associated with the introduction of SOC limits in the reward function and proposes an approach, based on SOC boundaries in the reward function. The contribution allows to better consider the parts to optimize, as it gives more freedom than previous reference SOC techniques all reducing the consumption by 12.9%.
引用
收藏
页数:9
相关论文
共 51 条
[1]  
Andrychowicz Marcin., 2017, HINDSIGHT EXPERIENCE
[2]   Production of greenhouse gas free hydrogen by thermocatalytic decomposition of methane - A review [J].
Ashik, U. P. M. ;
Daud, W. M. A. Wan ;
Abbas, Hazzim F. .
RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2015, 44 :221-256
[3]  
Azib T., 2009, 2009 IEEE Vehicle Power and Propulsion Conference (VPPC), P1858, DOI 10.1109/VPPC.2009.5289678
[4]   Saturation Management of a Controlled Fuel-Cell/Ultracapacitor Hybrid Vehicle [J].
Azib, Toufik ;
Bethoux, Olivier ;
Remy, Ghislain ;
Marchand, Claude .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2011, 60 (09) :4127-4138
[5]   A System-Level Modeling of PEMFC Considering Degradation Aspect towards a Diagnosis Process [J].
Baumler, Antoine ;
Meng, Jianwen ;
Benterki, Abdelmoudjib ;
Azib, Toufik ;
Boukhnifer, Moussa .
ENERGIES, 2023, 16 (14)
[6]  
Bernard J, 2006, 2006 IEEE VEHICLE POWER AND PROPULSION CONFERENCE, P28
[7]   Multi-Criteria Optimal Design for FUEL Cell Hybrid Power Sources [J].
Ceschia, Adriano ;
Azib, Toufik ;
Bethoux, Olivier ;
Alves, Francisco .
ENERGIES, 2022, 15 (09)
[8]   Control System Design of Power Tracking for PEM Fuel Cell Automotive Application [J].
Chen, F. X. ;
Yu, Y. ;
Chen, J. X. .
FUEL CELLS, 2017, 17 (05) :671-681
[9]   Lifetime prediction and the economic lifetime of Proton Exchange Membrane fuel cells [J].
Chen, Huicui ;
Pei, Pucheng ;
Song, Mancun .
APPLIED ENERGY, 2015, 142 :154-163
[10]   Deep reinforcement learning based energy management strategy of fuel cell hybrid railway vehicles considering fuel cell aging [J].
Deng, Kai ;
Liu, Yingxu ;
Hai, Di ;
Peng, Hujun ;
Lowenstein, Lars ;
Pischinger, Stefan ;
Hameyer, Kay .
ENERGY CONVERSION AND MANAGEMENT, 2022, 251