Deep Reinforcement Learning-Based Energy Management for a Series Hybrid Electric Vehicle Enabled by History Cumulative Trip Information

被引:141
作者
Li, Yuecheng [1 ]
He, Hongwen [1 ]
Peng, Jiankun [1 ]
Wang, Hong [2 ]
机构
[1] Beijing Inst Technol, Natl Engn Lab Elect Vehicles, Sch Mech Engn, Beijing 100081, Peoples R China
[2] Univ Waterloo, Dept Mech & Mechatron Engn, Waterloo, ON N2L 3G1, Canada
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Deep reinforcement learning; dynamic programming; energy management; generalization; model predictive control; optimality; MODEL-PREDICTIVE CONTROL; POWER MANAGEMENT; RECENT PROGRESS; STRATEGIES; NETWORK; SYSTEMS; HEVS; BUS;
D O I
10.1109/TVT.2019.2926472
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
It is essential to develop proper energy management strategies (EMSs) with broad adaptability for hybrid electric vehicles (HEVs). This paper utilizes deep reinforcement learning (DRL) to develop EMSs for a series HEV due to DRL's advantages of requiring no future driving information in derivation and good generalization in solving energy management problem formulated as a Markov decision process. History cumulative trip information is also integrated for effective state of charge guidance in DRL-based EMSs. The proposed method is systematically introduced from offline training to online applications; its learning ability, optimality, and generalization are validated by comparisons with fuel economy benchmark optimized by dynamic programming, and real-time EMSs based on model predictive control (MPC). Simulation results indicate that without a priori knowledge of future trip, original DRL-based EMS achieves an average 3.5% gap from benchmark, superior to MPC-based EMS with accurate prediction; after further applying output frequency adjustment, a mean gap of 8.7%, which is comparable with MPC-based EMS with mean prediction error of 1 m/s, is maintained with concurrently noteworthy improvement in reducing engine start times. Besides, its impressive computation speed of about 0.001 s per simulation step proves its practical application potential, and this method is independent of powertrain topology such that it is applicative for any type of HEVs even when future driving information is unavailable.
引用
收藏
页码:7416 / 7430
页数:15
相关论文
共 41 条
[1]   MPC-Based Energy Management of a Power-Split Hybrid Electric Vehicle [J].
Borhan, Hoseinali ;
Vahidi, Ardalan ;
Phillips, Anthony M. ;
Kuang, Ming L. ;
Kolmanovsky, Ilya V. ;
Di Cairano, Stefano .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2012, 20 (03) :593-603
[2]   Energy Management for a Power-Split Plug-in Hybrid Electric Vehicle Based on Dynamic Programming and Neural Networks [J].
Chen, Zheng ;
Mi, Chunting Chris ;
Xu, Jun ;
Gong, Xianzhi ;
You, Chenwen .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2014, 63 (04) :1567-1580
[3]  
Glorot X., 2010, P 13 INT C ART INT S, V9, P249
[4]   A novel MPC-based adaptive energy management strategy in plug-in hybrid electric vehicles [J].
Guo Jinquan ;
He Hongwen ;
Peng Jiankun ;
Zhou Nana .
ENERGY, 2019, 175 :378-392
[5]  
Hausknecht M., 2016, DEEP REINFORCEMENT L
[6]   Real-time global driving cycle construction and the application to economy driving pro system in plug-in hybrid electric vehicles [J].
He Hongwen ;
Guo Jinquan ;
Peng Jiankun ;
Tan Huachun ;
Sun Chao .
ENERGY, 2018, 152 :95-107
[7]   Energy Management Strategy for a Hybrid Electric Vehicle Based on Deep Reinforcement Learning [J].
Hu, Yue ;
Li, Weimin ;
Xu, Kun ;
Zahid, Taimoor ;
Qin, Feiyan ;
Li, Chenming .
APPLIED SCIENCES-BASEL, 2018, 8 (02)
[8]   Model predictive control power management strategies for HEVs: A review [J].
Huang, Yanjun ;
Wang, Hong ;
Khajepour, Amir ;
He, Hongwen ;
Ji, Jie .
JOURNAL OF POWER SOURCES, 2017, 341 :91-106
[9]   A predictive power management controller for service vehicle anti-idling systems without a priori information [J].
Huang, Yanjun ;
Khajepour, Amir ;
Wang, Hong .
APPLIED ENERGY, 2016, 182 :548-557
[10]   Predictive AECMS by Utilization of Intelligent Transportation Systems for Hybrid Electric Vehicle Powertrain Control [J].
Kazemi, Hadi ;
Fallah, Yaser P. ;
Nix, Andrew ;
Wayne, Scott .
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2017, 2 (02) :75-84