Interpretable Deep Reinforcement Learning With Imitative Expert Experience for Smart Charging of Electric Vehicles

被引:1
作者
Li, Shuangqi [1 ,2 ]
Zhao, Alexis Pengfei [3 ,4 ]
Gu, Chenghong [2 ]
Bu, Siqi [1 ,5 ]
Chung, Edward [6 ]
Tian, Zhongbei [7 ]
Li, Jianwei [2 ]
Cheng, Shuang [2 ]
机构
[1] Hong Kong Polytech Univ, Res Ctr Grid Modernizat, Dept Elect & Elect Engn, Kowloon, Hong Kong, Peoples R China
[2] Univ Bath, Dept Elect & Elect Engn, Bath BA2 7AY, England
[3] Chinese Acad Sci, Inst Automat, Beijing 100045, Peoples R China
[4] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100190, Peoples R China
[5] Hong Kong Polytech Univ, Policy Res Ctr Innovat & Technol, Kowloon, Hong Kong, Peoples R China
[6] Hong Kong Polytech Univ, Dept Elect & Elect Engn, Hong Kong, Peoples R China
[7] Univ Birmingham, Birmingham Ctr Railway Res & Educ BCRRE, Transport Energy Syst, Birmingham B15 2TT, England
基金
英国工程与自然科学研究理事会;
关键词
Vehicle-to-grid; Batteries; Renewable energy sources; Reinforcement learning; Training; Aging; Imitation learning; Electric vehicle; vehicle grid integration; smart charging; renewable energy; battery aging; cost-benefit analysis; RENEWABLE ENERGY-SOURCES; OPTIMIZATION; MANAGEMENT; SYSTEMS;
D O I
10.1109/TPWRS.2024.3425843
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep reinforcement learning (DRL) is a promising candidate for realizing online complex system optimal control because of its high computation efficiency. However, the interpretability and reliability problems limit its engineering application in smart grid energy management. This paper for the first time designs a novel imitative learning framework to provide a reliable solution for computation-efficient grid-connected electric vehicles (GEVs) charging management in smart grids. The optimal strategies are derived by a priors optimization model based on vehicle-to-grid (V2G) cost-benefit analysis. With better interpretability and ensured optimality, the derived strategies are used to construct an experience pool for configuring the learning environment. Then, a novel imitative learning mechanism is designed to facilitate the knowledge transfer between expert experience and reinforcement learning model. Further, a novel dual actor-imitator learning network to enable flexible scheduling of V2G power of GEVs. With the dual network structure, the expert experience can be effectively utilized to enhance the training efficiency and performance of the DRL-based V2G coordinator. The effectiveness of the developed method in improving V2G benefit and mitigating battery aging is validated on a demonstrative microgrid in the U.K.
引用
收藏
页码:1228 / 1240
页数:13
相关论文
共 56 条
[1]  
[Anonymous], 2020, Western Power Distribution
[2]   Artificial intelligence and machine learning approaches to energy demand-side response: A systematic review [J].
Antonopoulos, Ioannis ;
Robu, Valentin ;
Couraud, Benoit ;
Kirli, Desen ;
Norbu, Sonam ;
Kiprakis, Aristides ;
Flynn, David ;
Elizondo-Gonzalez, Sergio ;
Wattam, Steve .
RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2020, 130 (130)
[3]   An Optimal Day-Ahead Scheduling Framework for E-Mobility Ecosystem Operation With Drivers' Preferences [J].
Bagheri Tookanlou, Mahsa ;
Pourmousavi Kani, S. Ali ;
Marzband, Mousa .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2021, 36 (06) :5245-5257
[4]  
Baomar H, 2016, INT CONF UNMAN AIRCR, P1023, DOI 10.1109/ICUAS.2016.7502578
[5]   Sizing of Energy Storage for Grid Inertial Support in Presence of Renewable Energy [J].
Bera, Atri ;
Chalamala, Babu R. ;
Byrne, Raymond H. ;
Mitra, Joydeep .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2022, 37 (05) :3769-3778
[6]   Global against divided optimization for the participation of an EV aggregator in the day-ahead electricity market. Part II: Numerical analysis [J].
Bessa, R. J. ;
Matos, M. A. .
ELECTRIC POWER SYSTEMS RESEARCH, 2013, 95 :319-329
[7]   Reinforcement Learning and Its Applications in Modern Power and Energy Systems: A Review [J].
Cao, Di ;
Hu, Weihao ;
Zhao, Junbo ;
Zhang, Guozhou ;
Zhang, Bin ;
Liu, Zhou ;
Chen, Zhe ;
Blaabjerg, Frede .
JOURNAL OF MODERN POWER SYSTEMS AND CLEAN ENERGY, 2020, 8 (06) :1029-1042
[8]   Joint Optimization of Delay-Tolerant Autonomous Electric Vehicles Charge Scheduling and Station Battery Degradation [J].
Cao, Yongsheng ;
Li, Demin ;
Zhang, Yihong ;
Chen, Xuemin .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (09) :8590-8599
[9]   Interpretable End-to-End Urban Autonomous Driving With Latent Deep Reinforcement Learning [J].
Chen, Jianyu ;
Li, Shengbo Eben ;
Tomizuka, Masayoshi .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) :5068-5078
[10]   Online Scheduling for Hierarchical Vehicle-to-Grid System: Design, Formulation, and Algorithm [J].
Chen, Xiangyu ;
Leung, Ka-Cheong ;
Lam, Albert Y. S. ;
Hill, David. J. .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (02) :1302-1317