Optimal Priority Rule-Enhanced Deep Reinforcement Learning for Charging Scheduling in an Electric Vehicle Battery Swapping Station

被引:19
作者
Jin, Jiangliang [1 ]
Mao, Shuai [2 ]
Xu, Yunjian [2 ,3 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 200051, Peoples R China
[2] Chinese Univ Hong Kong, Dept Mech & Automat Engn, Hong Kong, Peoples R China
[3] CUHK Shenzhen Res Inst, Shenzhen 518172, Peoples R China
基金
中国国家自然科学基金;
关键词
Electric vehicle; battery swapping station; Markov decision process; deep reinforcement learning; renewable generation; OPERATION MODEL; OPTIMIZATION; MANAGEMENT; SYSTEMS;
D O I
10.1109/TSG.2023.3250505
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
For a battery swapping station (BSS) with solar generation, N charging bays, and an inventory of M batteries, we study the charging scheduling problem under random EV arrivals, renewable generation, and electricity prices. To minimize the expected weighted sum of charging cost (sum of electricity and battery degradation costs) and EV owners' waiting cost, we formulate the problem as a Markov decision process with unknown state transition probability. Under a mild heavy-traffic assumption, we rigorously establish the optimality of the Less Demand First (LDF) priority rule under arbitrary system dynamics: batteries with less demand shall be charged first. The optimality result enables us to integrate the LDF rule into a state-of-the-art deep reinforcement learning (DRL) method, proximal policy optimization (PPO), reducing the dimensionality of its output from O(M+N) to O(1), without loss of optimality in the heavy-traffic scenario. Numerical results (on real-world data) demonstrate that the proposed LDF enhanced PPO approach significantly outperforms classical DRL methods and FCFS (first come, first served) priority rule based DRL counterparts.
引用
收藏
页码:4581 / 4593
页数:13
相关论文
共 49 条
  • [1] [Anonymous], 2019, Electric vehicle charging station usage PA
  • [2] [Anonymous], 2022, Nio to build "several hundred" battery swap stations in europe by 2025
  • [3] [Anonymous], 2019, Supply and renewables
  • [4] [Anonymous], 2019, Real-time price
  • [5] Integrated Operation Model for Autonomous Mobility-on-Demand Fleet and Battery Swapping Station
    Ding, Zhaohao
    Tan, Wenrui
    Lee, Wei-Jen
    Pan, Xuyang
    Gao, Shiqiao
    [J]. IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2021, 57 (06) : 5593 - 5602
  • [6] Deep Reinforcement Learning Based Optimal Schedule for a Battery Swapping Station Considering Uncertainties
    Gao, Yuan
    Yang, Jiajun
    Yang, Ming
    Li, Zhengshuo
    [J]. IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2020, 56 (05) : 5775 - 5784
  • [7] Groves J., 2022, Nio power swap station: Does it work?
  • [8] Guterres A., 2020, CARBON NEUTRALITY 20
  • [9] Hill, 2018, STABLE BASELINES
  • [10] MULTILAYER FEEDFORWARD NETWORKS ARE UNIVERSAL APPROXIMATORS
    HORNIK, K
    STINCHCOMBE, M
    WHITE, H
    [J]. NEURAL NETWORKS, 1989, 2 (05) : 359 - 366