Optimal Priority Rule-Enhanced Deep Reinforcement Learning for Charging Scheduling in an Electric Vehicle Battery Swapping Station

被引:19
作者
Jin, Jiangliang [1 ]
Mao, Shuai [2 ]
Xu, Yunjian [2 ,3 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 200051, Peoples R China
[2] Chinese Univ Hong Kong, Dept Mech & Automat Engn, Hong Kong, Peoples R China
[3] CUHK Shenzhen Res Inst, Shenzhen 518172, Peoples R China
基金
中国国家自然科学基金;
关键词
Electric vehicle; battery swapping station; Markov decision process; deep reinforcement learning; renewable generation; OPERATION MODEL; OPTIMIZATION; MANAGEMENT; SYSTEMS;
D O I
10.1109/TSG.2023.3250505
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
For a battery swapping station (BSS) with solar generation, N charging bays, and an inventory of M batteries, we study the charging scheduling problem under random EV arrivals, renewable generation, and electricity prices. To minimize the expected weighted sum of charging cost (sum of electricity and battery degradation costs) and EV owners' waiting cost, we formulate the problem as a Markov decision process with unknown state transition probability. Under a mild heavy-traffic assumption, we rigorously establish the optimality of the Less Demand First (LDF) priority rule under arbitrary system dynamics: batteries with less demand shall be charged first. The optimality result enables us to integrate the LDF rule into a state-of-the-art deep reinforcement learning (DRL) method, proximal policy optimization (PPO), reducing the dimensionality of its output from O(M+N) to O(1), without loss of optimality in the heavy-traffic scenario. Numerical results (on real-world data) demonstrate that the proposed LDF enhanced PPO approach significantly outperforms classical DRL methods and FCFS (first come, first served) priority rule based DRL counterparts.
引用
收藏
页码:4581 / 4593
页数:13
相关论文
共 49 条
  • [21] Efficient decentralized coordination of large-scale plug-in electric vehicle charging
    Ma, Zhongjing
    Zou, Suli
    Ran, Long
    Shi, Xingyu
    Hiskens, Ian A.
    [J]. AUTOMATICA, 2016, 69 : 35 - 47
  • [22] McKerracher C., 2022, ELECT VEHICLE OUTLOO
  • [23] A Graph Automorphic Approach for Placement and Sizing of Charging Stations in EV Network Considering Traffic
    Parastvand, Hossein
    Moghaddam, Valeh
    Bass, Octavian
    Masoum, Mohammad A. S.
    Chapman, Airlie
    Lachowicz, Stefan
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (05) : 4190 - 4200
  • [24] Optimization of Battery Charging and Purchasing at Electric Vehicle Battery Swap Stations
    Schneider, Frank
    Thonemann, Ulrich W.
    Klabjan, Diego
    [J]. TRANSPORTATION SCIENCE, 2018, 52 (05) : 1211 - 1234
  • [25] Schulman J, 2018, Arxiv, DOI [arXiv:1506.02438, 10.48550/arXiv.1506.02438]
  • [26] Schulman J, 2017, Arxiv, DOI arXiv:1707.06347
  • [27] A Cluster-Based Operation Model of Aggregated Battery Swapping Stations
    Sepetanc, Karlo
    Pandzic, Hrvoje
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (01) : 249 - 260
  • [28] A Dynamic Optimal Battery Swapping Mechanism for Electric Vehicles Using an LSTM-Based Rolling Horizon Approach
    Shalaby, Ahmed A.
    Shaaban, Mostafa F.
    Mokhtar, Mohamed
    Zeineldin, Hatem H.
    El-Saadany, Ehab F.
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) : 15218 - 15232
  • [29] Optimal battery purchasing and charging strategy at electric vehicle battery swap stations
    Sun, Bo
    Sun, Xu
    Tsang, Danny H. K.
    Whitt, Ward
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 279 (02) : 524 - 539
  • [30] Sutton RS, 2018, ADAPT COMPUT MACH LE, P1