Optimal Policy Characterization Enhanced Actor-Critic Approach for Electric Vehicle Charging Scheduling in a Power Distribution Network

被引:71
作者
Jin, Jiangliang [1 ,2 ]
Xu, Yunjian [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Mech & Automat Engn, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Shun Hing Inst Adv Engn, Hong Kong, Peoples R China
关键词
Electric vehicle charging; Optimal scheduling; Stochastic processes; Distribution networks; Reinforcement learning; Solar power generation; Dynamic programming; deep reinforcement learning; electric vehicle charging; actor-critic approach; power distribution network; DEMAND RESPONSE; ENERGY-STORAGE; REINFORCEMENT; SYSTEMS;
D O I
10.1109/TSG.2020.3028470
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We study the scheduling of large-scale electric vehicle (EV) charging in a power distribution network under random renewable generation and electricity prices. The problem is formulated as a stochastic dynamic program with unknown state transition probability. To mitigate the curse of dimensionality, we establish the nodal multi-target (NMT) characterization of the optimal scheduling policy: all EVs with the same deadline at the same bus should be charged to approach a single target of remaining energy demand. We prove that the NMT characterization is optimal under arbitrarily random system dynamics. To adaptively learn the dynamics of system uncertainty, we propose a model-free soft-actor-critic (SAC) based method to determine the target levels for the characterized NMT policy. The proposed SAC + NMT approach significantly outperforms existing deep reinforcement learning methods (in our numerical experiments on the IEEE 37-node test feeder), as the established NMT characterization sharply reduces the dimensionality of neural network outputs without loss of optimality.
引用
收藏
页码:1416 / 1428
页数:13
相关论文
共 47 条
  • [1] [Anonymous], 2017, EL CHARG AN 2017 PUB
  • [2] [Anonymous], 1992, IEEE 13 NOD DISTR TE
  • [3] [Anonymous], 2017, PV LIV
  • [4] [Anonymous], 2017, MARK OP
  • [5] [Anonymous], 2017, HIST MARK DAT
  • [6] Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model
    Azar, Mohammad Gheshlaghi
    Munos, Remi
    Kappen, Hilbert J.
    [J]. MACHINE LEARNING, 2013, 91 (03) : 325 - 349
  • [7] An Online Learning Algorithm for Demand Response in Smart Grid
    Bahraini, Shahab
    Wong, Vincent W. S.
    Huang, Jianwei
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2018, 9 (05) : 4712 - 4725
  • [8] OPTIMAL CAPACITOR PLACEMENT ON RADIAL-DISTRIBUTION SYSTEMS
    BARAN, ME
    WU, FF
    [J]. IEEE TRANSACTIONS ON POWER DELIVERY, 1989, 4 (01) : 725 - 734
  • [9] NETWORK RECONFIGURATION IN DISTRIBUTION-SYSTEMS FOR LOSS REDUCTION AND LOAD BALANCING
    BARAN, ME
    WU, FF
    [J]. IEEE TRANSACTIONS ON POWER DELIVERY, 1989, 4 (02) : 1401 - 1407
  • [10] Coordinated charging and discharging strategies for plug-in electric bus fast charging station with energy storage system
    Chen, Huimiao
    Hu, Zechun
    Zhang, Hongcai
    Luo, Haocheng
    [J]. IET GENERATION TRANSMISSION & DISTRIBUTION, 2018, 12 (09) : 2019 - 2028