共 27 条
Cost-Effective Power Delivery via Deep Reinforcement Learning-Based Dynamic Electric Vehicle Transportation
被引:9
作者:
Bao, Zheng
[1
]
Tang, Changbing
[2
]
Yu, Xinghuo
[3
]
Lin, Feilong
[1
]
Wen, Guanghui
[4
]
Zheng, Zhonglong
[1
]
机构:
[1] Zhejiang Normal Univ, Sch Comp Sci & Technol, Jinhua 321004, Peoples R China
[2] Zhejiang Normal Univ, Coll Phys & Elect Informat Engn, Jinhua 321004, Peoples R China
[3] RMIT Univ, Sch Engn, Melbourne, Vic 3000, Australia
[4] Southeast Univ, Sch Automat, Nanjing 211189, Peoples R China
基金:
中国国家自然科学基金;
关键词:
Power transmission lines;
Batteries;
Costs;
Transportation;
Power system reliability;
Load modeling;
Load flow;
Reliability;
Power system dynamics;
Load shedding;
Electric vehicle;
load shedding;
Markov decision process (MDP);
power delivery;
reinforcement learning (RL);
MANAGEMENT;
D O I:
10.1109/JIOT.2025.3552823
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Power delivery issues are increasingly evident in cyber-physical smart grid systems as energy transactions frequently overlook the physical constraints of distribution, leading to transmission congestion and compromising network security and reliability. This article presents a novel and cost-effective solution to power delivery challenges by utilizing electric vehicles (EVs) with dynamic transportation capabilities as free carriers. Unlike traditional approaches, a deep reinforcement learning (DRL)-based optimization framework is designed to effectively manage incomplete information in real-time. Our method first introduces an investment-free model that leverages existing EV routes to transport energy during congestion, operating in a "free-riding" transmission mode. This not only enhances network reliability but also curtails costs. Then, we develop a Markov decision process (MDP) for sequential decision-making of 24-h optimal control, aimed at minimizing operational losses including load shedding and battery degradation. To deal with the stochastic nature of energy requests and EV routes in the control problem, we employ a model-free DRL algorithm to tackle the challenge of incomplete information. An Actor-Critic network, combining value-based and policy-based approaches, helps discover approximately optimal strategies in a continuous action space. Finally, the simulation results numerically demonstrate the performance of the proposed method.
引用
收藏
页码:23245 / 23256
页数:12
相关论文