Dynamic energy scheduling and routing of a large fleet of electric vehicles using multi-agent reinforcement learning

Cited by: 34

Authors:

Alqahtani, Mohammed [1,2]

Scott, Michael J. [2]

Hu, Mengqi [2]

Affiliations:

[1] King Khalid Univ, Dept Ind Engn, King Fahad St,Guraiger, Abha 62529, Saudi Arabia

[2] Univ Illinois, Dept Mech & Ind Engn, 842 Taylor St, Chicago, IL 60607 USA

Source:

COMPUTERS & INDUSTRIAL ENGINEERING | 2022, Vol. 169

Funding:

U.S. National Science Foundation;

Keywords:

Electric vehicle; Vehicle routing; Energy scheduling; Multi-agent reinforcement learning; Deep reinforcement learning; RENEWABLE ENERGY; ENVIRONMENTAL OPTIMIZATION; DECOMPOSITION METHOD; CHARGING PATTERNS; POWER-SYSTEM; SCALE; MODEL; MANAGEMENT; OPERATION; STORAGE;

DOI:

10.1016/j.cie.2022.108180

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

As the world's population and economy grow, demand for energy increases as well. Smart grids can be a cost-effective solution to overcome increases in energy demand and ensure power security. Current applications of smart grids involve large numbers of agents (e.g., electric vehicles). Since each agent must interact with other agents when making decisions (e.g., movement and scheduling), the computational complexity of smart grid systems increases exponentially with the number of agents. Computational tractability of planning is a significant barrier to implementation of large-scale smart grids of electric vehicles. Existing solution approaches such as mixed-integer programming and dynamic programming are not computationally efficient for high-dimensional problems. This paper proposes a reformulation of a Mixed-Integer Programming model into a Decentralized Markov Decision Process model and solves it using a Multi-Agent Reinforcement Learning algorithm to address the scalability issues of large-scale smart grid systems. The Decentralized Markov Decision Process model uses centralized training and distributed execution: agents are trained using a unique actor network for each agent and a shared critic network, and then each agent executes actions independently of the other agents to reduce computation time. The performance of the Multi-Agent Reinforcement Learning model is assessed under different configurations of customers and electric vehicles, and compared to the results from deep reinforcement learning and three heuristic algorithms. The simulation results demonstrate that the Multi-Agent Reinforcement Learning algorithm can reduce simulation time significantly compared to deep reinforcement learning, genetic algorithm, particle swarm optimization, and the artificial fish swarm algorithm. The superior performance of the proposed method indicates that it may be a realistic solution for large-scale implementation.
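The centralized-training, distributed-execution pattern named in the abstract (one actor per agent, one shared critic) can be illustrated with a minimal sketch. This is not the authors' implementation: the linear actors and critic, the one-step advantage (reward minus the critic's value, with no bootstrapping), and every name below (Actor, SharedCritic, train_step, execute) are assumptions chosen only to keep the example small and dependency-free.

```python
import math
import random


class Actor:
    """Hypothetical per-agent policy: linear scores on the LOCAL observation, then softmax."""

    def __init__(self, obs_dim, n_actions, rng):
        self.w = [[rng.uniform(-0.1, 0.1) for _ in range(obs_dim)]
                  for _ in range(n_actions)]

    def probs(self, obs):
        scores = [sum(wi * oi for wi, oi in zip(row, obs)) for row in self.w]
        m = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - m) for s in scores]
        z = sum(exps)
        return [e / z for e in exps]

    def act(self, obs, rng):
        return rng.choices(range(len(self.w)), weights=self.probs(obs))[0]


class SharedCritic:
    """Hypothetical shared critic: linear value estimate over the JOINT observation."""

    def __init__(self, joint_dim):
        self.v = [0.0] * joint_dim

    def value(self, joint_obs):
        return sum(vi * oi for vi, oi in zip(self.v, joint_obs))

    def update(self, joint_obs, target, lr=0.05):
        err = target - self.value(joint_obs)  # error measured before the update
        self.v = [vi + lr * err * oi for vi, oi in zip(self.v, joint_obs)]
        return err


def train_step(actors, critic, observations, actions, reward, lr=0.05):
    """Centralized training: the critic sees all observations and scores the joint outcome."""
    joint = [x for obs in observations for x in obs]
    advantage = critic.update(joint, reward)
    for actor, obs, a in zip(actors, observations, actions):
        p = actor.probs(obs)
        for k, row in enumerate(actor.w):
            grad_logit = (1.0 if k == a else 0.0) - p[k]  # d log pi(a|obs) / d score_k
            for j in range(len(row)):
                row[j] += lr * advantage * grad_logit * obs[j]
    return advantage


def execute(actors, observations, rng):
    """Distributed execution: each agent acts on its own observation only."""
    return [actor.act(obs, rng) for actor, obs in zip(actors, observations)]


rng = random.Random(0)
actors = [Actor(obs_dim=2, n_actions=3, rng=rng) for _ in range(4)]
critic = SharedCritic(joint_dim=8)
observations = [[1.0, 0.5] for _ in range(4)]
actions = execute(actors, observations, rng)
before = [actor.probs(obs)[a] for actor, obs, a in zip(actors, observations, actions)]
advantage = train_step(actors, critic, observations, actions, reward=1.0)
after = [actor.probs(obs)[a] for actor, obs, a in zip(actors, observations, actions)]
```

In this sketch the critic is needed only inside train_step; execute never touches it, which is the property the abstract credits with reducing computation time at run time.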

Pages: 19

References: 103 in total
