Multi-vehicle routing problems with soft time windows: A multi-agent reinforcement learning approach

被引:98
|
作者
Zhang, Ke [1 ]
He, Fang [2 ]
Zhang, Zhengchao [1 ]
Lin, Xi [1 ]
Li, Meng [1 ]
机构
[1] Tsinghua Univ, Dept Civil Engn, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Dept Ind Engn, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Vehicle routing problem; Attention mechanism; Computational efficiency; Multi-agent; ALGORITHMS;
D O I
10.1016/j.trc.2020.102861
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Multi-vehicle routing problem with soft time windows (MVRPSTW) is an indispensable constituent in urban logistics distribution systems. Over the past decade, numerous methods for MVRPSTW have been proposed, but most are based on heuristic rules that require a large amount of computation time. With the current rapid increase of logistics demands, traditional methods incur the dilemma between computational efficiency and solution quality. To efficiently solve the problem, we propose a novel reinforcement learning algorithm called the Multi-Agent Attention Model that can solve routing problem instantly benefit from lengthy offline training. Specifically, the vehicle routing problem is regarded as a vehicle tour generation process, and an encoder decoder framework with attention layers is proposed to generate tours of multiple vehicles iteratively. Furthermore, a multi-agent reinforcement learning method with an unsupervised auxiliary network is developed for the model training. By evaluated on four synthetic networks with different scales, the results demonstrate that the proposed method consistently outperforms Google OR-Tools and traditional methods with little computation time. In addition, we validate the robustness of the well-trained model by varying the number of customers and the capacities of vehicles.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Coordinated multi-agent hierarchical deep reinforcement learning to solve multi-trip vehicle routing problems with soft time windows
    Zhang, Zixian
    Qi, Geqi
    Guan, Wei
    IET INTELLIGENT TRANSPORT SYSTEMS, 2023, 17 (10) : 2034 - 2051
  • [2] A Multi-agent Approach to the Dynamic Vehicle Routing Problem with Time Windows
    Barbucha, Dariusz
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2013, 8083 : 467 - 476
  • [3] Multi-Vehicle Collaborative Lane Changing Based on Multi-Agent Reinforcement Learning
    Zhang, Xiang
    Li, Shihao
    Wang, Boyang
    Xue, Mingxuan
    Li, Zhiwei
    Liu, Haiou
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 1214 - 1221
  • [4] Robust Multi-vehicle Routing with Communication Enhanced Multi-agent Reinforcement Learning for Last-Mile Logistics
    Wang, Hai
    Wang, Shuai
    Wang, Shuai
    Zhou, Xiaolei
    WEB AND BIG DATA, APWEB-WAIM 2024, PT V, 2024, 14965 : 470 - 480
  • [5] Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach
    Mak, Stephen
    Xu, Liming
    Pearce, Tim
    Ostroumov, Michael
    Brintrup, Alexandra
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 157
  • [6] A multi-agent model for the Vehicle Routing Problem with Time Windows
    Kefi, M
    Ghédira, K
    URBAN TRANSPORT X: URBAN TRANSPORT AND THE ENVIRONMENT IN THE 21ST CENTURY, 2004, 16 : 227 - 234
  • [7] Real-Time Multi-Vehicle Scheduling in Tasks With Dependency Relationships Using Multi-Agent Reinforcement Learning
    Zhang, Shupei
    Shi, Huapeng
    Zhang, Wei
    Pang, Ying
    Sun, Pengju
    IEEE ACCESS, 2024, 12 : 81453 - 81470
  • [8] A multi-agent deep reinforcement learning approach for solving the multi-depot vehicle routing problem
    Arishi, Ali
    Krishnan, Krishna
    JOURNAL OF MANAGEMENT ANALYTICS, 2023, 10 (03) : 493 - 515
  • [9] Improved Multi-Agent System for the Vehicle Routing Problem with Time Windows
    但正刚
    蔡临宁
    郑力
    TsinghuaScienceandTechnology, 2009, 14 (03) : 407 - 412
  • [10] Multi-agent reinforcement learning for electric vehicle decarbonized routing and scheduling
    Wang, Yi
    Qiu, Dawei
    He, Yinglong
    Zhou, Quan
    Strbac, Goran
    ENERGY, 2023, 284