Graph Q-learning Assisted Ant Colony Optimization for Vehicle Routing Problems with Time Windows

被引:2
|
作者
Yue, Peng [1 ]
Liu, Shiqing [2 ]
Jin, Yaochu [2 ]
机构
[1] Northeastern Univ, Shenyang, Liaoning, Peoples R China
[2] Bielefeld Univ, Bielefeld, Germany
关键词
Ant Colony Optimization; Deep Q-Learning; Graph Neural Network; Neural Combinatorial Optimization;
D O I
10.1145/3583133.3596423
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vehicle routing problem with time windows (VRPTW) is a typical class of constrained path planning problems in the field of combinatorial optimization. VRPTW considers a delivery task for a given set of customers with time windows, and the target is to find optimal routes for a group of vehicles that can minimize the total transportation cost. The traditional heuristics suffer from several limitations when solving VRPTW, such as poor scalability, sensitivity to hyperparameters and difficulty in handling complex constraints. Recent advance in machine learning makes it possible to enhance heuristic approaches via learned knowledge. In this paper, we propose a graph Q-learning assisted ant colony optimization algorithm named GQL-ACO to solve VRPTW. Compared to vanilla ant colony optimization (ACO), our proposed method first employs the learned heuristic values by using graph Q learning, instead of handcrafted ones, to define the hyperparameters of ACO. Second, we design a collaborative search strategy by combining ACO and Q-learning effectively, which can adaptively adjust the hyperparameters of ACO based on the search experiences.
引用
收藏
页码:7 / 8
页数:2
相关论文
共 50 条
  • [21] An Improved Ant Colony Algorithm for Open Vehicle Routing Problem with Time Windows
    Li Guiyun
    2009 INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT, INNOVATION MANAGEMENT AND INDUSTRIAL ENGINEERING, VOL 2, PROCEEDINGS, 2009, : 616 - 619
  • [22] The optimal design of the vehicle routing problem with time windows by ant colony system
    Ono, Hiroaki
    Mori, Yasuchika
    PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-8, 2007, : 1321 - 1325
  • [23] A Multiple Ant Colony System for the Electric Vehicle Routing Problem with Time Windows
    Mavrovouniotis, Michalis
    Ellinas, Georgios
    Li, Changhe
    Polycarpou, Marios
    2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 796 - 803
  • [24] Ant Colony Optimization for a Stochastic Vehicle Routing Problem with Driver Learning
    Schneider, Michael
    Doppstadt, Christian
    Stenger, Andreas
    Schwind, Michael
    2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
  • [25] Vehicle routing problems with time windows based on the improved hybrid fish swarm-ant colony algorithm
    Zhang, Jun
    Zhang, Jing
    Qin, Zhentao
    Jia, Yan
    INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2022,
  • [26] Modified artificial bee colony for the vehicle routing problems with time windows
    Alzaqebah, Malek
    Abdullah, Salwani
    Jawarneh, Sana
    SPRINGERPLUS, 2016, 5
  • [27] A stable ant colony routing algorithm based on Q-learning for Ad Hoc Networks
    Wang, Q.-W. (wqw013890@163.com), 1600, Harbin Institute of Technology (44):
  • [28] Improved ant colony optimization for multi-depot heterogeneous vehicle routing problem with soft time windows
    Tang, Yalian
    Cai, Yanguang
    Yang, Qijiang
    Journal of Southeast University (English Edition), 2015, 31 (01) : 94 - 99
  • [29] A hybrid ant colony optimization algorithm for a multi-objective vehicle routing problem with flexible time windows
    Zhang, Huizhen
    Zhang, Qinwan
    Ma, Liang
    Zhang, Ziying
    Liu, Yun
    INFORMATION SCIENCES, 2019, 490 : 166 - 190
  • [30] Q-Learning Ant Colony Optimization supported by Deep Learning for Target Set Selection
    Ramirez Sanchez, Jairo Enrique
    Sartori, Camilo Chacon
    Blum, Christian
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 357 - 374