Graph Q-learning Assisted Ant Colony Optimization for Vehicle Routing Problems with Time Windows

被引：2

作者：

Yue, Peng ^{[1
]}

Liu, Shiqing ^{[2
]}

Jin, Yaochu ^{[2
]}

机构：

[1] Northeastern Univ, Shenyang, Liaoning, Peoples R China

[2] Bielefeld Univ, Bielefeld, Germany

来源：

PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION | 2023年

关键词：

Ant Colony Optimization; Deep Q-Learning; Graph Neural Network; Neural Combinatorial Optimization;

D O I：

10.1145/3583133.3596423

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Vehicle routing problem with time windows (VRPTW) is a typical class of constrained path planning problems in the field of combinatorial optimization. VRPTW considers a delivery task for a given set of customers with time windows, and the target is to find optimal routes for a group of vehicles that can minimize the total transportation cost. The traditional heuristics suffer from several limitations when solving VRPTW, such as poor scalability, sensitivity to hyperparameters and difficulty in handling complex constraints. Recent advance in machine learning makes it possible to enhance heuristic approaches via learned knowledge. In this paper, we propose a graph Q-learning assisted ant colony optimization algorithm named GQL-ACO to solve VRPTW. Compared to vanilla ant colony optimization (ACO), our proposed method first employs the learned heuristic values by using graph Q learning, instead of handcrafted ones, to define the hyperparameters of ACO. Second, we design a collaborative search strategy by combining ACO and Q-learning effectively, which can adaptively adjust the hyperparameters of ACO based on the search experiences.

引用

页码：7 / 8

页数：2

共 50 条

[21] An Improved Ant Colony Algorithm for Open Vehicle Routing Problem with Time Windows
Li Guiyun
2009 INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT, INNOVATION MANAGEMENT AND INDUSTRIAL ENGINEERING, VOL 2, PROCEEDINGS, 2009, : 616 - 619
[22] The optimal design of the vehicle routing problem with time windows by ant colony system
Ono, Hiroaki
Mori, Yasuchika
PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-8, 2007, : 1321 - 1325
[23] A Multiple Ant Colony System for the Electric Vehicle Routing Problem with Time Windows
Mavrovouniotis, Michalis
Ellinas, Georgios
Li, Changhe
Polycarpou, Marios
2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 796 - 803
[24] Ant Colony Optimization for a Stochastic Vehicle Routing Problem with Driver Learning
Schneider, Michael
Doppstadt, Christian
Stenger, Andreas
Schwind, Michael
2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
[25] Vehicle routing problems with time windows based on the improved hybrid fish swarm-ant colony algorithm
Zhang, Jun
Zhang, Jing
Qin, Zhentao
Jia, Yan
INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2022,
[26] Modified artificial bee colony for the vehicle routing problems with time windows
Alzaqebah, Malek
Abdullah, Salwani
Jawarneh, Sana
SPRINGERPLUS, 2016, 5
[27] A stable ant colony routing algorithm based on Q-learning for Ad Hoc Networks
Wang, Q.-W. (wqw013890@163.com), 1600, Harbin Institute of Technology (44):
[28] Improved ant colony optimization for multi-depot heterogeneous vehicle routing problem with soft time windows
Tang, Yalian
Cai, Yanguang
Yang, Qijiang
Journal of Southeast University (English Edition), 2015, 31 (01) : 94 - 99
[29] A hybrid ant colony optimization algorithm for a multi-objective vehicle routing problem with flexible time windows
Zhang, Huizhen
Zhang, Qinwan
Ma, Liang
Zhang, Ziying
Liu, Yun
INFORMATION SCIENCES, 2019, 490 : 166 - 190
[30] Q-Learning Ant Colony Optimization supported by Deep Learning for Target Set Selection
Ramirez Sanchez, Jairo Enrique
Sartori, Camilo Chacon
Blum, Christian
PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 357 - 374

← 1 2 3 4 5 →