Multiobjective Vehicle Routing Optimization With Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II

被引:0
|
作者
Wu, Rixin [1 ,2 ]
Wang, Ran [1 ,2 ]
Hao, Jie [1 ,2 ]
Wu, Qiang [1 ,2 ]
Wang, Ping [3 ]
Niyato, Dusit [4 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210038, Jiangsu, Peoples R China
[3] York Univ, Lassonde Sch Engn, Dept Elect Engn & Comp Sci, Toronto, ON M3J1P3, Canada
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore City 639798, Singapore
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
Optimization; Heuristic algorithms; Genetic algorithms; Vehicle routing; Training; Costs; Transportation; Deep reinforcement learning; Customer satisfaction; Sorting; Multiobjective optimization; vehicle routing problem; deep reinforcement learning; transformer; weight-aware strategy; GENETIC ALGORITHM;
D O I
10.1109/TITS.2024.3515997
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This paper proposes a weight-aware deep reinforcement learning (WADRL) approach designed to address the multiobjective vehicle routing problem with time windows (MOVRPTW), aiming to use a single deep reinforcement learning (DRL) model to solve the entire multiobjective optimization problem. The Non-dominated sorting genetic algorithm-II (NSGA-II) method is then employed to optimize the outcomes produced by the WADRL, thereby mitigating the limitations of both approaches. Firstly, we design an MOVRPTW model to balance the minimization of travel cost and the maximization of customer satisfaction. Subsequently, we present a novel DRL framework that incorporates a transformer-based policy network. This network is composed of an encoder module, a weight embedding module where the weights of the objective functions are incorporated, and a decoder module. NSGA-II is then utilized to optimize the solutions generated by WADRL. Finally, extensive experimental results demonstrate that our method outperforms the existing and traditional methods. Due to the numerous constraints in VRPTW, generating initial solutions of the NSGA-II algorithm can be time-consuming. However, using solutions generated by the WADRL as initial solutions for NSGA-II significantly reduces the time required for generating initial solutions. Meanwhile, the NSGA-II algorithm can enhance the quality of solutions generated by WADRL, resulting in solutions with better scalability. Notably, the weight-aware strategy significantly reduces the training time of DRL while achieving better results, enabling a single DRL model to solve the entire multiobjective optimization problem.
引用
收藏
页码:4032 / 4047
页数:16
相关论文
共 50 条
  • [1] NSGA-II with objective-specific variation operators for multiobjective vehicle routing problem with time windows
    Srivastava, Gaurav
    Singh, Alok
    Mallipeddi, Rammohan
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 176
  • [2] Multiobjective Hyperparameter Optimization for Deep Learning Interatomic Potential Training Using NSGA-II
    Coletti, Mark
    Sedova, Ada
    Chahal, Rajni
    Gibson, Luke
    Roy, Santanu
    Bryantsev, Vyacheslav S.
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS PROCEEDINGS, ICPP-W 2023, 2023, : 172 - 179
  • [3] Multiobjective Optimal Secure Routing Algorithm Using NSGA-II
    Han, Dan
    Hu Guang-min
    Lu, Cai
    2008 IEEE CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 884 - 888
  • [4] Deep Reinforcement Learning for the Electric Vehicle Routing Problem With Time Windows
    Lin, Bo
    Ghaddar, Bissan
    Nathwani, Jatin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 11528 - 11538
  • [5] Optimization of Fuzzy Neural Network using Multiobjective NSGA-II
    Gope, Monika
    Omar, Mehnuma Tabassum
    Shill, Pintu Chandra
    PROCEEDINGS OF 6TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING (ICCCE 2016), 2016, : 300 - 305
  • [6] Interactive fuzzy multiobjective reliability optimization using NSGA-II
    Kishor A.
    Yadav S.P.
    Kumar S.
    OPSEARCH, 2009, 46 (2) : 214 - 224
  • [7] An Or-opt NSGA-II Algorithm for Multi-Objective Vehicle Routing Problem with Time Windows
    Xu, Huayu
    Fan, Wenhui
    Wei, Tian
    Yu, Lijun
    2008 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING, VOLS 1 AND 2, 2008, : 309 - +
  • [8] Optimization of the Vehicle Routing Problem with Demand Responsive Transport Using the NSGA-II Algorithm
    Mendes, Renan S.
    Wanner, Elizabeth F.
    Sarubbi, Joao F. M.
    Martins, Flavio V. C.
    2016 IEEE 19TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2016, : 2657 - 2662
  • [9] Robust Multiobjective Optimization for Vehicle Routing Problem With Time Windows
    Duan, Jiahui
    He, Zhenan
    Yen, Gary G.
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) : 8300 - 8314
  • [10] Multiobjective optimization of fluphenazine nanocomposite formulation using NSGA-II method
    Abu Sharar, Ahmed Adnan
    Ramadan, Saleem Z.
    Hussein-Al-Ali, Samer Hasan
    MATERIALS SCIENCE-POLAND, 2021, 39 (04) : 517 - 544