Simulated annealing with reinforcement learning for the set team orienteering problem with time windows

被引:7
作者
Yu, Vincent F. [1 ,2 ]
Salsabila, Nabila Yuraisyah [1 ]
Lin, Shih-Wei [3 ,4 ,5 ]
Gunawan, Aldy [6 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Ind Management, Taipei, Taiwan
[2] Natl Taiwan Univ Sci & Technol, Ctr Cyber Phys Syst Innovat, Taipei, Taiwan
[3] Chang Gung Univ, Dept Informat Management, Taoyuan, Taiwan
[4] Ming Chi Univ Technol, Dept Ind Engn & Management, Taipei, Taiwan
[5] Keelung Chang Gung Mem Hosp, Dept Emergency Med, Keelung City, Taiwan
[6] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore
关键词
Team orienteering problem with time windows; Set orienteering problem; Simulated annealing; LOCAL SEARCH; HEURISTICS; ALGORITHMS;
D O I
10.1016/j.eswa.2023.121996
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research investigates the Set Team Orienteering Problem with Time Windows (STOPTW), a new variant of the well-known Team Orienteering Problem with Time Windows and Set Orienteering Problem. In the STOPTW, customers are grouped into clusters. Each cluster is associated with a profit attainable when a customer in the cluster is visited within the customer's time window. A Mixed Integer Linear Programming model is formulated for STOPTW to maximizing total profit while adhering to time window constraints. Since STOPTW is an NP-hard problem, a Simulated Annealing with Reinforcement Learning (SARL) algorithm is developed. The proposed SARL incorporates the core concepts of reinforcement learning, utilizing the epsilon-greedy algorithm to learn the fitness values resulting from neighborhood moves. Numerical experiments are conducted to assess the performance of SARL, comparing the results with those obtained by CPLEX and Simulated Annealing (SA). For small instances, both SARL and SA algorithms outperform CPLEX by obtaining eight optimal solutions and 12 better solutions. For large instances, both algorithms obtain better solutions to 28 out of 29 instances within shorter computational times compared to CPLEX. Overall, SARL outperforms SA by resulting in lower gap percentages within the same computational times. Specifically, SARL outperforms SA in solving 13 large STOPTW benchmark instances. Finally, a sensitivity analysis is conducted to derive managerial insights.
引用
收藏
页数:15
相关论文
共 45 条
  • [31] Green Fuzzy Tourist Trip Design Problem
    Ruiz-Meza, Jose
    Brito, Julio
    Montoya-Torres, Jairo R.
    Castro-Vergara, Ana
    [J]. ADVANCES IN OPERATIONS RESEARCH, 2022, 2022
  • [32] Novel hybrid algorithm for Team Orienteering Problem with Time Windows for rescue applications
    Saeedvand, Saeed
    Aghdasi, Hadi S.
    Baltes, Jacky
    [J]. APPLIED SOFT COMPUTING, 2020, 96
  • [33] Adaptive neighborhood simulated annealing for sustainability-oriented single machine scheduling with deterioration effect
    Salama, Mohamed
    Srinivas, Sharan
    [J]. APPLIED SOFT COMPUTING, 2021, 110
  • [34] Truck scheduling in a multi-door cross-docking center with partial unloading - Reinforcement learning-based simulated annealing approaches
    Shahmardan, Amin
    Sajadieh, Mohsen S.
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 139
  • [35] ALGORITHMS FOR THE VEHICLE-ROUTING AND SCHEDULING PROBLEMS WITH TIME WINDOW CONSTRAINTS
    SOLOMON, MM
    [J]. OPERATIONS RESEARCH, 1987, 35 (02) : 254 - 265
  • [36] The Multiconstraint Team Orienteering Problem with Multiple Time Windows
    Souffriau, Wouter
    Vansteenwegen, Pieter
    Vanden Berghe, Greet
    Van Oudheusden, Dirk
    [J]. TRANSPORTATION SCIENCE, 2013, 47 (01) : 53 - 63
  • [37] A TABU search heuristic for the team orienteering problem
    Tang, H
    Miller-Hooks, E
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2005, 32 (06) : 1379 - 1407
  • [38] Sustainable fuzzy multi-trip location-routing problem for medical waste management during the COVID-19 outbreak
    Tirkolaee, Erfan Babaee
    Abbasian, Parvin
    Weber, Gerhard-Wilhelm
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 756
  • [39] Developing an applied algorithm for multi-trip vehicle routing problem with time windows in urban waste collection: A case study
    Tirkolaee, Erfan Babaee
    Abbasian, Parvin
    Soltani, Mehdi
    Ghaffarian, Seyed Ali
    [J]. WASTE MANAGEMENT & RESEARCH, 2019, 37 (1_suppl) : 4 - 13
  • [40] TSILIGIRIDES T, 1984, J OPER RES SOC, V35, P797, DOI 10.2307/2582629