Reinforcement learning for the traveling salesman problem with refueling

被引:33
作者
Ottoni, Andre L. C. [1 ]
Nepomuceno, Erivelton G. [2 ]
Oliveira, Marcos S. de [3 ]
Oliveira, Daniela C. R. de [3 ]
机构
[1] Fed Univ Reconcavo Bahia UFRB, Technol & Exact Ctr, Cruz Das Almas, Brazil
[2] Fed Univ Sao Joao Del Rei UFSJ, Dept Elect Engn, Control & Modelling Grp GCOM, Sao Joao Del Rei, Brazil
[3] Fed Univ Sao Joao Del Rei UFSJ, Dept Math & Stat, Sao Joao Del Rei, Brazil
关键词
Reinforcement learning; Traveling salesman with refueling problem; Tuning of parameters; DECISION-SUPPORT-SYSTEM; MOBILE ROBOT; GENETIC ALGORITHM; VEHICLE; OPTIMIZATION; SOLVE; TIME; CARRIERS;
D O I
10.1007/s40747-021-00444-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The traveling salesman problem (TSP) is one of the best-known combinatorial optimization problems. Many methods derived from TSP have been applied to study autonomous vehicle route planning with fuel constraints. Nevertheless, less attention has been paid to reinforcement learning (RL) as a potential method to solve refueling problems. This paper employs RL to solve the traveling salesman problem With refueling (TSPWR). The technique proposes a model (actions, states, reinforcements) and RL-TSPWR algorithm. Focus is given on the analysis of RL parameters and on the refueling influence in route learning optimization of fuel cost. Two RL algorithms: Q-learning and SARSA are compared. In addition, RL parameter estimation is performed by Response Surface Methodology, Analysis of Variance and Tukey Test. The proposed method achieves the best solution in 15 out of 16 case studies.
引用
收藏
页码:2001 / 2015
页数:15
相关论文
共 82 条
[1]   A hybrid algorithm using a genetic algorithm and multiagent reinforcement learning heuristic to solve the traveling salesman problem [J].
Alipour, Mir Mohammad ;
Razavi, Seyed Naser ;
Derakhshi, Mohammad Reza Feizi ;
Balafar, Mohammad Ali .
NEURAL COMPUTING & APPLICATIONS, 2018, 30 (09) :2935-2951
[2]   A new multiagent reinforcement learning algorithm to solve the symmetric traveling salesman problem [J].
Alipour, Mir Mohammad ;
Razavi, Seyed Naser .
MULTIAGENT AND GRID SYSTEMS, 2015, 11 (02) :107-119
[3]  
[Anonymous], 2014, ARTIF INTELL APPL
[4]  
Applegate David L., 2011, TRAVELING SALESMAN P
[5]   Integrating estimation of distribution algorithms versus Q-learning into Meta-RaPS for solving the 0-1 multidimensional knapsack problem [J].
Arin, Arif ;
Rabadi, Ghaith .
COMPUTERS & INDUSTRIAL ENGINEERING, 2017, 112 :706-720
[6]   Heuristically Accelerated Reinforcement Learning by Means of Case-Based Reasoning and Transfer Learning [J].
Bianchi, Reinaldo A. C. ;
Santos, Paulo E. ;
da Silva, Isaac J. ;
Celiberto, Luiz A., Jr. ;
de Mantaras, Ramon Lopez .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2018, 91 (02) :301-312
[7]  
Bianchi ReinaldoA.C., 2009, 1st International Workshop on Hybrid Control of Autonomous System, P49
[8]  
BODIN L, 1983, COMPUT OPER RES, V10, P63, DOI 10.1016/0305-0548(83)90030-8
[9]   Evaluation of the size of time windows for the travelling salesman problem in delivery operations [J].
Budak, Gercek ;
Chen, Xin .
COMPLEX & INTELLIGENT SYSTEMS, 2020, 6 (03) :681-695
[10]   Learning Navigation Behaviors End-to-End With AutoRL [J].
Chiang, Hao-Tien Lewis ;
Faust, Aleksandra ;
Fiser, Marek ;
Francis, Anthony .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02) :2007-2014