A Hyperparameter Adaptive Genetic Algorithm Based on DQN

被引:12
作者
Zeng, Detian [1 ]
Yan, Tianwei [1 ]
Zeng, Zengri [1 ]
Liu, Hao [1 ]
Guan, Peiyuan [2 ]
机构
[1] Hunan Univ Humanities Sci & Technol, Informat Inst, Loudi 417700, Hunan, Peoples R China
[2] Univ Oslo, Dept Informat, POB 1072 Blindern, N-0316 Oslo, Norway
关键词
Deep reinforcement learning; genetic algorithm; path optimization; hyperparameter optimization; VEHICLE-ROUTING PROBLEM; SYSTEM;
D O I
10.1142/S0218126623500627
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The hyperparameters of the metaheuristic algorithm are difficult to determine when solving optimization problems. The existing methods mainly adjust hyperparameters through preset rules or traditional RL. The performance of the above methods is unsatisfactory and the generalization is poor. This work proposes a deep Q-learning network (DQN)-based dynamic setting framework for combinatorial hyperparameters, and applies it to a Genetic algorithm (GA) to improve its performance. By defining the four elements of the environment, state, action and reward required for learning strategy in advance, the parametrized strategy can be trained offline and different DQN models can be studied. Our method was compared with other algorithms and achieved the shortest path on 14 of 15 public TSP instances. Meanwhile, the test results on our simulation TSP validation dataset revealed that Category DQN achieved the best performance. This means the proposed method can effectively solve the problem of combinatorial hyperparameters setting, and bring more solving advantages to the GA.
引用
收藏
页数:24
相关论文
共 43 条
[1]   Aquila Optimizer: A novel meta-heuristic optimization algorithm [J].
Abualigah, Laith ;
Yousri, Dalia ;
Abd Elaziz, Mohamed ;
Ewees, Ahmed A. ;
Al-qaness, Mohammed A. A. ;
Gandomi, Amir H. .
COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 157 (157)
[2]  
Bellemare MG, 2017, PR MACH LEARN RES, V70
[3]  
Bye RT., 2021, Innovations in Computational Intelligence and Computer Vision, P529
[4]   Solving the Flexible Job Shop Scheduling Problem With Makespan Optimization by Using a Hybrid Taguchi-Genetic Algorithm [J].
Chang, Hao-Chin ;
Chen, Yeh-Peng ;
Liu, Tung-Kuan ;
Chou, Jyh-Horng .
IEEE ACCESS, 2015, 3 :1740-1754
[5]  
Chen F., 2005, Proceedings of the International Conference on Computational Intelligence for Modelling, Control and Automation, and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, V1, P1177
[6]   Reinforcement Learning-Based Genetic Algorithm in Optimizing Multidimensional Data Discretization Scheme [J].
Chen, Qiong ;
Huang, Mengxing ;
Xu, Qiannan ;
Wang, Hao ;
Wang, Jinghui .
MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
[7]   A self-learning genetic algorithm based on reinforcement learning for flexible job-shop scheduling problem [J].
Chen, Ronghua ;
Yang, Bo ;
Li, Shi ;
Wang, Shilong .
COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 149
[8]   Applying ACO to Large Scale TSP Instances [J].
Chitty, Darren M. .
ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, 2018, 650 :104-118
[9]  
Cowen-Rivers A. I, ARXIV
[10]   Learning Heuristics for the TSP by Policy Gradient [J].
Deudon, Michel ;
Cournut, Pierre ;
Lacoste, Alexandre ;
Adulyasak, Yossiri ;
Rousseau, Louis-Martin .
INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH, CPAIOR 2018, 2018, 10848 :170-181