Parallel hyper heuristic algorithm based on reinforcement learning for the corridor allocation problem and parallel row ordering problem

被引:19
作者
Liu, Junqi [1 ,2 ]
Zhang, Zeqiang [1 ,2 ]
Liu, Silu [1 ,2 ]
Zhang, Yu [1 ,2 ]
Wu, Tengfei [1 ,2 ]
机构
[1] Southwest Jiaotong Univ, Sch Mech Engn, Chengdu 610031, Peoples R China
[2] Technol & Equipment Rail Transit Operat & Maintena, Chengdu 610031, Peoples R China
基金
中国国家自然科学基金;
关键词
Corridor allocation problem; Parallel row ordering problem; Hyper heuristics; Reinforcement learning; Combinatorial optimisation; FACILITY LAYOUT PROBLEMS; SEARCH; OPTIMIZATION;
D O I
10.1016/j.aei.2023.101977
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hyper heuristics is a relatively new optimisation algorithm. Numerous studies have reported that hyper heu-ristics are well applied in combinatorial optimisation problems. As a classic combinatorial optimisation problem, the row layout problem has not been publicly reported on applying hyper heuristics to its various sub-problems. To fill this gap, this study proposes a parallel hyper-heuristic approach based on reinforcement learning for corridor allocation problems and parallel row ordering problems. For the proposed algorithm, an outer layer parallel computing framework was constructed based on the encoding of the problem. The simulated annealing, tabu search, and variable neighbourhood algorithms were used in the algorithm as low-level heuristic operations, and Q-learning in reinforcement learning was used as a high-level strategy. A state space containing sequences and fitness values was designed. The algorithm performance was then evaluated for benchmark instances of the corridor allocation problem (37 groups) and parallel row ordering problem (80 groups). The results showed that, in most cases, the proposed algorithm provided a better solution than the best-known solutions in the literature. Finally, the meta-heuristic algorithm applied to three low-level heuristic operations is taken as three independent algorithms and compared with the proposed hyper-heuristic algorithm on four groups of parallel row ordering problem instances. The effectiveness of Q-learning in selection is illustrated by analysing the comparison results of the four algorithms and the number of calls of the three low-level heuristic operations in the proposed method.
引用
收藏
页数:17
相关论文
共 93 条
[1]   An evaluation of Monte Carlo-based hyper-heuristic for interaction testing of industrial embedded software applications [J].
Ahmed, Bestoun S. ;
Enoiu, Eduard ;
Afzal, Wasif ;
Zamli, Kamal Z. .
SOFT COMPUTING, 2020, 24 (18) :13929-13954
[2]   Simulated annealing and tabu search approaches for the Corridor Allocation Problem [J].
Ahonen, H. ;
De Alvarenga, A.G. ;
Amaral, A.R.S. .
European Journal of Operational Research, 2014, 232 (01) :221-233
[3]   A parallel ordering problem in facilities layout [J].
Amaral, Andre R. S. .
COMPUTERS & OPERATIONS RESEARCH, 2013, 40 (12) :2930-2939
[4]   The corridor allocation problem [J].
Amaral, Andre R. S. .
COMPUTERS & OPERATIONS RESEARCH, 2012, 39 (12) :3325-3330
[5]  
Anjos MF., 2021, Facility layout: mathematical optimization techniques and engineering applications, DOI [10.1007/978-3-030-70990-7, DOI 10.1007/978-3-030-70990-7]
[6]   Mathematical optimization approach for facility layout on several rows [J].
Anjos, Miguel F. ;
Vieira, Manuel V. C. .
OPTIMIZATION LETTERS, 2021, 15 (01) :9-23
[7]   Mathematical optimization approaches for facility layout problems: The state-of-the-art and future research directions [J].
Anjos, Miguel F. ;
Vieira, Manuel V. C. .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2017, 261 (01) :1-16
[8]   A tensor based hyper-heuristic for nurse rostering [J].
Asta, Shahriar ;
Ozcan, Ender ;
Curtois, Tim .
KNOWLEDGE-BASED SYSTEMS, 2016, 98 :185-199
[9]   A hyper-heuristic approach to sequencing by hybridization of DNA sequences [J].
Blazewicz, Jacek ;
Burke, Edmund K. ;
Kendall, Graham ;
Mruczkiewicz, Wojciech ;
Oguz, Ceyda ;
Swiercz, Aleksandra .
ANNALS OF OPERATIONS RESEARCH, 2013, 207 (01) :27-41
[10]   Online monitoring and collaborative scheduling method for wheelset cyber-physical production system: A wheelset manufacturing system case study from a Chinese high-speed train enterprise [J].
Bo, Hongguang ;
Han, Peng ;
Lu, Bo ;
Zhao, Can ;
Wang, Xingmian .
ADVANCED ENGINEERING INFORMATICS, 2021, 47