Transferable multi-objective factory layout planning using simulation-based deep reinforcement learning

被引:0
作者
Klar, Matthias [1 ,3 ]
Schworm, Philipp [1 ]
Wu, Xiangqian [1 ]
Simon, Peter [1 ]
Glatt, Moritz [1 ]
Ravani, Bahram [2 ]
Aurich, Jan C. [1 ]
机构
[1] RPTU Kaiserslautern, Inst Mfg Technol & Prod Syst, Kaiserslautern, Germany
[2] Univ Calif Davis, Dept Mech & Aerosp Engn, Davis, CA USA
[3] POB 3049, D-67653 Kaiserslautern, Germany
关键词
Facility layout problem; Reinforcement learning; Multi -objective optimization; Discrete event simulation; Material flow; GENETIC ALGORITHM; DESIGN; OPTIMIZATION; SEARCH;
D O I
10.1016/j.jmsy.2024.04.007
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Factory layout planning aims at finding an optimized layout configuration under consideration of varying influences such as the material flow characteristics. Manual layout planning can be characterized as a complex decision-making process due to a large number of possible placement options. Automated planning approaches aim at reducing the manual planning effort by generating optimized layout variants in the early stages of layout planning. Recent developments have introduced deep Reinforcement Learning (RL) based planning approaches that allow to optimize a layout under consideration of a single optimization criterion. However, within layout planning, multiple partially conflicting planning objectives have to be considered. Such multiple objectives are not considered by existing RL-based approaches. This paper addresses this research gap by presenting a novel deep RL-based layout planning approach that allows consideration of multiple objectives for optimization. Furthermore, existing RL-based planning approaches only consider analytically formulated objectives such as the transportation distance. Consequently, dynamic influences in the material flow are neglected which can result in higher operational costs of the future factory. To address this issue, a discrete event simulation module is developed that allows simulating manufacturing and material flow processes simultaneously for any layout configuration generated by the RL approach. Consequently, the presented approach considers material flow simulation results for multi-objective optimization. To investigate the capabilities of RL-based factory layout planning, different RL architectures are compared based on a simplified application scenario. Throughput time, media supply, and material flow clarity are considered as optimization objectives. The best performing architecture is then applied to an exemplary application scenario and compared with the results obtained by a combined version of the genetic algorithm and tabu search, the non-dominated sorting genetic algorithm, and the optimal solution. Finally, two industrial planning scenarios, one focusing on brownfield and one on greenfield planning, are considered. The results show that the performance of RL compared to meta-heuristics depends on the considered computation time. With time the results generated by the RL approach exceed the quality of the best conventional solution by up to 11%. Finally, the potential of applying transfer learning is investigated for three different application scenarios. It is observed that RL can learn generalized patterns for factory layout planning, which allows to significantly reduce the required training time and can lead to improved solution quality. Thus, the use of pre-trained RL models shows a substantial performance potential for automated factory layout planning which cannot be achieved with conventional automated planning approaches.
引用
收藏
页码:487 / 511
页数:25
相关论文
共 80 条
  • [21] Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
  • [22] Grundig C.-G, 2021, Fabrikplanung: Planungssystematik. Methoden. Anwendungen
  • [23] Multi-objective particle swarm optimization for multi-workshop facility layout problem
    Guan, Chao
    Zhang, Zeqiang
    Liu, Silu
    Gong, Juhua
    [J]. JOURNAL OF MANUFACTURING SYSTEMS, 2019, 53 : 32 - 48
  • [24] A practical guide to multi-objective reinforcement learning and planning
    Hayes, Conor F.
    Radulescu, Roxana
    Bargiacchi, Eugenio
    Kallstrom, Johan
    Macfarlane, Matthew
    Reymond, Mathieu
    Verstraeten, Timothy
    Zintgraf, Luisa M.
    Dazeley, Richard
    Heintz, Fredrik
    Howley, Enda
    Irissappane, Athirai A.
    Mannion, Patrick
    Nowe, Ann
    Ramos, Gabriel
    Restelli, Marcello
    Vamplew, Peter
    Roijers, Diederik M.
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2022, 36 (01)
  • [25] Heinbach B, 2024, Oper Res Forum, V5, DOI [10.1007/s43069-024-00301-3, DOI 10.1007/S43069-024-00301-3]
  • [26] Deep reinforcement learning for layout planning - An MDP-based approach for the facility layout problem
    Heinbach, Benjamin
    Burggraef, Peter
    Wagner, Johannes
    [J]. MANUFACTURING LETTERS, 2023, 38 : 40 - 43
  • [27] Hessel M., 2017, ARXIV
  • [28] GENETIC ALGORITHMS
    HOLLAND, JH
    [J]. SCIENTIFIC AMERICAN, 1992, 267 (01) : 66 - 72
  • [29] Classification of facility layout problems: a review study
    Hosseini-Nasab, Hasan
    Fereidouni, Sepideh
    Ghomi, Seyyed Mohammad Taghi Fatemi
    Fakhrzad, Mohammad Bagher
    [J]. INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2018, 94 (1-4) : 957 - 977
  • [30] Anti-conflict AGV path planning in automated container terminals based on multi-agent reinforcement learning
    Hu, Hongtao
    Yang, Xurui
    Xiao, Shichang
    Wang, Feiyang
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2023, 61 (01) : 65 - 80