A general assembly sequence planning algorithm based on hierarchical reinforcement learning

被引:0
|
作者
Zhao M.-H. [1 ,2 ]
Zhang X.-B. [1 ,2 ]
Guo X. [1 ,2 ]
Ou Y.-S. [3 ]
机构
[1] Institute of Robotics and Automatic Information System, Nankai University, Tianjin
[2] Key Laboratory of Intelligent Robotics, Tianjin
[3] Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen
来源
Kongzhi yu Juece/Control and Decision | 2022年 / 37卷 / 04期
关键词
Assembly sequence planning; Deep reinforcement learning; Hierarchical reinforcement learning; Intelligent assembly; Multi-configuration; Target-oriented;
D O I
10.13195/j.kzyjc.2020.1289
中图分类号
学科分类号
摘要
For assembly sequence planning problems, most of the existing algorithms focus on a single target configuration. For multi-target configurations and large-scale problems, existing algorithms often have dimension disaster problems with poor generalization ability. Therefore, this paper uses the characteristics of the hierarchical structure of assembly sequence planning problems and conducts a general assembly sequence planning method based on hierarchical reinforcement learning, which is suitable for multi-configuration assembly tasks. First of all, this paper constructs the assembly sequence planning problem as a hierarchical Markov decision process, in which the upper layer performs sequence planning, and the lower layer carries out workpiece motion planning, which conforms to the hierarchical structure of the assembly process, making the planning method more flexible and interpretable. Then, in view of the hierarchical Markov decision process, this paper proposes a general assembly sequence planning algorithm based on hierarchical reinforcement learning, which improves the adaptability and generalization ability of the planning method to multi-target tasks and the information utilization of the target configuration. Finally, the proposed method is verified on the built simulation platform. The results show that the proposed method can extract general information about assembly problems, and it has good decision-making ability for any initial state and other various configurations assembly tasks, which verifies the effectiveness and flexibility of the method. Thus, a more general and flexible assembly sequence planning algorithm suitable for multiple configurations is realized. Copyright ©2022 Control and Decision.
引用
收藏
页码:861 / 870
页数:9
相关论文
共 27 条
  • [1] Wang X W., Interpretation of made in China 2025, pp. 5-10, (2015)
  • [2] Liu W, Liu F, Ni Y Y, Et al., Application research of intelligent assembly technology for complicate products, Astronautical Systems Engineering Technology, 2, 1, pp. 33-36, (2018)
  • [3] Wan X Q, Yan H S, Wang Z., Scheduling and self-reconfiguration of an aircraft engine assembly line in knowledgeable manufacturing, Acta Automatica Sinica, 41, 1, pp. 136-146, (2015)
  • [4] Xu L D, Wang C G, Bi Z M, Et al., AutoAssem: An automated assembly planning system for complex products, IEEE Transactions on Industrial Informatics, 8, 3, pp. 669-678, (2012)
  • [5] Jones R E, Wilson R H, Calton T L., On constraints in assembly planning, IEEE Transactions on Robotics and Automation, 14, 6, pp. 849-863, (1998)
  • [6] Homem de Mello L S, Sanderson A C., Representations of mechanical assembly sequences, IEEE Transactions on Robotics and Automation, 7, 2, pp. 211-227, (1991)
  • [7] Homem de Mello L S, Sanderson A C., A correct and complete algorithm for the generation of mechanical assembly sequences, IEEE Transactions on Robotics and Automation, 7, 2, pp. 228-240, (1991)
  • [8] Karjalainen I, Xing Y, Chen G, Et al., Assembly sequence planning of automobile body components based on liaison graph, Assembly Automation, 27, 2, pp. 157-164, (2007)
  • [9] Li R, Fu Y L, Feng H B., Assembly sequence planning based on connecter-structure knowledge, Computer Integrated Manufacturing Systems, 14, 6, pp. 1130-1135, (2008)
  • [10] Kashkoush M, ElMaraghy H., Knowledge-based model for constructing master assembly sequence, Journal of Manufacturing Systems, 34, pp. 43-52, (2015)