Automated guided vehicle dispatching and routing integration via digital twin with deep reinforcement learning

被引：20

作者：

Zhang, Lixiang ^{[1
]}

Yang, Chen ^{[2
]}

Yan, Yan ^{[1
]}

Cai, Ze ^{[1
]}

Hu, Yaoguang ^{[1
]}

机构：

[1] Beijing Inst Technol, Lab Ind & Intelligent Syst Engn, Beijing 100081, Peoples R China

[2] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing 100081, Peoples R China

来源：

JOURNAL OF MANUFACTURING SYSTEMS | 2024年 / 72卷

基金：

中国国家自然科学基金;

关键词：

Dispatching; Routing; Digital twin; Reinforcement learning; Automated guided vehicle; INDUSTRY; 4.0; ALGORITHM;

D O I：

10.1016/j.jmsy.2023.12.008

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

The manufacturing industry has witnessed a significant shift towards high flexibility and adaptability, driven by personalized demands. However, automated guided vehicle (AGV) dispatching optimization is still challenging when considering AGV routing with the spatial -temporal and kinematics constraints in intelligent production logistics systems, limiting the evolving industry applications. Against this backdrop, this paper presents a digital twin (DT) -enhanced deep reinforcement learning -based optimization framework to integrate AGV dispatching and routing at both horizontal and vertical levels. First, the proposed framework leverages a digital twin model of the shop floor to provide a simulation environment that closely mimics the actual manufacturing process, enabling the AGV dispatching agent to be trained in a realistic setting, thus reducing the risk of finding unrealistic solutions under specific shop-floor settings and preventing time-consuming trial -and -error processes. Then, the AGV dispatching with the routing problem is modeled as a Markov Decision Process to optimize tardiness and energy consumption. An improved dueling double deep Q network algorithm with count -based exploration is developed to learn a better -dispatching policy by interacting with the high-fidelity DT model that integrates a static path planning agent using A* and a dynamic collision avoidance agent using a deep deterministic policy gradient to prevent the congestion and deadlock. Experimental results show that our method outperforms four state-of-the-art methods with shorter tardiness, lower energy consumption, and better stability. The proposed method provides significant potential to utilize the digital twin and reinforcement learning in the decision -making and optimization of manufacturing processes.

引用

页码：492 / 503

页数：12

共 41 条

[11] Digital twin-driven deep reinforcement learning for adaptive task allocation in robotic construction [J].

Lee, Dongmin ;

Lee, SangHyun ;

Masoud, Neda ;

Krishnan, M. S. ;

Li, Victor C. .

ADVANCED ENGINEERING INFORMATICS, 2022, 53

[12] A mechanism for scheduling multi robot intelligent warehouse system face with dynamic demand [J].

Li, Zhi ;

Barenji, Ali Vatankhah ;

Jiang, Jiazhi ;

Zhong, Ray Y. ;

Xu, Gangyan .

JOURNAL OF INTELLIGENT MANUFACTURING, 2020, 31 (02) :469-480

[13] Invasive weed optimization for multi-AGVs dispatching problem in a matrix manufacturing workshop [J].

Li, Zhong-Kai ;

Sang, Hong-Yan ;

Li, Jun-Qing ;

Han, Yu-Yan ;

Gao, Kai-Zhou ;

Zheng, Zhi-Xin ;

Liu, Li-li .

SWARM AND EVOLUTIONARY COMPUTATION, 2023, 77

[14] Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay [J].

Luo, Biao ;

Yang, Yin ;

Liu, Derong .

IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (12) :3337-3348

[15] Human-level control through deep reinforcement learning [J].

Mnih, Volodymyr ;

Kavukcuoglu, Koray ;

Silver, David ;

Rusu, Andrei A. ;

Veness, Joel ;

Bellemare, Marc G. ;

Graves, Alex ;

Riedmiller, Martin ;

Fidjeland, Andreas K. ;

Ostrovski, Georg ;

Petersen, Stig ;

Beattie, Charles ;

Sadik, Amir ;

Antonoglou, Ioannis ;

King, Helen ;

Kumaran, Dharshan ;

Wierstra, Daan ;

Legg, Shane ;

Hassabis, Demis .

NATURE, 2015, 518 (7540) :529-533

[16]

Müller-Zhang Z, 2020, IEEE INT C EMERG, P1757, DOI [10.1109/ETFA46521.2020.9211946, 10.1109/etfa46521.2020.9211946]

[17]

Nazari M, 2018, ADV NEUR IN, V31

[18] Field-synchronized Digital Twin framework for production scheduling with uncertainty [J].

Negri, Elisa ;

Pandhare, Vibhor ;

Cattaneo, Laura ;

Singh, Jaskaran ;

Macchi, Marco ;

Lee, Jay .

JOURNAL OF INTELLIGENT MANUFACTURING, 2021, 32 (04) :1207-1228

[19] Petri Net Decomposition Approach for Dispatching and Conflict-Free Routing of Bidirectional Automated Guided Vehicle Systems [J].

Nishi, Tatsushi ;

Tanaka, Yuki .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2012, 42 (05) :1230-1243

[20] A bilevel decomposition algorithm for simultaneous production scheduling and conflict-free routing for automated guided vehicles [J].

Nishi, Tatsushi ;

Hiranaka, Yuichiro ;

Grossmann, Ignacio E. .

COMPUTERS & OPERATIONS RESEARCH, 2011, 38 (05) :876-888

← 1 2 3 4 5 →