Automated guided vehicle dispatching and routing integration via digital twin with deep reinforcement learning

被引:20
作者
Zhang, Lixiang [1 ]
Yang, Chen [2 ]
Yan, Yan [1 ]
Cai, Ze [1 ]
Hu, Yaoguang [1 ]
机构
[1] Beijing Inst Technol, Lab Ind & Intelligent Syst Engn, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Dispatching; Routing; Digital twin; Reinforcement learning; Automated guided vehicle; INDUSTRY; 4.0; ALGORITHM;
D O I
10.1016/j.jmsy.2023.12.008
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The manufacturing industry has witnessed a significant shift towards high flexibility and adaptability, driven by personalized demands. However, automated guided vehicle (AGV) dispatching optimization is still challenging when considering AGV routing with the spatial -temporal and kinematics constraints in intelligent production logistics systems, limiting the evolving industry applications. Against this backdrop, this paper presents a digital twin (DT) -enhanced deep reinforcement learning -based optimization framework to integrate AGV dispatching and routing at both horizontal and vertical levels. First, the proposed framework leverages a digital twin model of the shop floor to provide a simulation environment that closely mimics the actual manufacturing process, enabling the AGV dispatching agent to be trained in a realistic setting, thus reducing the risk of finding unrealistic solutions under specific shop-floor settings and preventing time-consuming trial -and -error processes. Then, the AGV dispatching with the routing problem is modeled as a Markov Decision Process to optimize tardiness and energy consumption. An improved dueling double deep Q network algorithm with count -based exploration is developed to learn a better -dispatching policy by interacting with the high-fidelity DT model that integrates a static path planning agent using A* and a dynamic collision avoidance agent using a deep deterministic policy gradient to prevent the congestion and deadlock. Experimental results show that our method outperforms four state-of-the-art methods with shorter tardiness, lower energy consumption, and better stability. The proposed method provides significant potential to utilize the digital twin and reinforcement learning in the decision -making and optimization of manufacturing processes.
引用
收藏
页码:492 / 503
页数:12
相关论文
共 41 条
[11]   Digital twin-driven deep reinforcement learning for adaptive task allocation in robotic construction [J].
Lee, Dongmin ;
Lee, SangHyun ;
Masoud, Neda ;
Krishnan, M. S. ;
Li, Victor C. .
ADVANCED ENGINEERING INFORMATICS, 2022, 53
[12]   A mechanism for scheduling multi robot intelligent warehouse system face with dynamic demand [J].
Li, Zhi ;
Barenji, Ali Vatankhah ;
Jiang, Jiazhi ;
Zhong, Ray Y. ;
Xu, Gangyan .
JOURNAL OF INTELLIGENT MANUFACTURING, 2020, 31 (02) :469-480
[13]   Invasive weed optimization for multi-AGVs dispatching problem in a matrix manufacturing workshop [J].
Li, Zhong-Kai ;
Sang, Hong-Yan ;
Li, Jun-Qing ;
Han, Yu-Yan ;
Gao, Kai-Zhou ;
Zheng, Zhi-Xin ;
Liu, Li-li .
SWARM AND EVOLUTIONARY COMPUTATION, 2023, 77
[14]   Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay [J].
Luo, Biao ;
Yang, Yin ;
Liu, Derong .
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (12) :3337-3348
[15]   Human-level control through deep reinforcement learning [J].
Mnih, Volodymyr ;
Kavukcuoglu, Koray ;
Silver, David ;
Rusu, Andrei A. ;
Veness, Joel ;
Bellemare, Marc G. ;
Graves, Alex ;
Riedmiller, Martin ;
Fidjeland, Andreas K. ;
Ostrovski, Georg ;
Petersen, Stig ;
Beattie, Charles ;
Sadik, Amir ;
Antonoglou, Ioannis ;
King, Helen ;
Kumaran, Dharshan ;
Wierstra, Daan ;
Legg, Shane ;
Hassabis, Demis .
NATURE, 2015, 518 (7540) :529-533
[16]  
Müller-Zhang Z, 2020, IEEE INT C EMERG, P1757, DOI [10.1109/ETFA46521.2020.9211946, 10.1109/etfa46521.2020.9211946]
[17]  
Nazari M, 2018, ADV NEUR IN, V31
[18]   Field-synchronized Digital Twin framework for production scheduling with uncertainty [J].
Negri, Elisa ;
Pandhare, Vibhor ;
Cattaneo, Laura ;
Singh, Jaskaran ;
Macchi, Marco ;
Lee, Jay .
JOURNAL OF INTELLIGENT MANUFACTURING, 2021, 32 (04) :1207-1228
[19]   Petri Net Decomposition Approach for Dispatching and Conflict-Free Routing of Bidirectional Automated Guided Vehicle Systems [J].
Nishi, Tatsushi ;
Tanaka, Yuki .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2012, 42 (05) :1230-1243
[20]   A bilevel decomposition algorithm for simultaneous production scheduling and conflict-free routing for automated guided vehicles [J].
Nishi, Tatsushi ;
Hiranaka, Yuichiro ;
Grossmann, Ignacio E. .
COMPUTERS & OPERATIONS RESEARCH, 2011, 38 (05) :876-888