Digital Twin-Driven Reinforcement Learning Method for Marine Equipment Vehicles Scheduling Problem

被引：6

作者：

Shen, Xingwang ^{[1
]}

Liu, Shimin ^{[2
]}

Zhou, Bin ^{[3
]}

Wu, Tao ^{[1
]}

Zhang, Qi ^{[1
]}

Bao, Jinsong ^{[1
]}

机构：

[1] Donghua Univ, Coll Mech Engn, Shanghai 201620, Peoples R China

[2] Hong Kong Polytech Univ, Dept Ind & Syst Engn, Hong Kong, Peoples R China

[3] Univ Shanghai Sci & Technol, Sch Mech Engn, Shanghai 200093, Peoples R China

来源：

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2024年 / 21卷 / 03期

关键词：

Digital twin; Q-learning; vehicle scheduling; marine equipment;

D O I：

10.1109/TASE.2023.3289915

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the traditional marine equipment construction process, the material transportation vehicle scheduling method dominated by manual experience has shown great limitations, which is inefficient, costly, wasteful of human resources, and unable to cope with complex and changing scheduling scenarios. The existing scheduling system cannot realize the information interaction and collaborative integration between the physical world and the virtual world, while the digital twin (DT) technology can effectively solve the problem of real-time information interaction and the reinforcement learning (RL) method can cope with dynamic scenarios. Therefore, this paper proposed a DT-driven RL method to solve the marine equipment vehicle scheduling problem. Given the dynamic nature of transportation tasks, the diversity of transported goods, and the optimization characteristics of transportation requirements, a framework for scheduling transportation vehicle operations based on DT is constructed, and a RL-based vehicle scheduling method in a dynamic task environment is proposed. A Markov decision process (MDP) model of the vehicle scheduling process is established to realize one-to-one mapping between information and physical elements. An improved RL method based on Q-learning is proposed to solve the MDP model, and the value function approximation and convergence enhancement methods are applied to optimize the solving process. Finally, a case study is used for example verification to prove the superiority and effectiveness of the proposed method in this paper. Note to Practitioners-The motivation of this paper is to optimize material transportation vehicle scheduling in dynamic task environments and to improve logistics transportation efficiency. Therefore, a DT-based vehicle scheduling method for marine equipment is proposed. Firstly, a framework of vehicle scheduling based on DT is designed to establish a MDP model of the vehicle scheduling process, and the dynamic task characteristics are described by mathematical methods in the design of the elements of the model. A RL-based vehicle scheduling method is proposed. The value function approximation method and the convergence enhancement method of the algorithm are investigated for the characteristics of continuous dynamic action features leading to huge state space and non-convergence of the algorithm. The algorithm performance is verified and analyzed through data validation of actual cases.

引用

页码：2173 / 2183

页数：11

共 35 条

[11] A digital thread-driven distributed collaboration mechanism between digital twin manufacturing units [J].

Liu, Shimin ;

Lu, Yuqian ;

Shen, Xingwang ;

Bao, Jinsong .

JOURNAL OF MANUFACTURING SYSTEMS, 2023, 68 :145-159

[12] A review of digital twin-driven machining: From digitization to intellectualization [J].

Liu, Shimin ;

Bao, Jinsong ;

Zheng, Pai .

JOURNAL OF MANUFACTURING SYSTEMS, 2023, 67 :361-378

[13] A digital twin-based sim-to-real transfer for deep reinforcement learning-enabled industrial robot grasping [J].

Liu, Yongkui ;

Xu, He ;

Liu, Ding ;

Wang, Lihui .

ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2022, 78

[14] Digital Twin-driven smart manufacturing: Connotation, reference model, applications and research issues [J].

Lu, Yuqian ;

Liu, Chao ;

Wang, Kevin I-Kai ;

Huang, Huiyue ;

Xu, Xun .

ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2020, 61

[15]

Müller-Zhang Z, 2020, IEEE INT C EMERG, P1757, DOI [10.1109/etfa46521.2020.9211946, 10.1109/ETFA46521.2020.9211946]

[16] A Knowledge-Based Two-Population Optimization Algorithm for Distributed Energy-Efficient Parallel Machines Scheduling [J].

Pan, Zixiao ;

Lei, Deming ;

Wang, Ling .

IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (06) :5051-5063

[17] Digital twin application with horizontal coordination for reinforcement-learning-based production control in a re-entrant job shop [J].

Park, Kyu Tae ;

Jeon, Seung-Woo ;

Noh, Sang Do .

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2022, 60 (07) :2151-2167

[18]

Shang Jingu, 2011, Journal of Wuhan University of Technology, V33, P72, DOI 10.3963/j.issn.1671-4431.2011.03.016

[19] Digital twin-based scheduling method for marine equipment material transportation vehicles [J].

Shen, Xingwang ;

Liu, Shimin ;

Zhou, Bin ;

Zheng, Yu ;

Bao, Jinsong .

2022 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2022, :100-105

[20] Intelligent material distribution and optimization in the assembly process of large offshore crane lifting equipment [J].

Shen, Xingwang ;

Liu, Shimin ;

Zhang, Can ;

Bao, Jinsong .

COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 159

← 1 2 3 4 →