Anti-conflict AGV path planning in automated container terminals based on multi-agent reinforcement learning

被引:91
作者
Hu, Hongtao [1 ]
Yang, Xurui [2 ]
Xiao, Shichang [1 ]
Wang, Feiyang [1 ]
机构
[1] Shanghai Maritime Univ, Sch Logist Engn, Shanghai, Peoples R China
[2] Shanghai Maritime Univ, Inst Logist Sci & Engn, Shanghai 201306, Peoples R China
基金
中国国家自然科学基金;
关键词
Automated container terminal; AGV path planning; Anti-conflict; reinforcement learning; policy gradient; GUIDED VEHICLES; PREVENTION; SYSTEMS;
D O I
10.1080/00207543.2021.1998695
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
AGV conflict prevention path planning is a key factor to improve transportation cost and operation efficiency of the container terminal. This paper studies the anti-conflict path planning problem of Automated Guided Vehicle (AGV) in the horizontal transportation area of the Automated Container Terminals (ACTs). According to the characteristics of magnetic nail guided AGVs, a node network is constructed. Through the analysis of two conflict situations, namely the opposite conflict situation and same point occupation conflict situation, an integer programming model is established to obtain the shortest path. The Multi-Agent Deep Deterministic Policy Gradient (MADDPG) method is proposed to solve the problem, and the Gumbel-Softmax strategy is applied to discretize the scenario created by the node network. A series of numerical experiments are conducted to verify the effectiveness and the efficiency of the model and the algorithm.
引用
收藏
页码:65 / 80
页数:16
相关论文
共 39 条
[1]   Path and Speed Optimization for Conflict-Free Pickup and Delivery Under Time Windows [J].
Adamo, Tommaso ;
Bektas, Tolga ;
Ghiani, Gianpaolo ;
Guerriero, Emanuele ;
Manni, Emanuele .
TRANSPORTATION SCIENCE, 2018, 52 (04) :739-755
[2]   Space-time routing in dedicated automated vehicle zones [J].
An, Yunlong ;
Li, Meng ;
Lin, Xi ;
He, Fang ;
Yang, Haolin .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 120
[3]   A set-covering model for a bidirectional multi-shift full truckload vehicle routing problem [J].
Bai, Ruibin ;
Xue, Ning ;
Chen, Jianjun ;
Roberts, Gethin Wyn .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2015, 79 :134-148
[4]  
Brockman Greg, 2016, ARXIV160601540
[5]   When Targets Strike Back: How Negative Workplace Gossip Triggers Political Acts by Employees [J].
Cheng, Bao ;
Dong, Yun ;
Zhang, Zhenduo ;
Shaalan, Ahmed ;
Guo, Gongxing ;
Peng, Yan .
JOURNAL OF BUSINESS ETHICS, 2022, 175 (02) :289-302
[6]   Dynamic Self-Optimization of the Antenna Tilt for Best Trade-off Between Coverage and Capacity in Mobile Networks [J].
Dandanov, Nikolay ;
Al-Shatri, Hussein ;
Klein, Anja ;
Poulkov, Vladimir .
WIRELESS PERSONAL COMMUNICATIONS, 2017, 92 (01) :251-278
[7]   Suboptimal and conflict-free control of a fleet of AGVs to serve online requests [J].
Drotos, Marton ;
Gyorgyi, Peter ;
Horvath, Marko ;
Kis, Tamas .
COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 152
[8]   Flow-shop path planning for multi-automated guided vehicles in intelligent textile spinning cyber-physical production systems dynamic environment [J].
Farooq, Basit ;
Bao, Jinsong ;
Raza, Hanan ;
Sun, Yicheng ;
Ma, Qingwen .
JOURNAL OF MANUFACTURING SYSTEMS, 2021, 59 :98-116
[9]   Methodologies to Optimize Automated Guided Vehicle Scheduling and Routing Problems: A Review Study [J].
Fazlollahtabar, Hamed ;
Saidi-Mehrabad, Mohammad .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2015, 77 (3-4) :525-545
[10]  
Foerster J, 2018, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), P122