Anti-conflict AGV path planning in automated container terminals based on multi-agent reinforcement learning

被引:71
作者
Hu, Hongtao [1 ]
Yang, Xurui [2 ]
Xiao, Shichang [1 ]
Wang, Feiyang [1 ]
机构
[1] Shanghai Maritime Univ, Sch Logist Engn, Shanghai, Peoples R China
[2] Shanghai Maritime Univ, Inst Logist Sci & Engn, Shanghai 201306, Peoples R China
基金
中国国家自然科学基金;
关键词
Automated container terminal; AGV path planning; Anti-conflict; reinforcement learning; policy gradient; GUIDED VEHICLES; PREVENTION; SYSTEMS;
D O I
10.1080/00207543.2021.1998695
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
AGV conflict prevention path planning is a key factor to improve transportation cost and operation efficiency of the container terminal. This paper studies the anti-conflict path planning problem of Automated Guided Vehicle (AGV) in the horizontal transportation area of the Automated Container Terminals (ACTs). According to the characteristics of magnetic nail guided AGVs, a node network is constructed. Through the analysis of two conflict situations, namely the opposite conflict situation and same point occupation conflict situation, an integer programming model is established to obtain the shortest path. The Multi-Agent Deep Deterministic Policy Gradient (MADDPG) method is proposed to solve the problem, and the Gumbel-Softmax strategy is applied to discretize the scenario created by the node network. A series of numerical experiments are conducted to verify the effectiveness and the efficiency of the model and the algorithm.
引用
收藏
页码:65 / 80
页数:16
相关论文
共 39 条
  • [1] Path and Speed Optimization for Conflict-Free Pickup and Delivery Under Time Windows
    Adamo, Tommaso
    Bektas, Tolga
    Ghiani, Gianpaolo
    Guerriero, Emanuele
    Manni, Emanuele
    [J]. TRANSPORTATION SCIENCE, 2018, 52 (04) : 739 - 755
  • [2] Space-time routing in dedicated automated vehicle zones
    An, Yunlong
    Li, Meng
    Lin, Xi
    He, Fang
    Yang, Haolin
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 120
  • [3] [Anonymous], 2016, ARXIV160601540
  • [4] A set-covering model for a bidirectional multi-shift full truckload vehicle routing problem
    Bai, Ruibin
    Xue, Ning
    Chen, Jianjun
    Roberts, Gethin Wyn
    [J]. TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2015, 79 : 134 - 148
  • [5] When Targets Strike Back: How Negative Workplace Gossip Triggers Political Acts by Employees
    Cheng, Bao
    Dong, Yun
    Zhang, Zhenduo
    Shaalan, Ahmed
    Guo, Gongxing
    Peng, Yan
    [J]. JOURNAL OF BUSINESS ETHICS, 2022, 175 (02) : 289 - 302
  • [6] Dynamic Self-Optimization of the Antenna Tilt for Best Trade-off Between Coverage and Capacity in Mobile Networks
    Dandanov, Nikolay
    Al-Shatri, Hussein
    Klein, Anja
    Poulkov, Vladimir
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2017, 92 (01) : 251 - 278
  • [7] Suboptimal and conflict-free control of a fleet of AGVs to serve online requests
    Drotos, Marton
    Gyorgyi, Peter
    Horvath, Marko
    Kis, Tamas
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 152
  • [8] Flow-shop path planning for multi-automated guided vehicles in intelligent textile spinning cyber-physical production systems dynamic environment
    Farooq, Basit
    Bao, Jinsong
    Raza, Hanan
    Sun, Yicheng
    Ma, Qingwen
    [J]. JOURNAL OF MANUFACTURING SYSTEMS, 2021, 59 : 98 - 116
  • [9] Methodologies to Optimize Automated Guided Vehicle Scheduling and Routing Problems: A Review Study
    Fazlollahtabar, Hamed
    Saidi-Mehrabad, Mohammad
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2015, 77 (3-4) : 525 - 545
  • [10] Foerster J, 2018, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), P122