Self-Adaptive Traffic Control Model With Behavior Trees and Reinforcement Learning for AGV in Industry 4.0

被引:34
作者
Hu, Hao [1 ]
Jia, Xiaoliang [1 ]
Liu, Kuo [1 ]
Sun, Bingyang [1 ]
机构
[1] Northwestern Polytech Univ, Sch Mech Engn, Dept Mech Engn & Automat, Xian 710072, Peoples R China
基金
美国国家科学基金会;
关键词
Traffic control; Industries; Production; Adaptation models; Decision making; Task analysis; Robots; Automated guided vehicle (AGV); behavior trees (BTs); Industrial; 4; 0; reinforcement learning (RL); self-adaptive control; SYSTEM; DESIGN; GAME; AI;
D O I
10.1109/TII.2021.3059676
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automated guided vehicles (AGVs) are considered as an enabling technology to realize smart manufacturing in the upcoming Industrial 4.0 era. However, several challenges including efficiency, timeliness, and safety still exist in AGVs system in discrete manufacturing shopfloor. To address these challenges, a self-adaptive traffic control model combining behavior trees (BTs) and reinforcement learning (RL) is proposed to implement optimal decisions according to diverse, dynamic and complex situations in Industry 4.0 environments. A cyber-physical systems using multiagent system technology is designed in which components such as AGVs and traffic commander are defined as specific agent that cooperates autonomously with each other. Then, the behavior construction model is constructed by BTs to enumerate all the possible states in AGVs traffic control. An RL model is further developed based on the BTs. By using this approach, in this article, AGVs have the ability to adaptively choose the optimal rule-based strategy from existing optional strategies. The case study of the scenario avoiding collisions at intersections illustrates that the proposed model can enhance self-adaptive capability of AGVs traffic control and simultaneously guarantees efficiency, timeliness, and safety.
引用
收藏
页码:7968 / 7979
页数:12
相关论文
共 40 条
  • [1] Robot soccer control using behaviour trees and fuzzy logic
    Abiyev, Rahib H.
    Gunsel, Irfan
    Akkaya, Nurullah
    Aytac, Ersin
    Cagman, Ahmet
    Abizada, Sanan
    [J]. 12TH INTERNATIONAL CONFERENCE ON APPLICATION OF FUZZY SYSTEMS AND SOFT COMPUTING, ICAFS 2016, 2016, 102 : 477 - 484
  • [2] Deep learning
    LeCun, Yann
    Bengio, Yoshua
    Hinton, Geoffrey
    [J]. NATURE, 2015, 521 (7553) : 436 - 444
  • [3] Decentralized autonomous AGV system for material handling
    Berman, S
    Edan, Y
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2002, 40 (15) : 3995 - 4006
  • [4] Evolutionary Dynamics of Multi-Agent Learning: A Survey
    Bloembergen, Daan
    Tuyls, Karl
    Hennes, Daniel
    Kaisers, Michael
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2015, 53 : 659 - 697
  • [5] Hierarchical Bayesian Inverse Reinforcement Learning
    Choi, Jaedeug
    Kim, Kee-Eung
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (04) : 793 - 805
  • [6] Decentralized Motion Planning and Scheduling of AGVs in an FMS
    Demesure, Guillaume
    Defoort, Michael
    Bekrar, Abdelghani
    Trentesaux, Damien
    Djemai, Mohamed
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (04) : 1744 - 1752
  • [7] The dynamic window approach to collision avoidance
    Fox, D
    Burgard, W
    Thrun, S
    [J]. IEEE ROBOTICS & AUTOMATION MAGAZINE, 1997, 4 (01) : 23 - 33
  • [8] A dynamic-zone strategy for vehicle-collision prevention and load balancing in an AGV system with a single-loop guide path
    Ho, YC
    [J]. COMPUTERS IN INDUSTRY, 2000, 42 (2-3) : 159 - 176
  • [9] Motion Segmentation and Balancing for a Biped Robot's Imitation Learning
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Chen, Yu-Jen
    Shi, Haobin
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (03) : 1099 - 1108
  • [10] Data-Driven Flotation Industrial Process Operational Optimal Control Based on Reinforcement Learning
    Jiang, Yi
    Fan, Jialu
    Chai, Tianyou
    Li, Jinna
    Lewis, Frank L.
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (05) : 1974 - 1989