Self-Adaptive Traffic Control Model With Behavior Trees and Reinforcement Learning for AGV in Industry 4.0

被引:38
作者
Hu, Hao [1 ]
Jia, Xiaoliang [1 ]
Liu, Kuo [1 ]
Sun, Bingyang [1 ]
机构
[1] Northwestern Polytech Univ, Sch Mech Engn, Dept Mech Engn & Automat, Xian 710072, Peoples R China
基金
美国国家科学基金会;
关键词
Traffic control; Industries; Production; Adaptation models; Decision making; Task analysis; Robots; Automated guided vehicle (AGV); behavior trees (BTs); Industrial; 4; 0; reinforcement learning (RL); self-adaptive control; SYSTEM; DESIGN; GAME; AI;
D O I
10.1109/TII.2021.3059676
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automated guided vehicles (AGVs) are considered as an enabling technology to realize smart manufacturing in the upcoming Industrial 4.0 era. However, several challenges including efficiency, timeliness, and safety still exist in AGVs system in discrete manufacturing shopfloor. To address these challenges, a self-adaptive traffic control model combining behavior trees (BTs) and reinforcement learning (RL) is proposed to implement optimal decisions according to diverse, dynamic and complex situations in Industry 4.0 environments. A cyber-physical systems using multiagent system technology is designed in which components such as AGVs and traffic commander are defined as specific agent that cooperates autonomously with each other. Then, the behavior construction model is constructed by BTs to enumerate all the possible states in AGVs traffic control. An RL model is further developed based on the BTs. By using this approach, in this article, AGVs have the ability to adaptively choose the optimal rule-based strategy from existing optional strategies. The case study of the scenario avoiding collisions at intersections illustrates that the proposed model can enhance self-adaptive capability of AGVs traffic control and simultaneously guarantees efficiency, timeliness, and safety.
引用
收藏
页码:7968 / 7979
页数:12
相关论文
共 40 条
[1]   Robot soccer control using behaviour trees and fuzzy logic [J].
Abiyev, Rahib H. ;
Gunsel, Irfan ;
Akkaya, Nurullah ;
Aytac, Ersin ;
Cagman, Ahmet ;
Abizada, Sanan .
12TH INTERNATIONAL CONFERENCE ON APPLICATION OF FUZZY SYSTEMS AND SOFT COMPUTING, ICAFS 2016, 2016, 102 :477-484
[2]   Decentralized autonomous AGV system for material handling [J].
Berman, S ;
Edan, Y .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2002, 40 (15) :3995-4006
[3]   Evolutionary Dynamics of Multi-Agent Learning: A Survey [J].
Bloembergen, Daan ;
Tuyls, Karl ;
Hennes, Daniel ;
Kaisers, Michael .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2015, 53 :659-697
[4]   Hierarchical Bayesian Inverse Reinforcement Learning [J].
Choi, Jaedeug ;
Kim, Kee-Eung .
IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (04) :793-805
[5]   Decentralized Motion Planning and Scheduling of AGVs in an FMS [J].
Demesure, Guillaume ;
Defoort, Michael ;
Bekrar, Abdelghani ;
Trentesaux, Damien ;
Djemai, Mohamed .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (04) :1744-1752
[6]   The dynamic window approach to collision avoidance [J].
Fox, D ;
Burgard, W ;
Thrun, S .
IEEE ROBOTICS & AUTOMATION MAGAZINE, 1997, 4 (01) :23-33
[7]   A dynamic-zone strategy for vehicle-collision prevention and load balancing in an AGV system with a single-loop guide path [J].
Ho, YC .
COMPUTERS IN INDUSTRY, 2000, 42 (2-3) :159-176
[8]   Motion Segmentation and Balancing for a Biped Robot's Imitation Learning [J].
Hwang, Kao-Shing ;
Jiang, Wei-Cheng ;
Chen, Yu-Jen ;
Shi, Haobin .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (03) :1099-1108
[9]   Data-Driven Flotation Industrial Process Operational Optimal Control Based on Reinforcement Learning [J].
Jiang, Yi ;
Fan, Jialu ;
Chai, Tianyou ;
Li, Jinna ;
Lewis, Frank L. .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (05) :1974-1989
[10]   Reinforcement learning: A survey [J].
Kaelbling, LP ;
Littman, ML ;
Moore, AW .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :237-285