Federated Discrete Reinforcement Learning for Automatic Guided Vehicle Control

Cited by: 2
Authors
Sierra-Garcia, J. Enrique [1]
Santos, Matilde [2]
Affiliations
[1] Univ Burgos, Electromech Engn Dept, Burgos 09006, Spain
[2] Univ Complutense Madrid, Inst Knowledge Technol, Madrid 28040, Spain
Source
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2024 / Vol. 150
Keywords
Automated guided vehicle (AGV); Federated learning; Industry 4.0; Intelligent control; Path following; Reinforcement learning; AGV
DOI
10.1016/j.future.2023.08.021
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Under the federated learning paradigm, agents learn in parallel and combine their knowledge to build a global knowledge model. This machine learning strategy increases privacy and reduces communication costs, benefits that can be very useful for industrial applications deployed at the edge. Automatic Guided Vehicles (AGVs) can take advantage of this approach: they can be considered intelligent agents, they operate in fleets, and they are normally managed by a central system that can run at the edge and handle the knowledge of each vehicle to obtain a global emerging behavioral model. Furthermore, this idea can be combined with reinforcement learning (RL). The AGVs interact with the system and learn, according to the policy implemented by the RL algorithm, to follow specified routes, and they send their findings to the main system. The centralized system collects this information into a group policy and returns it to the AGVs. In this work, a novel Federated Discrete Reinforcement Learning (FDRL) approach is implemented to control the trajectories of a fleet of AGVs. Each industrial AGV runs the modules that correspond to an RL system: a state estimator, a reward calculator, an action selector, and a policy update algorithm. The AGVs share their policy variation with the federated server, which combines the variations into a group policy by means of a learning aggregation function. To validate the proposal, simulation results of the FDRL control for five hybrid tricycle-differential AGVs and four different trajectories (ellipse, lemniscate, octagon, and a closed 16-polyline) have been obtained and compared with a Proportional Integral Derivative (PID) controller optimized with genetic algorithms. The intelligent control approach shows an average improvement of 78% in mean absolute error, 75% in root mean square error, and 73% in standard deviation. The approach also accelerates learning by up to 50%, depending on the trajectory, with an average speed-up of 36%, while still allowing precise tracking. The proposed federated-learning-based technique also outperforms an optimized fuzzy logic controller (FLC) on all of the measured trajectories. In addition, different learning aggregation functions have been proposed and evaluated. The influence of the number of vehicles (from 2 to 10) on path-following performance and on network transmission has been analyzed as well. © 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
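The aggregation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that each AGV's discrete policy is a tabular Q-table, that the "policy variation" is the difference between an AGV's local table and the current group table, and that the learning aggregation function is a plain element-wise mean. The abstract does not specify any of these details, and the function names (`local_q_update`, `aggregate_mean`, `federated_round`), the table size, and the learning parameters are all hypothetical.

```python
import numpy as np


def local_q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step on a copy of the table.

    Assumption: a standard Q-learning rule stands in for the paper's
    policy update algorithm, which the abstract does not detail.
    """
    q = q.copy()
    td_target = reward + gamma * np.max(q[next_state])
    q[state, action] += alpha * (td_target - q[state, action])
    return q


def aggregate_mean(deltas):
    """Hypothetical aggregation function: element-wise mean of the variations."""
    return np.mean(deltas, axis=0)


def federated_round(group_q, local_tables, aggregate=aggregate_mean):
    """Server side: combine each agent's policy variation into the group policy."""
    # Each agent reports only the change relative to the shared group table.
    deltas = [local - group_q for local in local_tables]
    return group_q + aggregate(deltas)


# Toy run: 3 agents, 4 discrete states, 3 discrete actions (all values illustrative).
n_states, n_actions = 4, 3
group_q = np.zeros((n_states, n_actions))

# One (state, action, reward, next_state) transition per agent.
transitions = [(0, 1, 1.0, 2), (0, 1, 0.5, 3), (2, 0, -1.0, 1)]
local_tables = [local_q_update(group_q, s, a, r, s2) for (s, a, r, s2) in transitions]

# The server merges the variations and would then return the result to the fleet.
new_group_q = federated_round(group_q, local_tables)
```

Passing a different callable as `aggregate` (for example a weighted mean) is one way to compare alternative aggregation functions in such a sketch; which functions the paper actually evaluates is not stated in the abstract.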
Pages: 78-89
Number of pages: 12