Time optimal trajectory planning of excavator based on deep reinforcement learning

被引:0
|
作者
Zhang Y.-Y. [1 ]
Sun Z.-Y. [1 ]
Sun Q.-L. [1 ]
Wang Y. [1 ]
机构
[1] Department of Electronics and Information Engineering, Taiyuan University of Science and Technology, Taiyuan
来源
Kongzhi yu Juece/Control and Decision | 2024年 / 39卷 / 05期
关键词
autonomous operation; excavator; intelligent decision-making; multi-agent; PPO algorithm; trajectory planning;
D O I
10.13195/j.kzyjc.2022.0811
中图分类号
学科分类号
摘要
Aiming at the autonomous operation scenarios of excavators, a time optimal trajectory planning method based on reinforcement learning is proposed. This method builds a simulation environment to generate data. The angle and velocity of the boom, arm and bucket joints are used as state observation variables, and the angle acceleration of each joint is used as action information, and the simulation environment and autonomous learning are realized through the state observation information. The interaction of the algorithm is designed to train the policy network parameters using whether the joint motion of the boom, arm and bucket exceeds the allowable range, the total time to complete the task and the relative distance of the target as the reward function to train the policy network parameters. Finally, using the improved proximal policy optimization (PPO) realizes the time optimal trajectory planning of the excavator. At the same time, compared with the results of the different reinforcement learning algorithms with continuous action spaces, the experimental results show that the proposed optimization algorithm has higher efficiency, faster convergence speed, and smoother operation trajectory, which can effectively avoid the large impact on each joint and contribute to the efficient and stable operation of the excavator. © 2024 Northeast University. All rights reserved.
引用
收藏
页码:1433 / 1440
页数:7
相关论文
共 26 条
  • [1] Dadhich S, Bodin U, Andersson U., Key challenges in automation of earth-moving machines, Automation in Construction, 68, pp. 212-222, (2016)
  • [2] Jung T, Raduenz H, Krus P, Et al., Boom energy recuperation system and control strategy for hydraulic hybrid excavators, Automation in Construction, 135, (2022)
  • [3] Vahdatikhaki F, Langroodi A K, Scholtenhuis L O, Et al., Feedback support system for training of excavator operators, Automation in Construction, 136, (2022)
  • [4] Koivo A J, Thoma M, Kocaoglan E, Et al., Modeling and control of excavator dynamics during digging operation, Journal of Aerospace Engineering, 9, 1, pp. 10-18, (1996)
  • [5] Li Y H, Fan R J, Yang L M, Et al., Research status and development trend of intelligent excavators, Journal of Mechanical Engineering, 56, 13, pp. 165-178, (2020)
  • [6] Ge L, Dong Z X, Li Y H, Et al., Research on digital prototypes of serial hydraulic excavators, Journal of Mechanical Engineering, 55, 14, pp. 186-196, (2019)
  • [7] Assadzadeh A, Arashpour M, Brilakis I, Et al., Vision-based excavator pose estimation using synthetically generated datasets with domain randomization, Automation in Construction, 134, (2022)
  • [8] Yoo S, Park C G, Lim B, Et al., Bandwidth maximizing design for hydraulically actuated excavators, Journal of Vibration and Control, 16, 14, pp. 2109-2130, (2010)
  • [9] Ren Z M., Rsearch on walking trajectory control of tracked hydraulic excavator based on SLAM and IMU fusion positioning, pp. 74-95, (2020)
  • [10] Zhang J., Research on electro-hydraulic servo control strategy for trajectory tracking of robotic excavator, pp. 46-60, (2020)