An End-to-End Deep Reinforcement Learning Model Based on Proximal Policy Optimization Algorithm for Autonomous Driving of Off-Road Vehicle

被引:0
作者
Wang, Yiquan [1 ,2 ]
Wang, Jingguo [2 ]
Yang, Yu [1 ]
Li, Zhaodong [1 ]
Zhao, Xijun [1 ]
机构
[1] China North Artificial Intelligence & Innovat Res, Beijing, Peoples R China
[2] Jiuquan Satellite Launch Ctr, Jiuquan, Gansu, Peoples R China
来源
PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022 | 2023年 / 1010卷
关键词
Reinforcement Learning; End-to-End; UGV; Wild Environment; GROUND VEHICLE;
D O I
10.1007/978-981-99-0479-2_248
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most conventional unmanned vehicle control algorithms require human adjustment of parameters and design of precise rules, thus failing to adapt quickly to multiple situations when facing complex environments in the wild. To address these problems, this paper adopts an end-to-end deep reinforcement learning model based on proximal policy optimization algorithm to control the steering, speed and braking of an unmanned vehicle, allowing it to autonomously learn motion control strategies from perceptionmap in un-known environments. A novel environment simulator which contains variable passable areas and obstacles is also proposed to support agents to achieve target reward. The proposed agent model has been proved to receive the highest reward over SAC and has the ability to overcome the complexity of the wild environment generated by the simulator.
引用
收藏
页码:2692 / 2704
页数:13
相关论文
共 21 条
  • [1] Chae H, 2017, IEEE INT C INTELL TR
  • [2] Chen JY, 2019, IEEE INT C INTELL TR, P2765, DOI [10.1109/itsc.2019.8917306, 10.1109/ITSC.2019.8917306]
  • [3] A reinforcement learning-based approach for modeling and coverage of an unknown field using a team of autonomous ground vehicles
    Faryadi, Saba
    Mohammadpour Velni, Javad
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (02) : 1069 - 1084
  • [4] Fujimura K., 2017, TACTICAL DECISION MA
  • [5] Hoel CJ, 2018, IEEE INT C INTELL TR, P2148, DOI 10.1109/ITSC.2018.8569568
  • [6] James Queeney, 2021, Advances in Neural Information Processing Systems, V34, P11909
  • [7] Deep Reinforcement Learning for Safe Local Planning of a Ground Vehicle in Unknown Rough Terrain
    Josef, Shirel
    Degani, Amir
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6748 - 6755
  • [8] Kendall A, 2019, IEEE INT CONF ROBOT, P8248, DOI [10.1109/icra.2019.8793742, 10.1109/ICRA.2019.8793742]
  • [9] LEWIS T, 2021, 2021 WORLD AUT C WAC, P31
  • [10] Rover-IRL: Inverse Reinforcement Learning With Soft Value Iteration Networks for Planetary Rover Path Planning
    Pflueger, Max
    Agha, Ali
    Sukhatme, Gaurav S.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02): : 1387 - 1394