An End-to-End Deep Reinforcement Learning Model Based on Proximal Policy Optimization Algorithm for Autonomous Driving of Off-Road Vehicle

Cited by: 0

Authors:

Wang, Yiquan [1,2]

Wang, Jingguo [2]

Yang, Yu [1]

Li, Zhaodong [1]

Zhao, Xijun [1]

Affiliations:

[1] China North Artificial Intelligence & Innovat Res, Beijing, Peoples R China

[2] Jiuquan Satellite Launch Ctr, Jiuquan, Gansu, Peoples R China

Source:

PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022 | 2023 / Vol. 1010

Keywords:

Reinforcement Learning; End-to-End; UGV; Wild Environment; GROUND VEHICLE;

DOI:

10.1007/978-981-99-0479-2_248

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Most conventional unmanned vehicle control algorithms require manual parameter tuning and precisely designed rules, and therefore fail to adapt quickly to varied situations in complex wild environments. To address these problems, this paper adopts an end-to-end deep reinforcement learning model based on the proximal policy optimization (PPO) algorithm to control the steering, speed and braking of an unmanned vehicle, allowing it to autonomously learn motion control strategies from perception maps in unknown environments. A novel environment simulator containing variable passable areas and obstacles is also proposed to support agents in achieving the target reward. The proposed agent model is shown to achieve a higher reward than SAC and is able to cope with the complexity of the wild environments generated by the simulator.


Pages: 2692-2704

Page count: 13

