Path planning via reinforcement learning with closed-loop motion control and field tests

Cited by: 0

Authors:

Feher, Arpad [1]

Domina, Adam [2]

Bardos, Adam [2]

Aradi, Szilard [1]

Becsi, Tamas [1]

Affiliations:

[1] Budapest Univ Technol & Econ, Fac Transportat Engn & Vehicle Engn, Dept Control Transportat & Vehicle Syst, Muegyet Rkp 3, H-1111 Budapest, Hungary

[2] Budapest Univ Technol & Econ, Dept Automot Technol, Fac Transportat Engn & Vehicle Engn, Muegyetem Rkp 3, H-1111 Budapest, Hungary

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2025 / Vol. 142

Keywords:

Vehicle dynamics; Advanced driver assistance systems; Machine learning; Reinforcement learning; Model predictive control; ACTIVE STEERING CONTROL; MODEL; SIMULATION; VEHICLES;

DOI:

10.1016/j.engappai.2024.109870

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Performing evasive maneuvers with highly automated vehicles is a challenging task. The algorithm must fulfill safety constraints and complete the task while keeping the car in a controllable state. Furthermore, when all aspects of vehicle dynamics are considered, the path generation problem is numerically complex, so its classical solutions can hardly meet real-time requirements. On the other hand, approaches based on a single reinforcement learning agent could only handle this problem as a simple driving task and would not provide feasibility information over the whole task's horizon. To overcome this issue, this paper presents a hierarchical method for obstacle avoidance of an automated vehicle, where the geometric path generation is provided by a single-step continuous Reinforcement Learning agent, while a model-predictive controller deals with lateral control to perform a double lane change maneuver. As the agent plays the optimization role in this architecture, it is trained in various scenarios to provide the necessary parameters for a geometric path generator in a one-step neural network output. During the training, the controller that follows the track evaluates the feasibility of the generated path, and its performance metrics provide feedback to the agent so it can further improve its performance. The framework can train an agent for a given problem with various parameters. As a use case, a static obstacle avoidance maneuver is presented. The proposed framework was tested on an automotive proving ground with the geometric constraints of the ISO-3888-2 test. The results proved its real-time capability and performance compared to human drivers' abilities.
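The single-step loop described in the abstract (the agent proposes path parameters, a controller follows the resulting path, and tracking metrics become the reward) can be illustrated with a minimal sketch. This is not the authors' implementation. The cosine-blend lane shift, the kinematic bicycle model, the pure-pursuit follower (used here in place of the model-predictive controller), the reward shape, and all constants and function names are assumptions chosen only for illustration.

```python
import math

# All constants below are illustrative assumptions, not values from the paper.
WHEELBASE = 2.7    # m
SPEED = 15.0       # m/s, held constant
MAX_STEER = 0.5    # rad, steering saturation
ACC_LIMIT = 8.0    # m/s^2, lateral acceleration treated as "feasible"
LOOKAHEAD = 5.0    # m, pure-pursuit preview distance
DT = 0.01          # s, integration step


def path_y(x, offset, length, start=10.0):
    """Hypothetical geometric path: cosine blend from y=0 to y=offset."""
    if x <= start:
        return 0.0
    if x >= start + length:
        return offset
    return 0.5 * offset * (1.0 - math.cos(math.pi * (x - start) / length))


def evaluate_path(offset, length, horizon_s=5.0):
    """Follow the path with pure pursuit on a kinematic bicycle model.

    Returns (max vertical deviation from the path, peak lateral acceleration).
    """
    x = y = yaw = 0.0
    max_err = peak_acc = 0.0
    for _ in range(int(horizon_s / DT)):
        # Target point a fixed distance ahead along the x axis.
        tx = x + LOOKAHEAD
        ty = path_y(tx, offset, length)
        alpha = math.atan2(ty - y, tx - x) - yaw
        dist = math.hypot(tx - x, ty - y)
        steer = math.atan2(2.0 * WHEELBASE * math.sin(alpha), dist)
        steer = max(-MAX_STEER, min(MAX_STEER, steer))
        # Kinematic bicycle update (explicit Euler).
        x += SPEED * math.cos(yaw) * DT
        y += SPEED * math.sin(yaw) * DT
        yaw += SPEED / WHEELBASE * math.tan(steer) * DT
        max_err = max(max_err, abs(y - path_y(x, offset, length)))
        peak_acc = max(peak_acc, SPEED ** 2 * abs(math.tan(steer)) / WHEELBASE)
    return max_err, peak_acc


def reward(offset, length):
    """Single-step episode: one action (path parameters) in, one reward out."""
    max_err, peak_acc = evaluate_path(offset, length)
    return -max_err - max(0.0, peak_acc - ACC_LIMIT)


# Stand-in for a trained policy: pick the best of a few candidate actions.
candidates = [5.0, 15.0, 30.0]
best_length = max(candidates, key=lambda length: reward(3.0, length))
```

In this sketch the whole maneuver is scored from a single action, which mirrors the one-step output described above. A learning agent would replace the candidate search with a policy update driven by the same reward.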


Pages: 13

