Path Planning and Tracking Control for Parking via Soft Actor-Critic Under Non-Ideal Scenarios

被引：18

作者：

Tang, Xiaolin ^{[1
]}

Yang, Yuyou ^{[1
]}

Liu, Teng ^{[1
,2
,3
]}

Lin, Xianke ^{[4
]}

Yang, Kai ^{[1
]}

Li, Shen ^{[5
]}

机构：

[1] Chongqing Univ, Coll Mech & Vehicle Engn, Chongqing 400044, Peoples R China

[2] Chongqing Univ Three Gorges Hosp, Three Gorges Hosp, Clin Res Ctr, Wanzhou 404000, Peoples R China

[3] Chongqing Univ, Three Gorges Hosp, Med Pathol Ctr, Wanzhou 404000, Peoples R China

[4] Ontario Tech Univ, Dept Automot & Mechatron Engn, Oshawa, ON L1G 0C5, Canada

[5] Tsinghua Univ, Sch Civil Engn, Beijing 100084, Peoples R China

来源：

IEEE-CAA JOURNAL OF AUTOMATICA SINICA | 2024年 / 11卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Automatic parking; control strategy; parking deviation (APS); soft actor-critic (SAC);

D O I：

10.1109/JAS.2023.123975

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Parking in a small parking lot within limited space poses a difficult task. It often leads to deviations between the final parking posture and the target posture. These deviations can lead to partial occupancy of adjacent parking lots, which poses a safety threat to vehicles parked in these parking lots. However, previous studies have not addressed this issue. In this paper, we aim to evaluate the impact of parking deviation of existing vehicles next to the target parking lot (PDEVNTPL) on the automatic ego vehicle (AEV) parking, in terms of safety, comfort, accuracy, and efficiency of parking. A segmented parking training framework (SPTF) based on soft actor-critic (SAC) is proposed to improve parking performance. In the proposed method, the SAC algorithm incorporates strategy entropy into the objective function, to enable the AEV to learn parking strategies based on a more comprehensive understanding of the environment. Additionally, the SPTF simplifies complex parking tasks to maintain the high performance of deep reinforcement learning (DRL). The experimental results reveal that the PDEVNTPL has a detrimental influence on the AEV parking in terms of safety, accuracy, and comfort, leading to reductions of more than 27%, 54%, and 26% respectively. However, the SAC-based SPTF effectively mitigates this impact, resulting in a considerable increase in the parking success rate from 71% to 93%. Furthermore, the heading angle deviation is significantly reduced from 2.25 degrees to 0.43 degrees.

引用

页码：181 / 195

页数：15

共 44 条

[1] Akanksha Eisha, 2021, Proceedings of 5th International Conference on Computing Methodologies and Communication (ICCMC 2021), P1416, DOI 10.1109/ICCMC51019.2021.9418283
[2] Deep Reinforcement Learning With NMPC Assistance Nash Switching for Urban Autonomous Driving
Alighanbari, Sina
Azad, Nasser L.
[J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (03): : 2604 - 2615
[3] Bernhard J, 2018, IEEE INT C INTELL TR, P3175, DOI 10.1109/ITSC.2018.8569436
[4] Cai M., 2022, P IEEE INT C NETW SE, P1
[5] Trajectory Planning for Automated Parking Systems Using Deep Reinforcement Learning
Du, Zhuo
Miao, Qiheng
Zong, Changfu
[J]. INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2020, 21 (04) : 881 - 887
[6] Duan JL, 2021, Arxiv, DOI arXiv:2109.05540
[7] Haarnoja T, 2017, PR MACH LEARN RES, V70
[8] Haarnoja Tuomas, 2018, INT C MACH LEARN, V80
[9] Investigation on AEB Key Parameters for Improving Car to Two-Wheeler Collision Safety Using In-Depth Traffic Accident Data
Hu, Lin
Li, Haibo
Yi, Ping
Huang, Jing
Lin, Miao
Wang, Hong
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (01) : 113 - 124
[10] A study on energy distribution strategy of electric vehicle hybrid energy storage system considering driving style based on real urban driving data
Hu, Lin
Tian, Qingtao
Zou, Changfu
Huang, Jing
Ye, Yao
Wu, Xianhui
[J]. RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2022, 162

← 1 2 3 4 5 →