Path Planning and Tracking Control for Parking via Soft Actor-Critic Under Non-Ideal Scenarios

被引:18
作者
Tang, Xiaolin [1 ]
Yang, Yuyou [1 ]
Liu, Teng [1 ,2 ,3 ]
Lin, Xianke [4 ]
Yang, Kai [1 ]
Li, Shen [5 ]
机构
[1] Chongqing Univ, Coll Mech & Vehicle Engn, Chongqing 400044, Peoples R China
[2] Chongqing Univ Three Gorges Hosp, Three Gorges Hosp, Clin Res Ctr, Wanzhou 404000, Peoples R China
[3] Chongqing Univ, Three Gorges Hosp, Med Pathol Ctr, Wanzhou 404000, Peoples R China
[4] Ontario Tech Univ, Dept Automot & Mechatron Engn, Oshawa, ON L1G 0C5, Canada
[5] Tsinghua Univ, Sch Civil Engn, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Automatic parking; control strategy; parking deviation (APS); soft actor-critic (SAC);
D O I
10.1109/JAS.2023.123975
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Parking in a small parking lot within limited space poses a difficult task. It often leads to deviations between the final parking posture and the target posture. These deviations can lead to partial occupancy of adjacent parking lots, which poses a safety threat to vehicles parked in these parking lots. However, previous studies have not addressed this issue. In this paper, we aim to evaluate the impact of parking deviation of existing vehicles next to the target parking lot (PDEVNTPL) on the automatic ego vehicle (AEV) parking, in terms of safety, comfort, accuracy, and efficiency of parking. A segmented parking training framework (SPTF) based on soft actor-critic (SAC) is proposed to improve parking performance. In the proposed method, the SAC algorithm incorporates strategy entropy into the objective function, to enable the AEV to learn parking strategies based on a more comprehensive understanding of the environment. Additionally, the SPTF simplifies complex parking tasks to maintain the high performance of deep reinforcement learning (DRL). The experimental results reveal that the PDEVNTPL has a detrimental influence on the AEV parking in terms of safety, accuracy, and comfort, leading to reductions of more than 27%, 54%, and 26% respectively. However, the SAC-based SPTF effectively mitigates this impact, resulting in a considerable increase in the parking success rate from 71% to 93%. Furthermore, the heading angle deviation is significantly reduced from 2.25 degrees to 0.43 degrees.
引用
收藏
页码:181 / 195
页数:15
相关论文
共 44 条
  • [1] Akanksha Eisha, 2021, Proceedings of 5th International Conference on Computing Methodologies and Communication (ICCMC 2021), P1416, DOI 10.1109/ICCMC51019.2021.9418283
  • [2] Deep Reinforcement Learning With NMPC Assistance Nash Switching for Urban Autonomous Driving
    Alighanbari, Sina
    Azad, Nasser L.
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (03): : 2604 - 2615
  • [3] Bernhard J, 2018, IEEE INT C INTELL TR, P3175, DOI 10.1109/ITSC.2018.8569436
  • [4] Cai M., 2022, P IEEE INT C NETW SE, P1
  • [5] Trajectory Planning for Automated Parking Systems Using Deep Reinforcement Learning
    Du, Zhuo
    Miao, Qiheng
    Zong, Changfu
    [J]. INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2020, 21 (04) : 881 - 887
  • [6] Duan JL, 2021, Arxiv, DOI arXiv:2109.05540
  • [7] Haarnoja T, 2017, PR MACH LEARN RES, V70
  • [8] Haarnoja Tuomas, 2018, INT C MACH LEARN, V80
  • [9] Investigation on AEB Key Parameters for Improving Car to Two-Wheeler Collision Safety Using In-Depth Traffic Accident Data
    Hu, Lin
    Li, Haibo
    Yi, Ping
    Huang, Jing
    Lin, Miao
    Wang, Hong
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (01) : 113 - 124
  • [10] A study on energy distribution strategy of electric vehicle hybrid energy storage system considering driving style based on real urban driving data
    Hu, Lin
    Tian, Qingtao
    Zou, Changfu
    Huang, Jing
    Ye, Yao
    Wu, Xianhui
    [J]. RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2022, 162