Smooth reinforcement learning-based trajectory tracking for articulated vehicles

被引：0

作者：

Chen, Liangfa ^{[1
]}

Song, Xujie ^{[2
]}

Xiao, Liming ^{[1
]}

Gao, Lulu ^{[1
]}

Zhang, Fawang ^{[3
]}

Li, Shengbo ^{[2
]}

Ma, Fei ^{[1
]}

Duan, Jingliang ^{[1
]}

机构：

[1] School of Mechanical Engineering, University of Science and Technology Beijing, Beijing

[2] School of Vehicle and Mobility, Tsinghua University, Beijing

[3] School of Mechanical Engineering, Beijing Institute of Technology, Beijing

来源：

Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology | 2024年 / 56卷 / 12期

关键词：

action smoothing; articulated vehicle; automatic drive; reinforcement learning; trajectory tracking;

D O I：

10.11918/202310026

中图分类号：

学科分类号：

摘要：

This research tackles the challenge of action fluctuation in articulated vehicle trajectory tracking control, aiming to enhance both accuracy and smoothness. It introduces a novel approach: a smooth tracking control methodology grounded in reinforcement learning (RL). Firstly, to improve the control accuracy, we incorporate trajectory preview information as input to both the policy and value networks and establish a predictive policy iteration framework. Then, to ensure control smoothness, we employ the LipsNet network to approximate the policy function, to realize the adaptive restriction of the Lipschitz constant of the policy network. Finally, coupled with distributional RL theory, we formulate an articulated vehicle trajectory tracking control method, named smooth distributional soft actor-critic (SDSAC), focusing on achieving synergistic optimization of both control precision and action smoothness. The simulation results demonstrate that the proposed method can maintain good action smoothing ability under six different noise levels, and has strong noise robustness and high tracking accuracy. Compared with traditional value distribution reinforcement learning distributional soft actor-critic (DSAC), SDSAC improves action smoothness by more than 5. 8 times under high noise conditions. In addition, compared with model predictive control, SDSAC’ s average single-step solution speed is improved by about 60 times, and it has higher online computing efficiency. © 2024 Harbin Institute of Technology. All rights reserved.

引用

页码：116 / 123

页数：7

共 29 条

[1]

YU Xiangjun, HUAI Yuanhui, YAO Zongwei, Et al., Key technologies in autonomousvehicle for engineering, Journal of Jilin University (Engineering and Technology Edition), 51, 4, (2021)

[2]

TAN Senqi, ZHAO Xinxin, YANG Jue, Et al., A path tracking algorithm for articulated vehicle: development and simulations, 2017 IEEE Transportation Electrification Conference and Expo, Asia-Pacific (ITEC Asia-Pacific), (2017)

[3]

MENG Yu, GAN Xin, WANG Yu, Et al., LQR-GA controller for articulated dump truck path tracking system, Journal of Shanghai Jiaotong University (Science), 24, 1, (2019)

[4]

YAKUB F, MORI Y., Comparative study of autonomous path-following vehicle control via model predictive control and linear quadratic control, Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering, 229, 12, (2015)

[5]

TIAN Haiyong, SHEN Yanhua, ZHANG Wenming, Et al., Slip ratio control for articulated dump truck based on fuzzy sliding mode, 2011 International Conference on Consumer Electronics, Communications and Networks (CECNet), (2011)

[6]

BAI Guoxing, LIU Li, MENG Yu, Et al., Path tracking of mining vehicles based on nonlinear model predictive control [ J], Applied Sciences, 9, 7, (2019)

[7]

LI Sixu, XU Biao, HU Manjiang, Et al., A dynamic model predictive control approach for multipoint preview path tracking of articulated vehicles, Automotive Engineering, 43, 8, (2021)

[8]

LIN Fen, WANG Shaobo, ZHAO Youqun, Et al., Research on autonomous vehicle path tracking control considering roll stability, Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering, 235, 1, (2021)

[9]

LIU Qinghe, WANG Zewen, ZHAO Lijun, Vehicle tracking optimization based on adaptive LOS guidance and MPC control, Journal of Harbin Institute of Technology, 54, 1, (2022)

[10]

GE Qiang, SARTORETTI G, DUAN Jingliang, Et al., Distributed model predictive control of connected multi-vehicle systems at unsignalized intersections, 2022 IEEE International Conference on Unmanned Systems (ICUS), (2022)

← 1 2 3 →