Real-Time Resilient Tracking Control for Autonomous Vehicles Through Triple Iterative Approximate Dynamic Programming

Cited by: 0

Authors:

Li, Wenyu [1,2]

Geng, Jiale [1,2]

Cheng, Yunqi [3]

Tang, Liye [4]

Duan, Jingliang [4,5]

Duan, Feng [1,2]

Li, Shengbo Eben [4]

Affiliations:

[1] Nankai Univ, Coll Artificial Intelligence, Tianjin 300350, Peoples R China

[2] Nankai Univ, Tianjin Key Lab Intervent Brain Comp Interface & I, Tianjin 300350, Peoples R China

[3] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230026, Anhui, Peoples R China

[4] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China

[5] Univ Sci & Technol Beijing, Sch Mech Engn, Beijing 100083, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2025 / Vol. 26 / No. 1

Keywords:

Vehicle dynamics; Real-time systems; Iterative methods; Trajectory tracking; Safety; Optimal control; Dynamic programming; Trajectory; Convergence; Autonomous vehicles; Approximate dynamic programming; autonomous vehicles; neural network; resilient tracking control; NONLINEAR-SYSTEMS; DESIGN; STABILITY; MPC;

DOI:

10.1109/TITS.2024.3489019

Chinese Library Classification (CLC):

TU [Building Science];

Discipline classification code:

0813;

Abstract:

Control precision, disturbance mitigation, and real-time responsiveness are the cornerstones of autonomous vehicle tracking, and all three are closely interwoven with operational safety. To address these issues, this paper presents a triple iterative control method, inspired by approximate dynamic programming (ADP), for real-time disturbance avoidance. The control framework iterates the value function, control policy, and disturbance policy simultaneously to optimize tracking control under external disturbances, which is cast as a zero-sum differential game and solved with deep neural networks. A rigorous mathematical proof underpins the triple iteration, together with a guarantee of residual error convergence, supporting the method's safety guarantees and algorithmic resilience. To validate its effectiveness, both numerical simulations and experiments on a real micro-vehicle platform were conducted. The results demonstrate the feasibility of the method, showing energy savings and a fourfold speedup over conventional model predictive control (MPC) under lateral disturbances. Notably, the single-step calculation time of the method on a Raspberry Pi is only 1.44 ms, which supports its practical viability.
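To make the idea of iterating a value function, a control policy, and a disturbance policy together more concrete, the sketch below works through a toy case. It is not the authors' algorithm: it assumes a scalar, discrete-time, linear-quadratic zero-sum game with no neural networks, and every name in it (`triple_iteration`, `a`, `b`, `d`, `q`, `r`, `gamma2`) is hypothetical. The assumed dynamics are x' = a·x + b·u + d·w with stage cost q·x² + r·u² − gamma2·w², where the controller u minimizes and the disturbance w maximizes.

```python
def triple_iteration(a, b, d, q, r, gamma2, sweeps=200, tol=1e-12):
    """Toy zero-sum game iteration for x' = a*x + b*u + d*w (assumed model).

    Value function V(x) = p * x**2, control u = -k * x, disturbance w = l * x.
    Each sweep updates the two policy gains from the current p, then updates p.
    """
    p, k, l = 0.0, 0.0, 0.0
    for _ in range(sweeps):
        # A saddle point needs the disturbance penalty to dominate: gamma2 > p*d^2.
        if gamma2 <= p * d * d:
            raise ValueError("gamma2 too small: no saddle point for this p")
        denom = 1.0 + p * b * b / r - p * d * d / gamma2

        # Control policy update (minimizer) and disturbance policy update (maximizer),
        # both from the first-order conditions of the one-step game.
        k = p * b * a / (r * denom)
        l = p * d * a / (gamma2 * denom)

        # Value update under the new pair of policies.
        a_cl = a - b * k + d * l  # closed-loop coefficient
        p_new = q + r * k * k - gamma2 * l * l + p * a_cl * a_cl

        converged = abs(p_new - p) < tol
        p = p_new
        if converged:
            break
    return p, k, l


# Example call with illustrative (made-up) parameters.
p, k, l = triple_iteration(a=0.9, b=1.0, d=0.5, q=1.0, r=1.0, gamma2=4.0)
```

In this toy setting the three updates collapse into a scalar game Riccati recursion, so convergence can be checked by substituting the returned `p` back into the recursion. The method described in the abstract instead targets a differential game with deep neural network approximators, for which no such closed form is available.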

Pages: 1015-1028

Page count: 14

