USV Trajectory Tracking Control Based on Receding Horizon Reinforcement Learning

Cited by: 2

Authors:

Wen, Yinghan [1]

Chen, Yuepeng [1]

Guo, Xuan [2]

Affiliations:

[1] Wuhan Univ Technol, Sch Automat, Wuhan 430070, Peoples R China

[2] Wuhan Univ Technol, Sch Informat Engn, Wuhan 430070, Peoples R China

Source:

SENSORS | 2024 / Vol. 24 / Issue 09

Keywords:

unmanned surface vehicle; receding horizon reinforcement learning; trajectory tracking; executive-evaluator; lateral control

DOI:

10.3390/s24092771

Chinese Library Classification number:

O65 [Analytical Chemistry]

Discipline classification codes:

070302; 081704

Abstract:

We present an approach to high-precision trajectory tracking control of an unmanned surface vehicle (USV) based on receding horizon reinforcement learning (RHRL). The USV control architecture combines feedforward and feedback components. The feedforward component is derived directly from the curvature of the reference path and the dynamic model. The feedback component is obtained with the RHRL algorithm, which addresses the optimal tracking control problem. The method incorporates the rolling time domain (receding horizon) optimization mechanism, converting the infinite-horizon optimal control problem into a sequence of finite-horizon control problems that can be solved. In contrast to Lyapunov model predictive control (LMPC) and sliding mode control (SMC), the RHRL controller yields an explicit state feedback control law, so the controller supports both direct offline deployment and online learning deployment. Within each prediction horizon, a time-independent executive-evaluator network structure learns the optimal value function and control policy. We prove the convergence of the RHRL algorithm within each prediction horizon and analyze the stability of the closed-loop system. Finally, USV trajectory control tests are carried out in a simulated environment.
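The structure described in the abstract (feedforward from path curvature, plus an explicit state feedback law recomputed over a finite horizon at every step) can be illustrated with a toy example. The following is a minimal sketch under strong assumptions, not the paper's method: it uses a hypothetical scalar heading-error model, and it replaces the learned executive-evaluator networks with a finite-horizon linear-quadratic (Riccati) recursion. All function names, weights, and parameter values are illustrative and do not come from the paper.

```python
def finite_horizon_gain(a, b, q, r, horizon):
    """Backward Riccati recursion for the scalar system x[k+1] = a*x[k] + b*u[k].

    Returns the feedback gain for the first step of the horizon.
    This stands in for the learned policy; it is NOT the RHRL algorithm.
    """
    p = q  # terminal cost weight (assumed equal to the stage weight)
    gain = 0.0
    for _ in range(horizon):
        gain = (a * b * p) / (r + b * b * p)
        p = q + a * p * (a - b * gain)
    return gain


def simulate(e0=0.5, speed=1.0, curvature=0.2, dt=0.1, steps=100, horizon=10):
    """Track a constant-curvature path with feedforward plus feedback yaw rate.

    Assumed toy model: heading error e evolves as
    e[k+1] = e[k] + dt * (yaw_rate - speed * curvature).
    """
    a, b = 1.0, dt  # error dynamics after feedforward cancellation
    e = e0
    history = [e]
    for _ in range(steps):
        # Feedforward: yaw rate needed to follow the reference curvature.
        u_ff = speed * curvature
        # Receding horizon: solve a finite-horizon problem at each step and
        # apply only the first control. The model here is time-invariant,
        # so the gain is the same every step; it is recomputed to show the loop.
        k = finite_horizon_gain(a, b, q=1.0, r=0.1, horizon=horizon)
        u_fb = -k * e  # explicit state feedback law
        yaw_rate = u_ff + u_fb
        e = e + dt * (yaw_rate - speed * curvature)
        history.append(e)
    return history


if __name__ == "__main__":
    errors = simulate()
    print(f"initial error: {errors[0]:.3f}, final error: {errors[-1]:.2e}")
```

In this sketch the feedforward term cancels the curvature-induced drift exactly, so the feedback law only has to regulate the remaining error. That separation is the point of the composite architecture, although the paper derives its feedforward term from a full dynamic model rather than this kinematic simplification.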

Pages: 19
