Multi-ASV Coordinated Tracking With Unknown Dynamics and Input Underactuation via Model-Reference Reinforcement Learning Control

被引:9
作者
Hu, Wenbo [1 ,2 ]
Chen, Fei [1 ,2 ]
Xiang, Linying [1 ,2 ]
Chen, Guanrong [3 ]
机构
[1] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110004, Peoples R China
[2] Northeastern Univ Qinhuangdao, Sch Control Engn, Qinhuangdao 066004, Hebei, Peoples R China
[3] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
Reinforcement learning; Uncertainty; Vehicle dynamics; Protocols; Adaptation models; Surges; Damping; Coordinated tracking; input underactuation; model-reference control; reinforcement learning; unknown dynamics; SYSTEMS; CHALLENGES; NAVIGATION; TRENDS; SHIPS;
D O I
10.1109/TCYB.2022.3203507
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article studies coordinated tracking of underactuated and uncertain autonomous surface vehicles (ASVs) via model-reference reinforcement learning control. It considered how model-reference control can be incorporated with reinforcement learning to address the challenges caused by model uncertainties and input underactuation, and how existing results may be employed to realize adaptive communication amongst ASVs. It is demonstrated that the proposed algorithm has a better performance over baseline control and effectively improves the training efficiency over reinforcement learning.
引用
收藏
页码:6588 / 6597
页数:10
相关论文
共 37 条
[1]   Coordinated Tracking in Mean Square for a Multi-Agent System With Noisy Channels and Switching Directed Network Topologies [J].
Chen, Fei ;
Xiang, Linying ;
Lan, Weiyao ;
Chen, Guanrong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2012, 59 (11) :835-839
[2]   Distributed Average Tracking of Multiple Time-Varying Reference Signals With Bounded Derivatives [J].
Chen, Fei ;
Cao, Yongcan ;
Ren, Wei .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (12) :3169-3174
[3]   Concurrent learning adaptive control of linear systems with exponentially convergent bounds [J].
Chowdhary, Girish ;
Yucelen, Tansel ;
Muehlegg, Maximillian ;
Johnson, Eric N. .
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2013, 27 (04) :280-301
[4]  
Chu T., 2020, ARXIV
[5]   Observer-Based Robust Control of Vehicle Dynamics for Rollover Mitigation in Critical Situations [J].
Dahmani, Hamid ;
Pages, Olivier ;
El Hajjaji, Ahmed ;
Daraoui, Nawal .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 15 (01) :274-284
[6]   Global robust adaptive path following of underactuated ships [J].
Do, K. D. ;
Pan, J. .
AUTOMATICA, 2006, 42 (10) :1713-1722
[7]   The Autonomous Maritime Navigation (AMN) Project: Field Tests, Autonomous and Cooperative Behaviors, Data Fusion, Sensors, and Vehicles [J].
Elkins, Les ;
Sellers, Drew ;
Monach, W. Reynolds .
JOURNAL OF FIELD ROBOTICS, 2010, 27 (06) :790-818
[8]  
Foerster J, 2017, PR MACH LEARN RES, V70
[9]  
Foerster JN, 2016, ADV NEUR IN, V29
[10]   A NOVEL-APPROACH TO THE DESIGN OF UNKNOWN INPUT OBSERVERS [J].
GUAN, Y ;
SAIF, M .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1991, 36 (05) :632-635