Model-Reference Reinforcement Learning Control of Autonomous Surface Vehicles

被引:0
作者
Zhang, Qingrui [1 ,2 ]
Pan, Wei [2 ]
Reppa, Vasso [1 ]
机构
[1] Delft Univ Technol, Dept Maritime & Transport Technol, Delft, Netherlands
[2] Delft Univ Technol, Dept Cognit Robot, Delft, Netherlands
来源
2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC) | 2020年
关键词
ADAPTIVE-CONTROL; ROBUST; SYSTEM;
D O I
10.1109/cdc42340.2020.9304347
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel model-reference reinforcement learning control method for uncertain autonomous surface vehicles. The proposed control combines a conventional model-based control method with deep reinforcement learning. With the conventional model-based control, we can ensure the learning-based control law provides closed-loop stability for the trajectory tracking control of the overall system, and increase the sample efficiency of the deep reinforcement learning. With reinforcement learning, we can directly learn a control law to compensate for modeling uncertainties. In the proposed control, a nominal system is employed for the design of a baseline control law using a conventional control approach. The nominal system also defines the desired performance for uncertain autonomous vehicles to follow. In comparison with traditional deep reinforcement learning methods, our proposed learning-based control can provide stability guarantees and better sample efficiency. We demonstrate the performance of the new algorithm via extensive simulation results.
引用
收藏
页码:5291 / 5296
页数:6
相关论文
共 50 条
  • [31] Adaptive dynamic surface longitudinal tracking control of autonomous vehicles
    Guo, Jinghua
    Luo, Yugong
    Li, Keqiang
    Guo, Lie
    IET INTELLIGENT TRANSPORT SYSTEMS, 2019, 13 (08) : 1272 - 1280
  • [32] MODEL-REFERENCE ADAPTIVE-CONTROL FOR ROBOTIC MANIPULATORS WITHOUT VELOCITY-MEASUREMENTS
    SCHWARTZ, HM
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 1994, 8 (03) : 279 - 285
  • [33] Docking Control of an Autonomous Underwater Vehicle Using Reinforcement Learning
    Anderlini, Enrico
    Parker, Gordon G.
    Thomas, Giles
    APPLIED SCIENCES-BASEL, 2019, 9 (17):
  • [34] Anti-windup for model-reference adaptive control schemes with rate-limits
    Turner, Matthew C.
    Sofrony, Jorge
    Prempain, Emmanuel
    SYSTEMS & CONTROL LETTERS, 2020, 137
  • [35] Model-reference adaptive-control of a PFC-equipped battery-charger
    Farhangi, S
    Yazdani, A
    Fahimi, B
    IECON'01: 27TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-3, 2001, : 1015 - 1020
  • [36] Model-Reference Control Approach to Obstacle Avoidance for a Human-Operated Mobile Robot
    Uchiyama, Naoki
    Hashimoto, Tatsuhiro
    Sano, Shigenori
    Takagi, Shoji
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2009, 56 (10) : 3892 - 3896
  • [37] COOR-PLT: A hierarchical control model for coordinating adaptive platoons of connected and autonomous vehicles at signal-free intersections based on deep reinforcement learning
    Li, Duowei
    Zhu, Feng
    Chen, Tianyi
    Wong, Yiik Diew
    Zhu, Chunli
    Wu, Jianping
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 146
  • [38] Double look-ahead reference point control for autonomous agricultural vehicles
    Bodur, Mehmet
    Kiani, Ehsan
    Hacisevki, Hasan
    BIOSYSTEMS ENGINEERING, 2012, 113 (02) : 173 - 186
  • [39] A Novel Approach to Model-Reference Adaptive PID Controllers
    Tavazoei, Mohammad
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2024, : 371 - 381
  • [40] Autonomous UAV Navigation with Adaptive Control Based on Deep Reinforcement Learning
    Yin, Yongfeng
    Wang, Zhetao
    Zheng, Lili
    Su, Qingran
    Guo, Yang
    ELECTRONICS, 2024, 13 (13)