Multi-path Following for Underactuated USV Based on Deep Reinforcement Learning

被引:1
|
作者
Wang, Zihao [1 ]
Wu, Yaoxin [2 ]
Song, Wen [1 ]
机构
[1] Shandong Univ, Inst Marine Sci & Technol, Qingdao, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Underactuated USV; Path following; Deep reinforcement learning; Multi-task learning;
D O I
10.1007/978-981-99-0479-2_325
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, deep reinforcement learning (DRL) has attracted much attention in learning control policies for unmanned surface vehicles (USV) path following. However, existing DRL methods can only learn to follow one target path, and generalize poorly to other paths. This paper proposes a novel DRL based multi-path following method. Based on the generalized hindsight mechanism, our method can effectively reuse data generated in one path for training on other paths. Experiments show that the control policy trained by our method generalizes well to a wide range of paths, with significantly higher accuracy and success rate.
引用
收藏
页码:3525 / 3535
页数:11
相关论文
共 50 条
  • [1] Path Following Optimization for an Underactuated USV Using Smoothly-Convergent Deep Reinforcement Learning
    Zhao, Yujiao
    Qi, Xin
    Ma, Yong
    Li, Zhixiong
    Malekian, Reza
    Sotelo, Miguel Angel
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (10) : 6208 - 6220
  • [2] Multi-path Scheduling with Deep Reinforcement Learning
    Molla Rosello, Marc
    2019 EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS (EUCNC), 2019, : 400 - 405
  • [3] Multi-Path Routing Algorithm Based on Deep Reinforcement Learning for SDN
    Zhang, Yi
    Qiu, Lanxin
    Xu, Yangzhou
    Wang, Xinjia
    Wang, Shengjie
    Paul, Agyemang
    Wu, Zhefu
    APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [4] USV Path-Following Control Based On Deep Reinforcement Learning and Adaptive Control
    Gonzalez-Garcia, Alejandro
    Castaneda, Herman
    Garrido, Leonardo
    GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
  • [5] Fairness Analysis of Deep Reinforcement Learning based Multi-Path QUIC Scheduling
    Quevedo, Ernesto
    Donahoo, Jeff
    Cerny, Tomas
    38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 1772 - 1781
  • [6] Underactuated USV path following mechanism based on the cascade method
    Mingzhen Lin
    Zhiqiang Zhang
    Yandong Pang
    Hongsheng Lin
    Qing Ji
    Scientific Reports, 12
  • [7] Underactuated USV path following mechanism based on the cascade method
    Lin, Mingzhen
    Zhang, Zhiqiang
    Pang, Yandong
    Lin, Hongsheng
    Ji, Qing
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [8] Path following control of underactuated USV based on asymmetric model
    Wan, Lei, 1600, Editorial office of Ship Building of China, China (57):
  • [9] Research on Underactuated USV Path Following Algorithm
    Yi, Ge
    Liu, Zhong
    Zhang, Jian-qiang
    Dong, Jiao
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 2141 - 2145
  • [10] ADSA: A Multi-path Transmission Scheduling Algorithm based on Deep Reinforcement Learning in Vehicle Networks
    Yin, Chenyang
    Dong, Ping
    Du, Xiaojiang
    Zhang, Yuyang
    Zhang, Hongke
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 5058 - 5063