Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm

被引:17
作者
Guan, Wei [1 ]
Cui, Zhewen [1 ]
Zhang, Xianku [1 ]
机构
[1] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
基金
中国国家自然科学基金;
关键词
decision-making; deep reinforcement learning; Nomoto; PPO; SMASS; COLLISION-AVOIDANCE;
D O I
10.3390/s22155732
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
With the development of artificial intelligence technology, the behavior decision-making of an intelligent smart marine autonomous surface ship (SMASS) has become particularly important. This research proposed local path planning and a behavior decision-making approach based on improved Proximal Policy Optimization (PPO), which could drive an unmanned SMASS to the target without requiring any human experiences. In addition, a generalized advantage estimation was added to the loss function of the PPO algorithm, which allowed baselines in PPO algorithms to be self-adjusted. At first, the SMASS was modeled with the Nomoto model in a simulation waterway. Then, distances, obstacles, and prohibited areas were regularized as rewards or punishments, which were used to judge the performance and manipulation decisions of the vessel Subsequently, improved PPO was introduced to learn the action-reward model, and the neural network model after training was used to manipulate the SMASS's movement. To achieve higher reward values, the SMASS could find an appropriate path or navigation strategy by itself. After a sufficient number of rounds of training, a convincing path and manipulation strategies would likely be produced. Compared with the proposed approach of the existing methods, this approach is more effective in self-learning and continuous optimization and thus closer to human manipulation.
引用
收藏
页数:33
相关论文
共 32 条
  • [1] A knowledge-free path planning approach for smart ships based on reinforcement learning
    Chen, Chen
    Chen, Xian-Qiao
    Ma, Feng
    Zeng, Xiao-Jun
    Wang, Jin
    [J]. OCEAN ENGINEERING, 2019, 189
  • [2] Deep reinforcement learning-based collision avoidance for an autonomous ship
    Chun, Do-Hyun
    Roh, Myung-Il
    Lee, Hye-Won
    Ha, Jisang
    Yu, Donghun
    [J]. OCEAN ENGINEERING, 2021, 234
  • [3] Dearden R, 1998, FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, P761
  • [4] Neural networks based reinforcement learning for mobile robots obstacle avoidance
    Duguleana, Mihai
    Mogan, Gheorghe
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 62 : 104 - 115
  • [5] Avalon Navigation Strategy and Trajectory Following Controller for an Autonomous Sailing Vessel
    Erckens, Hendrik
    Buesser, Gion-Andri
    Pradalier, Cedric
    Siegwart, Roland Y.
    [J]. IEEE ROBOTICS & AUTOMATION MAGAZINE, 2010, 17 (01) : 45 - 54
  • [6] Everett M, 2018, IEEE INT C INT ROBOT, P3052, DOI 10.1109/IROS.2018.8593871
  • [7] A Novel Reinforcement Learning Collision Avoidance Algorithm for USVs Based on Maneuvering Characteristics and COLREGs
    Fan, Yunsheng
    Sun, Zhe
    Wang, Guofeng
    [J]. SENSORS, 2022, 22 (06)
  • [8] Ship Steering Adaptive CGS Control Based on EKF Identification Method
    Guan, Wei
    Peng, Haowen
    Zhang, Xianku
    Sun, Hui
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (02)
  • [9] Ship Steering Control Based on Quantum Neural Network
    Guan, Wei
    Zhou, Haotian
    Su, Zuojing
    Zhang, Xianku
    Zhao, Chao
    [J]. COMPLEXITY, 2019, 2019
  • [10] Deep Reinforcement Learning Approach with Multiple Experience Pools for UAV's Autonomous Motion Planning in Complex Unknown Environments
    Hu, Zijian
    Wan, Kaifang
    Gao, Xiaoguang
    Zhai, Yiwei
    Wang, Qianglong
    [J]. SENSORS, 2020, 20 (07)