Global path planning algorithm based on double DQN for multi-tasks amphibious unmanned surface vehicle

被引:80
作者
Xiaofei, Yang [1 ]
Yilun, Shi [1 ]
Wei, Liu [1 ]
Hui, Ye [1 ]
Weibo, Zhong [1 ]
Zhengrong, Xiang [2 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Elect & Informat, Zhenjiang 212003, Jiangsu, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Automat, Nanjing 210094, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
DDQN; Path planning; Reinforcement learning; Amphibious USV; Electrical nautical chart; DESIGN;
D O I
10.1016/j.oceaneng.2022.112809
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
It is a key to making path planning for an amphibious unmanned surface vehicle (USV). A global path planning algorithm based on double deep Q networks (DDQN) is proposed. Firstly, an environment model is constructed by an electronic nautical chart and elevation map to train and verify the algorithm. Secondly, based on the kinematics of amphibious USV, a Markov decision process (MDP) framework is built, and various reward functions are designed for diverse tasks. During the training, obstacles and water depth information of the environment are used, the amphibious USV agent is guided to the target area. Meanwhile, based on the prior knowledge, an action mask approach is integrated to deal with the invalid actions generated by the amphibious USV. Path smoothing is also integrated to smooth the path. According to different criteria, reasonable paths can be generated and adjusted by the weights of the reward function. To verify our algorithm, a small-scale simu-lation environment is established, and two scenarios are introduced. The results show that our DDQN algorithm can generate reasonable global paths for diverse tasks. Moreover, compared with DQN, A*, and RRT algorithms, the paths generated by our method have better performance.
引用
收藏
页数:14
相关论文
共 46 条
[1]   Loon Copter: Implementation of a hybrid unmanned aquatic-aerial quadcopter with active buoyancy control [J].
Alzu'bi, Hamzeh ;
Mansour, Iyad ;
Rawashdeh, Osamah .
JOURNAL OF FIELD ROBOTICS, 2018, 35 (05) :764-778
[2]   Nezha-Mini: Design and Locomotion of a Miniature Low-Cost Hybrid Aerial Underwater Vehicle [J].
Bi, Yuanbo ;
Jin, Yufei ;
Lyu, Chenxin ;
Zeng, Zheng ;
Lian, Lian .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) :6669-6676
[3]  
Changwu Zhang, 2018, 2018 IEEE 9th International Conference on Software Engineering and Service Science (ICSESS). Proceedings, P563, DOI 10.1109/ICSESS.2018.8663950
[4]   A knowledge-free path planning approach for smart ships based on reinforcement learning [J].
Chen, Chen ;
Chen, Xian-Qiao ;
Ma, Feng ;
Zeng, Xiao-Jun ;
Wang, Jin .
OCEAN ENGINEERING, 2019, 189
[5]   Concise deep reinforcement learning obstacle avoidance for underactuated unmanned marine vessels [J].
Cheng, Yin ;
Zhang, Weidong .
NEUROCOMPUTING, 2018, 272 :63-73
[6]   Deep reinforcement learning-based collision avoidance for an autonomous ship [J].
Chun, Do-Hyun ;
Roh, Myung-Il ;
Lee, Hye-Won ;
Ha, Jisang ;
Yu, Donghun .
OCEAN ENGINEERING, 2021, 234
[7]   Autonomous takeoff control system design for unmanned seaplanes [J].
Du, Huan ;
Fan, Guoliang ;
Yi, Jianqiang .
OCEAN ENGINEERING, 2014, 85 :21-31
[8]   Randomized Bidirectional B-Spline Parameterization Motion Planning [J].
Elbanhawi, Mohamed ;
Simic, Milan ;
Jazar, Reza .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (02) :406-419
[9]   Design of Amphibious Vehicle for Unmanned Mission in Water Quality Monitoring Using Internet of Things [J].
Esakki, Balasubramanian ;
Ganesan, Surendar ;
Mathiyazhagan, Silambarasan ;
Ramasubramanian, Kanagachidambaresan ;
Gnanasekaran, Bhuvaneshwaran ;
Son, Byungrak ;
Park, Su Woo ;
Choi, Jae Sung .
SENSORS, 2018, 18 (10)
[10]   A Path Planning Method for the Spherical Amphibious Robot Based on Improved A-star Algorithm [J].
Guo, Jian ;
Huo, Xiaojie ;
Guo, Shuxiang ;
Xu, Jigang .
2021 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2021), 2021, :1274-1279