A LARGE-SCALE PATH PLANNING ALGORITHM FOR UNDERWATER ROBOTS BASED ON DEEP REINFORCEMENT LEARNING

被引：0

作者：

Wang, Wenhui ^{[1
]}

Li, Leqing ^{[1
]}

Ye, Fumeng ^{[1
]}

Peng, Yumin ^{[1
]}

Ma, Yiming ^{[1
]}

机构：

[1] CSG PGC Power Storage Res Inst, Guangzhou 510000, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION | 2024年 / 39卷 / 03期

关键词：

DDPG algorithm; reward function; deep reinforcement learning; underwater robot; large-scale path planning; VEHICLE;

D O I：

10.2316/J.2024.206-1035

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To ensure the effect and improve the accuracy of large-scale path planning for underwater robots, a large-scale algorithm for planning the path for underwater robots based on deep reinforcement learning is proposed. Deep reinforcement learning is analysed, and the idea, structure, network update method, and training process of deep deterministic policy gradients (DDPG) algorithm are described. A fitness learning model of the robot which under water is confirmed to describe the mathematical relationship between the geographical location and operating speed of the underwater robots. On this basis, DDPG algorithm is applied in large-scale path planning of underwater robots. TensorFlow is used to build Actor and Critic neural network structures, and design environment state models, action state spaces, and reward functions. In deep reinforcement learning, the large-scale navigation planning for the underwater robot, through exploration-online trial and error, finds the optimal search strategy, and considers obtaining the maximum expected reward during the path planning procedure, achieving the large-scale path planning for the underwater robot. According to the experimental results, the proposed algorithm demonstrates good performance in large-scale path planning for underwater robots and effectively improves both the accuracy and efficiency of the planning process.

引用

页码：204 / 210

页数：7

共 20 条

[1] An Energy Efficient Solution for Fuel Cell Heat Recovery in Zero-Emission Ferry Boats: Deep Deterministic Policy Gradient
Ahmadi, Hoda
Rafiei, Mehdi
Afshari Igder, Moseyeb
Gheisarnejad, Meysam
Khooban, Mohammad-Hassan
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (08) : 7571 - 7581
[2] Aloba L.T., 2019, Shipbuilding and Marine Infrastructure, V1, P74
[3] [Быкова В.С. Bykova V.S.], 2021, [Гироскопия и навигация, Giroskopiya i navigatsiya, Giroskopiya i navigatsiya], V29, P97, DOI 10.17285/0869-7035.0058
[4] A fuzzy-based potential field hierarchical reinforcement learning approach for target hunting by multi-AUV in 3-D underwater environments
Cao, Xiang
Zuo, Fen
[J]. INTERNATIONAL JOURNAL OF CONTROL, 2021, 94 (05) : 1334 - 1343
[5] Non-Communication Decentralized Multi-Robot Collision Avoidance in Grid Map Workspace with Double Deep Q-Network
Chen, Lin
Zhao, Yongting
Zhao, Huanjun
Zheng, Bin
[J]. SENSORS, 2021, 21 (03) : 1 - 15
[6] Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay
Cicek, Dogan C.
Duran, Enes
Saglam, Baturay
Mutlu, Furkan B.
Kozat, Suleyman S.
[J]. 2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 1255 - 1262
[7] Krishnan A., 2020, International Journal of Robotics and Automation, V6, P1
[8] Constrained path planning of autonomous underwater vehicle using selectively-hybridized particle swarm optimization algorithms
Lim, Hui Sheng
Fan, Shuangshuang
Chin, Christopher K. H.
Chai, Shuhong
Bose, Neil
Kim, Eonjoo
[J]. IFAC PAPERSONLINE, 2019, 52 (21): : 315 - 322
[9] Sliding Mode Control with Gaussian Process Regression for Underwater Robots
Lima, Gabriel S.
Trimpe, Sebastian
Bessa, Wallace M.
[J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 99 (3-4) : 487 - 498
[10] Multi-autonomous underwater vehicles collaboratively search for intelligent targets in an unknown environment in the presence of interception
Ma, Xi-wen
Chen, Yan-li
Bai, Gui-qiang
Sha, Yong-bai
Liu, Jun
[J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2021, 235 (09) : 1539 - 1554

← 1 2 →