Online Unmanned Ground Vehicle Path Planning Based on Multi-Attribute Intelligent Reinforcement Learning for Mine Search and Rescue

被引：0

作者：

Zhang, Shanfan ^{[1
]}

Zeng, Qingshuang ^{[1
]}

机构：

[1] Harbin Inst Technol, Space Control & Inertial Technol Res Ctr, Harbin 150001, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 19期

基金：

中国国家自然科学基金;

关键词：

search and rescue (SAR); unmanned system; path planning; partially observable Markov decision process (POMDP); gray system; ENVIRONMENTS;

D O I：

10.3390/app14199127

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Aiming to improve the efficiency of the online process in path planning, a novel searching method is proposed based on environmental information analysis. Firstly, a search and rescue (SAR) environmental model and an unmanned ground vehicle (UGV) motion model are established according to the characteristics of a mining environment. Secondly, an online search area path-planning method is proposed based on the gray system theory and the reinforcement learning theory to handle multiple constraints. By adopting the multi-attribute intelligent (MAI) gray decision process, the action selection decision can be dynamically adjusted based on the current environment, ensuring the stable convergence of the model. Finally, experimental verification is conducted in different small-scale mine SAR simulation scenarios. The experimental results show that the proposed search planning method can capture the target in the search area with a smoother convergence effect and a shorter path length than other path-planning algorithms.

引用

页数：14

共 43 条

[1] Active sensing for motion planning in uncertain environments via mutual information policies
A Macdonald, Ryan
Smith, Stephen L.
[J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2019, 38 (2-3) : 146 - 161
[2] SLAP: Simultaneous Localization and Planning Under Uncertainty via Dynamic Replanning in Belief Space
Agha-mohammadi, Ali-akbar
Agarwal, Saurav
Kim, Sung-Kyun
Chakravorty, Suman
Amato, Nancy M.
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2018, 34 (05) : 1195 - 1214
[3] Coverage path planning for maritime search and rescue using reinforcement learning
Ai, Bo
Jia, Maoxin
Xu, Hanwen
Xu, Jiangling
Wen, Zhen
Li, Benshuai
Zhang, Dan
[J]. OCEAN ENGINEERING, 2021, 241
[4] An Intelligent Decision Algorithm for the Generation of Maritime Search and Rescue Emergency Response Plans
Ai, Bo
Li, Benshuai
Gao, Song
Xu, Jiangling
Shang, Hengshuai
[J]. IEEE ACCESS, 2019, 7 : 155835 - 155850
[5] Path Planning for a UGV using Salp Swarm Algorithm
AlShabi, Mohammad
Ballous, Khlaled Awad
Nassif, Ali Bou
Bettayeb, Maamar
Obaideen, Khaled
Gadsden, S. Andrew
[J]. AUTONOMOUS SYSTEMS:SENSORS, PROCESSING, AND SECURITY FOR GROUND, AIR, SEA, AND SPACE VEHICLES AND INFRASTRUCTURE 2024, 2024, 13052
[6] Amato C, 2015, IEEE INT CONF ROBOT, P1241, DOI 10.1109/ICRA.2015.7139350
[7] Posterior sampling for Monte Carlo planning under uncertainty
Bai, Aijun
Wu, Feng
Chen, Xiaoping
[J]. APPLIED INTELLIGENCE, 2018, 48 (12) : 4998 - 5018
[8] Multi-UAV Path Planning for Wireless Data Harvesting With Deep Reinforcement Learning
Bayerlein, Harald
Theile, Mirco
Caccamo, Marco
Gesbert, David
[J]. IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2021, 2 : 1171 - 1187
[9] Reinforcement Learning for POMDP: Partitioned Rollout and Policy Iteration With Application to Autonomous Sequential Repair Problems
Bhattacharya, Sushmita
Badyal, Sahil
Wheeler, Thomas
Gil, Stephanie
Bertsekas, Dimitri
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03) : 3967 - 3974
[10] Deep Reinforcement Learning-Based Large-Scale Robot Exploration
Cao, Yuhong
Zhao, Rui
Wang, Yizhuo
Xiang, Bairan
Sartoretti, Guillaume
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4631 - 4638

← 1 2 3 4 5 →