Online Unmanned Ground Vehicle Path Planning Based on Multi-Attribute Intelligent Reinforcement Learning for Mine Search and Rescue

被引:0
作者
Zhang, Shanfan [1 ]
Zeng, Qingshuang [1 ]
机构
[1] Harbin Inst Technol, Space Control & Inertial Technol Res Ctr, Harbin 150001, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 19期
基金
中国国家自然科学基金;
关键词
search and rescue (SAR); unmanned system; path planning; partially observable Markov decision process (POMDP); gray system; ENVIRONMENTS;
D O I
10.3390/app14199127
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Aiming to improve the efficiency of the online process in path planning, a novel searching method is proposed based on environmental information analysis. Firstly, a search and rescue (SAR) environmental model and an unmanned ground vehicle (UGV) motion model are established according to the characteristics of a mining environment. Secondly, an online search area path-planning method is proposed based on the gray system theory and the reinforcement learning theory to handle multiple constraints. By adopting the multi-attribute intelligent (MAI) gray decision process, the action selection decision can be dynamically adjusted based on the current environment, ensuring the stable convergence of the model. Finally, experimental verification is conducted in different small-scale mine SAR simulation scenarios. The experimental results show that the proposed search planning method can capture the target in the search area with a smoother convergence effect and a shorter path length than other path-planning algorithms.
引用
收藏
页数:14
相关论文
共 43 条
  • [1] Active sensing for motion planning in uncertain environments via mutual information policies
    A Macdonald, Ryan
    Smith, Stephen L.
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2019, 38 (2-3) : 146 - 161
  • [2] SLAP: Simultaneous Localization and Planning Under Uncertainty via Dynamic Replanning in Belief Space
    Agha-mohammadi, Ali-akbar
    Agarwal, Saurav
    Kim, Sung-Kyun
    Chakravorty, Suman
    Amato, Nancy M.
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2018, 34 (05) : 1195 - 1214
  • [3] Coverage path planning for maritime search and rescue using reinforcement learning
    Ai, Bo
    Jia, Maoxin
    Xu, Hanwen
    Xu, Jiangling
    Wen, Zhen
    Li, Benshuai
    Zhang, Dan
    [J]. OCEAN ENGINEERING, 2021, 241
  • [4] An Intelligent Decision Algorithm for the Generation of Maritime Search and Rescue Emergency Response Plans
    Ai, Bo
    Li, Benshuai
    Gao, Song
    Xu, Jiangling
    Shang, Hengshuai
    [J]. IEEE ACCESS, 2019, 7 : 155835 - 155850
  • [5] Path Planning for a UGV using Salp Swarm Algorithm
    AlShabi, Mohammad
    Ballous, Khlaled Awad
    Nassif, Ali Bou
    Bettayeb, Maamar
    Obaideen, Khaled
    Gadsden, S. Andrew
    [J]. AUTONOMOUS SYSTEMS:SENSORS, PROCESSING, AND SECURITY FOR GROUND, AIR, SEA, AND SPACE VEHICLES AND INFRASTRUCTURE 2024, 2024, 13052
  • [6] Amato C, 2015, IEEE INT CONF ROBOT, P1241, DOI 10.1109/ICRA.2015.7139350
  • [7] Posterior sampling for Monte Carlo planning under uncertainty
    Bai, Aijun
    Wu, Feng
    Chen, Xiaoping
    [J]. APPLIED INTELLIGENCE, 2018, 48 (12) : 4998 - 5018
  • [8] Multi-UAV Path Planning for Wireless Data Harvesting With Deep Reinforcement Learning
    Bayerlein, Harald
    Theile, Mirco
    Caccamo, Marco
    Gesbert, David
    [J]. IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2021, 2 : 1171 - 1187
  • [9] Reinforcement Learning for POMDP: Partitioned Rollout and Policy Iteration With Application to Autonomous Sequential Repair Problems
    Bhattacharya, Sushmita
    Badyal, Sahil
    Wheeler, Thomas
    Gil, Stephanie
    Bertsekas, Dimitri
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03) : 3967 - 3974
  • [10] Deep Reinforcement Learning-Based Large-Scale Robot Exploration
    Cao, Yuhong
    Zhao, Rui
    Wang, Yizhuo
    Xiang, Bairan
    Sartoretti, Guillaume
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4631 - 4638