A Long-Term Target Search Method for Unmanned Aerial Vehicles Based on Reinforcement Learning

被引:0
作者
Wei, Dexing [1 ]
Zhang, Lun [1 ]
Yang, Mei [1 ]
Deng, Hanqiang [1 ]
Huang, Jian [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
关键词
UAVs; reinforcement learning; long-term search task; multi-agent;
D O I
10.3390/drones8100536
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Unmanned aerial vehicles (UAVs) are increasingly being employed in search operations. Deep reinforcement learning (DRL), owing to its robust self-learning and adaptive capabilities, has been extensively applied to drone search tasks. However, traditional DRL approaches often suffer from long training times, especially in long-term search missions for UAVs, where the interaction cycles between the agent and the environment are extended. This paper addresses this critical issue by introducing a novel method-temporally asynchronous grouped environment reinforcement learning (TAGRL). Our key innovation lies in recognizing that as the number of training environments increases, agents can learn knowledge from discontinuous trajectories. This insight leads to the design of grouped environments, allowing agents to explore only a limited number of steps within each interaction cycle rather than completing full sequences. Consequently, TAGRL demonstrates faster learning speeds and lower memory consumption compared to existing parallel environment learning methods. The results indicate that this framework enhances the efficiency of UAV search tasks, paving the way for more scalable and effective applications of RL in complex scenarios.
引用
收藏
页数:18
相关论文
共 27 条
  • [11] Makoviychuk V., 2021, Neurips Datasets Benchmarks, V1
  • [12] Moritz P, 2018, PROCEEDINGS OF THE 13TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P561
  • [13] Ren Zhicai, 2021, 2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS), P140, DOI 10.1109/ICCCS52626.2021.9449283
  • [14] Schulman J, 2017, Arxiv, DOI [arXiv:1707.06347, 10.48550/arXiv.1707.06347, DOI 10.48550/ARXIV.1707.06347]
  • [15] Multi-UAV Cooperative Search Based on Reinforcement Learning With a Digital Twin Driven Training Framework
    Shen, Gaoqing
    Lei, Lei
    Zhang, Xinting
    Li, Zhilin
    Cai, Shengsuo
    Zhang, Lijuan
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (07) : 8354 - 8368
  • [16] An improved adaptive differential evolution algorithm for single unmanned aerial vehicle multitasking
    Su, Jian-li
    Wang, Hua
    [J]. DEFENCE TECHNOLOGY, 2021, 17 (06) : 1967 - 1975
  • [17] Multi-UAV Cooperative Searching and Tracking for Moving Targets Based on Multi-Agent Reinforcement Learning
    Su, Kai
    Qian, Feng
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [18] Wang C, 2017, IEEE GLOB CONF SIG, P858, DOI 10.1109/GlobalSIP.2017.8309082
  • [19] A reinforcement learning approach for UAV target searching and tracking
    Wang, Tian
    Qin, Ruoxi
    Chen, Yang
    Snoussi, Hichem
    Choi, Chang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (04) : 4347 - 4364
  • [20] UAV Swarm Cooperative Dynamic Target Search: A MAPPO-Based Discrete Optimal Control Method
    Wei, Dexing
    Zhang, Lun
    Liu, Quan
    Chen, Hao
    Huang, Jian
    [J]. DRONES, 2024, 8 (06)