A new approach for drone tracking with drone using Proximal Policy Optimization based distributed deep reinforcement learning

被引:6
作者
Tan, Ziya [1 ]
Karakose, Mehmet [2 ]
机构
[1] Tokat Gaziosmanpasa Univ, Tokat, Turkiye
[2] Firat Univ Elazig, Elazig, Turkiye
关键词
Distributed learning; Drone tracking; Reinforcement learning; Proximal Policy Optimization; UAVS;
D O I
10.1016/j.softx.2023.101497
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, a distributed deep reinforcement learning algorithm based on Proximal Policy Optimization (PPO) is proposed for an unmanned aerial vehicle (UAV) to autonomously track another UAV. Accordingly, this paper makes three important contributions to the literature. The first one is the development of an efficient UAV tracking algorithm, the second one is the presentation of a deep reinforcement learning approach that can be adapted to the problem, and the third one is the introduction of a generalized distributed deep reinforcement learning platform that can be used in various problems such as tracking, control and mission coordination of UAVs. In order to validate the developed approaches, the PPO algorithm is simulated with the deep reinforcement learning algorithm in a distributed and non-distributed manner, a follower UAV is trained in different scenarios and the distributed and non-distributed performances of the training using CPU are obtained, scenarios using general and adaptive learning algorithms are given, and finally, the performances of the algorithms developed in the paper are presented explicitly. (c) 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:8
相关论文
共 37 条
  • [1] Drones Chasing Drones: Reinforcement Learning and Deep Search Area Proposal
    Akhloufi, Moulay A.
    Arola, Sebastien
    Bonnet, Alexandre
    [J]. DRONES, 2019, 3 (03) : 1 - 14
  • [2] Distributed Deep CNN-LSTM Model for Intrusion Detection Method in IoT-Based Vehicles
    Alferaidi, Ali
    Yadav, Kusum
    Alharbi, Yasser
    Razmjooy, Navid
    Viriyasitavat, Wattana
    Gulati, Kamal
    Kautish, Sandeep
    Dhiman, Gaurav
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [3] Autonomous Unmanned Aerial Vehicle navigation using Reinforcement Learning: A systematic review
    AlMahamid, Fadi
    Grolinger, Katarina
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 115
  • [4] Bertoin D, 2022, 12 INT WORKSH AG TRA, P3173
  • [5] Using Reinforcement Learning to Minimize the Probability of Delay Occurrence in Transportation
    Cao, Zhiguang
    Guo, Hongliang
    Song, Wen
    Gao, Kaizhou
    Chen, Zhenghua
    Zhang, Le
    Zhang, Xuexi
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (03) : 2424 - 2436
  • [6] Autonomous Tracking Using a Swarm of UAVs: A Constrained Multi-Agent Reinforcement Learning Approach
    Chen, Yu-Jia
    Chang, Deng-Kai
    Zhang, Cheng
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 13702 - 13717
  • [7] Tracking Drones with Drones Using Millimeter Wave Radar
    Dogru, Sedat
    Baptista, Rui
    Marques, Lino
    [J]. FOURTH IBERIAN ROBOTICS CONFERENCE: ADVANCES IN ROBOTICS, ROBOT 2019, VOL 2, 2020, 1093 : 392 - 402
  • [8] Distributed Reinforcement Learning for scalable wireless medium access in IoTs and sensor networks
    Dutta, Hrishikesh
    Biswas, Subir
    [J]. COMPUTER NETWORKS, 2022, 202
  • [9] Espeholt L, 2018, PR MACH LEARN RES, V80
  • [10] Heess Nicolas, 2017, Emergence of locomotion behaviours in rich environments