A new approach for drone tracking with drone using Proximal Policy Optimization based distributed deep reinforcement learning

Cited by: 6

Authors:

Tan, Ziya [1]

Karakose, Mehmet [2]

Affiliations:

[1] Tokat Gaziosmanpasa Univ, Tokat, Turkiye

[2] Firat Univ Elazig, Elazig, Turkiye

Source:

SOFTWAREX | 2023 / Volume 23

Keywords:

Distributed learning; Drone tracking; Reinforcement learning; Proximal Policy Optimization; UAVs

DOI:

10.1016/j.softx.2023.101497

Chinese Library Classification (CLC) number:

TP31 [Computer software]

Discipline classification codes:

081202; 0835

Abstract:

This paper proposes a distributed deep reinforcement learning algorithm based on Proximal Policy Optimization (PPO) that enables an unmanned aerial vehicle (UAV) to autonomously track another UAV. It makes three contributions to the literature: first, an efficient UAV tracking algorithm; second, a deep reinforcement learning approach that can be adapted to the problem; and third, a generalized distributed deep reinforcement learning platform that can be used for various problems such as tracking, control, and mission coordination of UAVs. To validate the developed approaches, the PPO-based deep reinforcement learning algorithm is simulated in both distributed and non-distributed configurations, a follower UAV is trained in different scenarios, the distributed and non-distributed training performances on CPU are measured, scenarios using general and adaptive learning algorithms are presented, and finally the performance of the algorithms developed in the paper is reported explicitly. © 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).


Pages: 8

