Deep reinforcement learning based trajectory optimization for magnetometer-mounted UAV to landmine detection

被引:11
作者
Barnawi, Ahmed [1 ]
Kumar, Neeraj [1 ]
Budhiraja, Ishan [2 ]
Kumar, Krishan [1 ]
Almansour, Amal [1 ]
Alzahrani, Bander [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & IT, Jeddah, Saudi Arabia
[2] Bennett Univ, Sch Comp Sci Engn & Technol, Greater Noida, Uttar Pradesh, India
关键词
Landmine; UAV; DRL; Energy; Magnetometer; SCHEME; ENERGY;
D O I
10.1016/j.comcom.2022.09.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unmanned aerial vehicles (UAVs) have emerged as a viable choice for data collection and landmine (LM) detection. The LM buried under the dirt or sand is detected using a UAV-mounted magnetometer in this paper. A UAV is deployed to gather data along the intended route when the magnetometer receives a signal from the LMs. During a whole round of data collection, we want to reduce the total energy consumption of the UAV-Magnetometer-LM system. To do this, we turn the energy consumption reduction issue into a limited combinatorial optimization problem by concurrently picking time slots and arranging the UAV's visitation sequence to identify the LM. The problem of minimizing energy usage is NP-hard, making it difficult to solve optimally. In order to tackle this challenge, we used the deep reinforcement learning (DRL) based deep deterministic policy gradient (DDPG) scheme. DDPG is used to enhance the convergence speed and eliminate redundant computations. Furthermore, to improve the detection in real-time, we proposed the proximal online policy technique (POPT). Numerical results demonstrate that the proposed scheme consumes 37.14%, 31.25%, and 21.42% better results than synthetic aperture radar (SAR), convolution neural network (CNN), and double deep recurrent Q-network (DDRQN).
引用
收藏
页码:441 / 450
页数:10
相关论文
共 39 条
[11]   Tactile Internet for Smart Communities in 5G: An Insight for NOMA-Based Solutions [J].
Budhiraja, Ishan ;
Tyagi, Sudhanshu ;
Tanwar, Sudeep ;
Kumar, Neeraj ;
Rodrigues, Joel J. P. C. .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (05) :3104-3112
[12]  
Cabreira Taua M., 2019, 2019 International Conference on Unmanned Aircraft Systems (ICUAS), P758, DOI 10.1109/ICUAS.2019.8797937
[13]   Network Service Chaining in Fog and Cloud Computing for the 5G Environment: Data Management and Security Challenges [J].
Chaudhary, Rajat ;
Kumar, Neeraj ;
Zeadally, Sherali .
IEEE COMMUNICATIONS MAGAZINE, 2017, 55 (11) :114-122
[14]   Coverage Path Planning for UAVs Photogrammetry with Energy and Resolution Constraints [J].
Di Franco, Carmelo ;
Buttazzo, Giorgio .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2016, 83 (3-4) :445-462
[15]   Energy-aware Coverage Path Planning of UAVs [J].
Di Franco, Carmelo ;
Buttazzo, Giorgio .
2015 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC), 2015, :111-117
[16]   Synthetic Aperture Radar Imaging System for Landmine Detection Using a Ground Penetrating Radar on Board a Unmanned Aerial Vehicle [J].
Garcia Fernandez, Maria ;
Alvarez Lopez, Yuri ;
Arboleya, Ana Arboleya ;
Gonzalez Valdes, Borja ;
Rodriguez Vaqueiro, Yolanda ;
Las-Heras Andres, Fernando ;
Pino Garcia, Antonio .
IEEE ACCESS, 2018, 6 :45100-45112
[17]   Power Optimization in Device-to-Device Communications: A Deep Reinforcement Learning Approach With Dynamic Reward [J].
Ji, Zelin ;
Kiani, Adnan K. ;
Qin, Zhijin ;
Ahmad, Rizwan .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (03) :508-511
[18]  
Jiao YS, 2010, C IND ELECT APPL, P315
[19]  
Keeley R., 2017, J. Conventional Weapons Destruction, V21, P3
[20]   Collaborative Learning Automata-Based Routing for Rescue Operations in Dense Urban Regions Using Vehicular Sensor Networks [J].
Kumar, Neeraj ;
Misra, Sudip ;
Obaidat, Mohammad S. .
IEEE SYSTEMS JOURNAL, 2015, 9 (03) :1081-1090