Deep reinforcement learning based trajectory optimization for magnetometer-mounted UAV to landmine detection

被引:11
作者
Barnawi, Ahmed [1 ]
Kumar, Neeraj [1 ]
Budhiraja, Ishan [2 ]
Kumar, Krishan [1 ]
Almansour, Amal [1 ]
Alzahrani, Bander [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & IT, Jeddah, Saudi Arabia
[2] Bennett Univ, Sch Comp Sci Engn & Technol, Greater Noida, Uttar Pradesh, India
关键词
Landmine; UAV; DRL; Energy; Magnetometer; SCHEME; ENERGY;
D O I
10.1016/j.comcom.2022.09.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unmanned aerial vehicles (UAVs) have emerged as a viable choice for data collection and landmine (LM) detection. The LM buried under the dirt or sand is detected using a UAV-mounted magnetometer in this paper. A UAV is deployed to gather data along the intended route when the magnetometer receives a signal from the LMs. During a whole round of data collection, we want to reduce the total energy consumption of the UAV-Magnetometer-LM system. To do this, we turn the energy consumption reduction issue into a limited combinatorial optimization problem by concurrently picking time slots and arranging the UAV's visitation sequence to identify the LM. The problem of minimizing energy usage is NP-hard, making it difficult to solve optimally. In order to tackle this challenge, we used the deep reinforcement learning (DRL) based deep deterministic policy gradient (DDPG) scheme. DDPG is used to enhance the convergence speed and eliminate redundant computations. Furthermore, to improve the detection in real-time, we proposed the proximal online policy technique (POPT). Numerical results demonstrate that the proposed scheme consumes 37.14%, 31.25%, and 21.42% better results than synthetic aperture radar (SAR), convolution neural network (CNN), and double deep recurrent Q-network (DDRQN).
引用
收藏
页码:441 / 450
页数:10
相关论文
共 39 条
[1]  
[Anonymous], 2017, L M REPORT CAS
[2]   Modeling and Simulation of Very High Spatial Resolution UXOs and Landmines in a Hyperspectral Scene for UAV Survey [J].
Bajic, Milan, Jr. ;
Bajic, Milan .
REMOTE SENSING, 2021, 13 (05) :1-40
[3]   Secure clustering for efficient data dissemination in vehicular cyber-physical systems [J].
Bali, Rasmeet S. ;
Kumar, Neeraj .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2016, 56 :476-492
[4]   Applying Deep Learning to Automate UAV-Based Detection of Scatterable Landmines [J].
Baur, Jasper ;
Steinberg, Gabriel ;
Nikulin, Alex ;
Chiu, Kenneth ;
de Smet, Timothy S. .
REMOTE SENSING, 2020, 12 (05)
[5]  
Bouzid Y, 2017, IEEE INT C INT ROBOT, P979, DOI 10.1109/IROS.2017.8202264
[6]  
Budhiraja I., 2021, IEEE Trans. Mob. Comput.
[7]   Energy Consumption Minimization Scheme for NOMA-Based Mobile Edge Computation Networks Underlaying UAV [J].
Budhiraja, Ishan ;
Kumar, Neeraj ;
Tyagi, Sudhanshu ;
Tanwar, Sudeep .
IEEE SYSTEMS JOURNAL, 2021, 15 (04) :5724-5733
[8]   A Systematic Review on NOMA Variants for 5G and Beyond [J].
Budhiraja, Ishan ;
Kumar, Neeraj ;
Tyagi, Sudhanshu ;
Tanwar, Sudeep ;
Han, Zhu ;
Piran, Md Jalil ;
Suh, Doug Young .
IEEE ACCESS, 2021, 9 :85573-85644
[9]   Deep-Reinforcement-Learning-Based Proportional Fair Scheduling Control Scheme for Underlay D2D Communication [J].
Budhiraja, Ishan ;
Kumar, Neeraj ;
Tyagi, Sudhanshu .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) :3143-3156
[10]  
Budhiraja I, 2020, IEEE CONF COMPUT, P877, DOI 10.1109/INFOCOMWKSHPS50562.2020.9162839