An Adaptive Multi-Level Quantization-Based Reinforcement Learning Model for Enhancing UAV Landing on Moving Targets

被引:9
作者
Abo Mosali, Najmaddin [1 ]
Shamsudin, Syariful Syafiq [1 ]
Mostafa, Salama A. [2 ]
Alfandi, Omar [3 ]
Omar, Rosli [4 ]
Al-Fadhali, Najib [4 ]
Mohammed, Mazin Abed [5 ]
Malik, R. Q. [6 ]
Jaber, Mustafa Musa [7 ,8 ]
Saif, Abdu [9 ]
机构
[1] Univ Tun Hussein Onn Malaysia, Fac Mech & Mfg Engn, Res Ctr Unmanned Vehicles, Batu Pahat 86400, Johor, Malaysia
[2] Univ Tun Hussin Onn Malaysia, Fac Comp Sci & Informat Technol, Batu Pahat 84600, Johor, Malaysia
[3] Zayed Univ, Coll Technol Innovat, Abu Dhabi 4783, U Arab Emirates
[4] Univ Tun Hussein Onn Malaysia, Fac Elect & Elect Engn, Batu Pahat 86400, Johor, Malaysia
[5] Univ Anbar, Coll Comp Sci & Informat Technol, Ramadi 31001, Iraq
[6] Al Mustaqbal Univ Coll, Dept Med Instrumentat Tech Engn, Hillah 51001, Iraq
[7] Dijlah Univ Coll, Dept Med Instruments Engn Tech, Baghdad 10021, Iraq
[8] Al Farahidi Univ, Dept Med Instruments Engn Tech, Baghdad 10021, Iraq
[9] Univ Malaya, Dept Elect Engn, Fac Engn, Kuala Lumpur 50603, Selangor, Malaysia
关键词
unmanned aerial vehicle (UAV); autonomous landing; deep-neural network; reinforcement learning; multi-level quantization; Q-learning; VEHICLE; TRACKING;
D O I
10.3390/su14148825
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The autonomous landing of an unmanned aerial vehicle (UAV) on a moving platform is an essential functionality in various UAV-based applications. It can be added to a teleoperation UAV system or part of an autonomous UAV control system. Various robust and predictive control systems based on the traditional control theory are used for operating a UAV. Recently, some attempts were made to land a UAV on a moving target using reinforcement learning (RL). Vision is used as a typical way of sensing and detecting the moving target. Mainly, the related works have deployed a deep-neural network (DNN) for RL, which takes the image as input and provides the optimal navigation action as output. However, the delay of the multi-layer topology of the deep neural network affects the real-time aspect of such control. This paper proposes an adaptive multi-level quantization-based reinforcement learning (AMLQ) model. The AMLQ model quantizes the continuous actions and states to directly incorporate simple Q-learning to resolve the delay issue. This solution makes the training faster and enables simple knowledge representation without needing the DNN. For evaluation, the AMLQ model was compared with state-of-art approaches and was found to be superior in terms of root mean square error (RMSE), which was 8.7052 compared with the proportional-integral-derivative (PID) controller, which achieved an RMSE of 10.0592.
引用
收藏
页数:17
相关论文
共 40 条
[1]   Detection of Nuclear Sources by UAV Teleoperation Using a Visuo-Haptic Augmented Reality Interface [J].
Aleotti, Jacopo ;
Micconi, Giorgio ;
Caselli, Stefano ;
Benassi, Giacomo ;
Zambelli, Nicola ;
Bettelli, Manuele ;
Zappettini, Andrea .
SENSORS, 2017, 17 (10)
[2]   Vision Based Autonomous Landing of Multirotor UAV on Moving Platform [J].
Araar, Oualid ;
Aouf, Nabil ;
Vitanov, Ivan .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2017, 85 (02) :369-384
[3]  
Baca T, 2017, 2017 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR)
[4]   The ETH-MAV Team in the MBZ International Robotics Challenge [J].
Baehnemann, Rik ;
Pantie, Michael ;
Popovic, Marija ;
Schindler, Dominik ;
Tranzatto, Marco ;
Kamel, Mina ;
Grimm, Marius ;
Widauer, Jakob ;
Siegwart, Roland ;
Nieto, Juan .
JOURNAL OF FIELD ROBOTICS, 2019, 36 (01) :78-103
[5]   On-board model-based automatic collision avoidance: application in remotely-piloted unmanned aerial vehicles [J].
Bareiss, Daman ;
Bourne, Joseph R. ;
Leang, Kam K. .
AUTONOMOUS ROBOTS, 2017, 41 (07) :1539-1554
[6]  
Beul M, 2017, 2017 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR)
[7]  
Falanga D, 2017, 2017 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY AND RESCUE ROBOTICS (SSRR), P200, DOI 10.1109/SSRR.2017.8088164
[8]  
Fliess M., 2009, IFAC P, V42, P1531, DOI 10.3182 20090706-3- / FR-2004.00256
[9]  
Forster C, 2015, IEEE INT CONF ROBOT, P111, DOI 10.1109/ICRA.2015.7138988
[10]   Drone delivery systems: job assignment and dimensioning [J].
Grippa, Pasquale ;
Behrens, Doris A. ;
Wall, Friederike ;
Bettstetter, Christian .
AUTONOMOUS ROBOTS, 2019, 43 (02) :261-274