An Adaptive Multi-Level Quantization-Based Reinforcement Learning Model for Enhancing UAV Landing on Moving Targets

被引：9

作者：

Abo Mosali, Najmaddin ^{[1
]}

Shamsudin, Syariful Syafiq ^{[1
]}

Mostafa, Salama A. ^{[2
]}

Alfandi, Omar ^{[3
]}

Omar, Rosli ^{[4
]}

Al-Fadhali, Najib ^{[4
]}

Mohammed, Mazin Abed ^{[5
]}

Malik, R. Q. ^{[6
]}

Jaber, Mustafa Musa ^{[7
,8
]}

Saif, Abdu ^{[9
]}

机构：

[1] Univ Tun Hussein Onn Malaysia, Fac Mech & Mfg Engn, Res Ctr Unmanned Vehicles, Batu Pahat 86400, Johor, Malaysia

[2] Univ Tun Hussin Onn Malaysia, Fac Comp Sci & Informat Technol, Batu Pahat 84600, Johor, Malaysia

[3] Zayed Univ, Coll Technol Innovat, Abu Dhabi 4783, U Arab Emirates

[4] Univ Tun Hussein Onn Malaysia, Fac Elect & Elect Engn, Batu Pahat 86400, Johor, Malaysia

[5] Univ Anbar, Coll Comp Sci & Informat Technol, Ramadi 31001, Iraq

[6] Al Mustaqbal Univ Coll, Dept Med Instrumentat Tech Engn, Hillah 51001, Iraq

[7] Dijlah Univ Coll, Dept Med Instruments Engn Tech, Baghdad 10021, Iraq

[8] Al Farahidi Univ, Dept Med Instruments Engn Tech, Baghdad 10021, Iraq

[9] Univ Malaya, Dept Elect Engn, Fac Engn, Kuala Lumpur 50603, Selangor, Malaysia

来源：

SUSTAINABILITY | 2022年 / 14卷 / 14期

关键词：

unmanned aerial vehicle (UAV); autonomous landing; deep-neural network; reinforcement learning; multi-level quantization; Q-learning; VEHICLE; TRACKING;

D O I：

10.3390/su14148825

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

The autonomous landing of an unmanned aerial vehicle (UAV) on a moving platform is an essential functionality in various UAV-based applications. It can be added to a teleoperation UAV system or part of an autonomous UAV control system. Various robust and predictive control systems based on the traditional control theory are used for operating a UAV. Recently, some attempts were made to land a UAV on a moving target using reinforcement learning (RL). Vision is used as a typical way of sensing and detecting the moving target. Mainly, the related works have deployed a deep-neural network (DNN) for RL, which takes the image as input and provides the optimal navigation action as output. However, the delay of the multi-layer topology of the deep neural network affects the real-time aspect of such control. This paper proposes an adaptive multi-level quantization-based reinforcement learning (AMLQ) model. The AMLQ model quantizes the continuous actions and states to directly incorporate simple Q-learning to resolve the delay issue. This solution makes the training faster and enables simple knowledge representation without needing the DNN. For evaluation, the AMLQ model was compared with state-of-art approaches and was found to be superior in terms of root mean square error (RMSE), which was 8.7052 compared with the proportional-integral-derivative (PID) controller, which achieved an RMSE of 10.0592.

引用

页数：17

共 40 条

[1] Detection of Nuclear Sources by UAV Teleoperation Using a Visuo-Haptic Augmented Reality Interface [J].

Aleotti, Jacopo ;

Micconi, Giorgio ;

Caselli, Stefano ;

Benassi, Giacomo ;

Zambelli, Nicola ;

Bettelli, Manuele ;

Zappettini, Andrea .

SENSORS, 2017, 17 (10)

[2] Vision Based Autonomous Landing of Multirotor UAV on Moving Platform [J].

Araar, Oualid ;

Aouf, Nabil ;

Vitanov, Ivan .

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2017, 85 (02) :369-384

[3]

Baca T, 2017, 2017 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR)

[4] The ETH-MAV Team in the MBZ International Robotics Challenge [J].

Baehnemann, Rik ;

Pantie, Michael ;

Popovic, Marija ;

Schindler, Dominik ;

Tranzatto, Marco ;

Kamel, Mina ;

Grimm, Marius ;

Widauer, Jakob ;

Siegwart, Roland ;

Nieto, Juan .

JOURNAL OF FIELD ROBOTICS, 2019, 36 (01) :78-103

[5] On-board model-based automatic collision avoidance: application in remotely-piloted unmanned aerial vehicles [J].

Bareiss, Daman ;

Bourne, Joseph R. ;

Leang, Kam K. .

AUTONOMOUS ROBOTS, 2017, 41 (07) :1539-1554

[6]

Beul M, 2017, 2017 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR)

[7]

Falanga D, 2017, 2017 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY AND RESCUE ROBOTICS (SSRR), P200, DOI 10.1109/SSRR.2017.8088164

[8]

Fliess M., 2009, IFAC P, V42, P1531, DOI 10.3182 20090706-3- / FR-2004.00256

[9]

Forster C, 2015, IEEE INT CONF ROBOT, P111, DOI 10.1109/ICRA.2015.7138988

[10] Drone delivery systems: job assignment and dimensioning [J].

Grippa, Pasquale ;

Behrens, Doris A. ;

Wall, Friederike ;

Bettstetter, Christian .

AUTONOMOUS ROBOTS, 2019, 43 (02) :261-274

← 1 2 3 4 →