A Smart Flight Controller based on Reinforcement Learning for Unmanned Aerial Vehicle (UAV)

被引：1

作者：

Khan, Fawad Salam ^{[1
,2
]}

Mohd, Mohd Norzali Haji ^{[1
]}

Lank, Raja Masood ^{[3
]}

Khan, Muhammad Danial ^{[4
]}

Abbasi, Muhammad Inam ^{[5
]}

Bagchi, Susama ^{[1
]}

机构：

[1] Univ Tun Hussein Onn Malaysia, Fac Elect & Elect Engn, Parit Raja, Malaysia

[2] CONVSYS Pvt Ltd, Parit Raja, Malaysia

[3] NED Univ Engg & Tech NEDUET, Dept Elect Engn, Karachi, Pakistan

[4] CONVSYS Pvt Ltd, Dept Machine Learning Automat & Innovat, Islamabad, Pakistan

[5] Univ Tekn Melaka Malaysia UTeM, Dept Elect & Comp Engg, Melaka, Malaysia

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS, IEEE ICSIPA 2021 | 2021年

关键词：

UAV; Reinforcement Learning; Smart flight controller; Reward Function;

D O I：

10.1109/ICSIPA52582.2021.9576806

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Traditional flight controllers consist of Proportional Integral Derivates (PID), that although have dominant stability control but required high human interventions. In this study, a smart flight controller is developed for controlling UAVs which produces operator less mechanisms for flight controllers. It uses a neural network that has been trained using reinforcement learning techniques. Engineered with a variety of actuators (pitch, yaw, roll, and speed), the next-generation flight controller is directly trained to control its own decisions in flight. It also optimizes learning algorithms different from the traditional Actor and Critic networks. The agent gets state information from the environment and calculates the reward function depending on the sensors data from the environment. The agent then receives the observations to identify the state and reward functions and the agent activates the algorithm to perform actions. It shows the performance of a trained neural network consisting of a reward function in both simulation and real-time UAV control. Experimental results show that it can respond with relative precision. Using the same framework shows that UAVs can reliably hover in the air, even under adverse initialization conditions with obstacles. Reward functions computed during the flight for 2500, 5000, 7500 and 10000 episodes between the normalized values 0 and -4000. The computation time observed during each episode is 15 micro sec.

引用

页码：203 / 208

页数：6

共 18 条

[1]

Abbas A.F., 2020, Bull. Electr. Eng. Informatics, V9, P557, DOI [10.11591/eei.v9i2.1865, DOI 10.11591/EEI.V9I2.1865]

[2]

Ali Abdul Malik Mohd, 2018, Journal of Physics: Conference Series, V1049, DOI 10.1088/1742-6596/1049/1/012067

[3]

Bueno M.B., 2017, Deep Learn. Image Process. Appl., V31, P3, DOI [DOI 10.3233/978-1-61499-822-8-164, DOI 10.48550/ARXIV.1611.03718]

[4]

Degree I. N., 3D Hand Pose Tracking from Depth Images using Deep Reinforcement Learning

[5] Human-Computer Interaction using hand gestures [J].

Gota, Dan-Ioan ;

Puscasiu, Adela ;

Fanca, Alexandra ;

Valean, Honoriu ;

Miclea, Liviu .

PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR), 2020, :195-199

[6] Deep reinforcement learning for drone navigation using sensor data [J].

Hodge, Victoria J. ;

Hawkins, Richard ;

Alexander, Rob .

NEURAL COMPUTING & APPLICATIONS, 2021, 33 (06) :2015-2033

[7] Control of a Quadrotor With Reinforcement Learning [J].

Hwangbo, Jemin ;

Sa, Inkyu ;

Siegwart, Roland ;

Hutter, Marco .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (04) :2096-2103

[8]

Khan F. S., 3D Hand Gestures Segmentation and Optimized Classification Using Deep Learning

[9] Breast Cancer Histological Images Nuclei Segmentation using Mask Regional Convolutional Neural Network [J].

Khan, Fawad Salam ;

Mohd, Mohd Norzali Haji ;

Khan, Muhammad Danial ;

Bagchi, Susama .

2020 18TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2020, :427-432

[10] Reinforcement Learning for UAV Attitude Control [J].

Koch, William ;

Mancuso, Renato ;

West, Richard ;

Bestavros, Azer .

ACM TRANSACTIONS ON CYBER-PHYSICAL SYSTEMS, 2019, 3 (02)

← 1 2 →