Application of Reinforcement Learning in Controlling Quadrotor UAV Flight Actions

被引:2
|
作者
Shen, Shang-En [1 ]
Huang, Yi-Cheng [1 ]
机构
[1] Natl Chung Hsing Univ, Dept Mech Engn, Taichung 40227, Taiwan
关键词
quadrotor UAV; reinforcement learning; logic control; target recognition; action decision making;
D O I
10.3390/drones8110660
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Most literature has extensively discussed reinforcement learning (RL) for controlling rotorcraft drones during flight for traversal tasks. However, most studies lack adequate details regarding the design of reward and punishment mechanisms, and there is a limited exploration of the feasibility of applying reinforcement learning in actual flight control following simulation experiments. Consequently, this study focuses on the exploration of reward and punishment design and state input for RL. The simulation environment is constructed using AirSim and Unreal Engine, with onboard camera footage serving as the state input for reinforcement learning. The research investigates three RL algorithms suitable for discrete action training. The Deep Q Network (DQN), Advantage Actor-Critic (A2C), and Proximal Policy Optimization (PPO) were combined with three different reward and punishment design mechanisms for training and testing. The results indicate that employing the PPO algorithm along with a continuous return method as the reward mechanism allows for effective convergence during the training process, achieving a target traversal rate of 71% in the testing environment. Furthermore, this study proposes integrating the YOLOv7-tiny object detection (OD) system to assess the applicability of reinforcement learning in real-world settings. Unifying the state inputs of simulated and OD environments and replacing the original simulated image inputs with a maximum dual-target approach, the experimental simulation achieved a target traversal rate of 52% ultimately. In summary, this research formulates a set of logical frameworks for an RL reward and punishment design deployed with real-time Yolo's OD implementation synergized as a useful aid for related RL studies.
引用
收藏
页数:25
相关论文
共 50 条
  • [21] Subtask-masked curriculum learning for reinforcement learning with application to UAV maneuver decision-making
    Hou, Yueqi
    Liang, Xiaolong
    Lv, Maolong
    Yang, Qisong
    Li, Yang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [22] Waypoint Tracking Control for a Quadrotor based on PID and Reinforcement Learning
    Bao, Xurui
    Jing, Zhouhui
    CONTROL ENGINEERING AND APPLIED INFORMATICS, 2023, 25 (01): : 90 - 100
  • [23] Intelligent Control of a Quadrotor with Proximal Policy Optimization Reinforcement Learning
    Lopes, Guilherme Cano
    Ferreira, Murillo
    Simoes, Alexandre da Silva
    Colombini, Esther Luna
    15TH LATIN AMERICAN ROBOTICS SYMPOSIUM 6TH BRAZILIAN ROBOTICS SYMPOSIUM 9TH WORKSHOP ON ROBOTICS IN EDUCATION (LARS/SBR/WRE 2018), 2018, : 503 - 508
  • [24] Learning More Complex Actions with Deep Reinforcement Learning
    Wang, Chenxi
    Du, Youtian
    Xie, Shengyuan
    Lu, Yongdi
    2021 FIFTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2021), 2021, : 121 - 122
  • [25] Reinforcement learning based flight controller capable of controlling a quadcopter with four, three and two working motors
    Dooraki, Amir Ramezani
    Lee, Deok-Jin
    2020 20TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2020, : 161 - 166
  • [26] UAV Resource Cooperation Based on Reinforcement Learning
    Shan, Mingang
    Xiong, Jian
    Liu, Bo
    Shi, Zhiping
    Li, Xia
    Miao, Ningjie
    2021 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2021,
  • [27] Robust reinforcement learning control for quadrotor with input delay and uncertainties
    Zhang, Zizuo
    Fei, Yuanyuan
    Zhou, Jiayi
    Yu, Yao
    Sun, Changyin
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2024, 361 (13):
  • [28] Adaptive Trajectory Tracking Control using Reinforcement Learning for Quadrotor
    Lou, Wenjie
    Guo, Xiao
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2016, 13
  • [29] UAV reinforcement learning control algorithm with demonstrations
    Sun D.
    Gao D.
    Zheng J.
    Han P.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2023, 49 (06): : 1424 - 1433
  • [30] Quadrotor Control using Reinforcement Learning under Wind Disturbance
    Lu, Songshuo
    Li, Yanjie
    Liu, Zihan
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 3233 - 3240