Application of Reinforcement Learning in Controlling Quadrotor UAV Flight Actions

Cited by: 2
Authors
Shen, Shang-En [1]
Huang, Yi-Cheng [1]
Affiliations
[1] Natl Chung Hsing Univ, Dept Mech Engn, Taichung 40227, Taiwan
Keywords
quadrotor UAV; reinforcement learning; logic control; target recognition; action decision making
DOI
10.3390/drones8110660
Chinese Library Classification (CLC) number
TP7 [Remote sensing technology]
Discipline classification codes
081102; 0816; 081602; 083002; 1404
Abstract
Reinforcement learning (RL) for controlling rotorcraft drones during traversal tasks has been discussed extensively in the literature. However, most studies give few details on the design of reward and punishment mechanisms, and few explore whether RL can be applied to actual flight control after simulation experiments. This study therefore focuses on reward and punishment design and on the state input for RL. The simulation environment is built with AirSim and Unreal Engine, and onboard camera footage serves as the state input. Three RL algorithms suitable for discrete-action training, Deep Q Network (DQN), Advantage Actor-Critic (A2C), and Proximal Policy Optimization (PPO), were each combined with three different reward and punishment designs for training and testing. The results indicate that PPO with a continuous return method as the reward mechanism converges effectively during training and achieves a target traversal rate of 71% in the testing environment. To assess the applicability of RL in real-world settings, the study further proposes integrating the YOLOv7-tiny object detection (OD) system. With the state inputs of the simulated and OD environments unified, and the original simulated image inputs replaced by a maximum dual-target approach, the experimental simulation ultimately achieved a target traversal rate of 52%. In summary, this research formulates a logical framework for RL reward and punishment design, deployed together with real-time YOLO object detection, as an aid for related RL studies.
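The abstract contrasts several reward and punishment designs but does not give their formulas. As a rough illustration only, the sketch below shows one common way a sparse (terminal-only) reward and a continuous (per-step) reward can differ for a target-traversal task. The function names, the distance-based shaping term, and the constants are assumptions made for this example and are not taken from the paper.

```python
# Illustrative only: these reward shapes are assumptions, not the authors' design.

def sparse_reward(reached: bool, collided: bool) -> float:
    """Reward issued only at terminal events (target reached or collision)."""
    if collided:
        return -1.0
    return 1.0 if reached else 0.0


def continuous_reward(prev_dist: float, curr_dist: float,
                      reached: bool, collided: bool,
                      scale: float = 0.1) -> float:
    """Per-step reward: terminal terms plus a term for progress toward the target.

    prev_dist and curr_dist are hypothetical distances (in metres) from the
    drone to the target before and after the chosen discrete action.
    """
    if collided:
        return -1.0
    if reached:
        return 1.0
    # Positive when the drone moved closer, negative when it moved away.
    return scale * (prev_dist - curr_dist)


if __name__ == "__main__":
    print(sparse_reward(reached=False, collided=False))
    print(continuous_reward(5.0, 4.0, reached=False, collided=False))
```

With a per-step term like this, the agent receives feedback on every action instead of only at the end of an episode, which is one plausible reason a continuous design could help training converge.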
Pages: 25
Related papers
50 in total
  • [41] Learning Stabilization Control of Quadrotor in Near-Ground Setting Using Reinforcement Learning
    Briliauskas, Mantas
    INFORMATION TECHNOLOGY AND CONTROL, 2024, 53 (01): : 237 - 242
  • [42] Disturbance rejection and high dynamic quadrotor control based on reinforcement learning and supervised learning
    Mingjun Li
    Zhihao Cai
    Jiang Zhao
    Jinyan Wang
    Yingxun Wang
    Neural Computing and Applications, 2022, 34 : 11141 - 11161
  • [43] Disturbance rejection and high dynamic quadrotor control based on reinforcement learning and supervised learning
    Li, Mingjun
    Cai, Zhihao
    Zhao, Jiang
    Wang, Jinyan
    Wang, Yingxun
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (13) : 11141 - 11161
  • [44] Control of Quadrotor Drone with Partial State Observation via Reinforcement Learning
    Shan, Guangcun
    Zhang, Yinan
    Gao, Yong
    Wang, Tian
    Chen, Jianping
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1965 - 1968
  • [45] A Navigation Scheme for a Random Maze using Reinforcement Learning with Quadrotor Vision
    Yu, Xinglin
    Wu, Yuhu
    Sun, Xi-Ming
    2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), 2019, : 518 - 523
  • [46] A PID Gain Adjustment Scheme Based on Reinforcement Learning Algorithm for a Quadrotor
    Zheng Qingqing
    Tang Renjie
    Gou Siyuan
    Zhang Weizhong
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 6756 - 6761
  • [47] Fuzzy PID Controller for UAV Based on Reinforcement Learning
    Zhang, Benyi
    Zhang, Weiping
    Mou, Jiawang
    Yang, Runmin
    Zhang, Yichen
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1724 - 1732
  • [48] Reinforcement Learning based Scheduling for Heterogeneous UAV Networking
    Wang, Jian
    Liu, Yongxin
    Niu, Shuteng
    Song, Houbing
    2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 420 - 427
  • [49] Deep Reinforcement Learning Enabled Covert Transmission With UAV
    Hu, Jinsong
    Guo, Mingqian
    Yan, Shihao
    Chen, Youjia
    Zhou, Xiaobo
    Chen, Zhizhang
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (05) : 917 - 921
  • [50] A reinforcement learning approach for UAV target searching and tracking
    Tian Wang
    Ruoxi Qin
    Yang Chen
    Hichem Snoussi
    Chang Choi
    Multimedia Tools and Applications, 2019, 78 : 4347 - 4364