Application of Reinforcement Learning in Controlling Quadrotor UAV Flight Actions

Cited by: 2
Authors
Shen, Shang-En [1]
Huang, Yi-Cheng [1]
Affiliations
[1] Natl Chung Hsing Univ, Dept Mech Engn, Taichung 40227, Taiwan
Keywords
quadrotor UAV; reinforcement learning; logic control; target recognition; action decision making;
DOI
10.3390/drones8110660
Chinese Library Classification (CLC) number
TP7 [Remote sensing technology];
Discipline classification codes
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
Abstract
Reinforcement learning (RL) for controlling rotorcraft drones during traversal flight tasks has been discussed extensively in the literature. However, most studies give few details on the design of reward and punishment mechanisms, and few explore whether RL remains feasible in actual flight control once the simulation experiments are complete. This study therefore focuses on reward and punishment design and on the state input for RL. The simulation environment is built with AirSim and Unreal Engine, and onboard camera footage serves as the RL state input. Three RL algorithms suitable for discrete-action training, Deep Q Network (DQN), Advantage Actor-Critic (A2C), and Proximal Policy Optimization (PPO), were each combined with three different reward and punishment designs for training and testing. The results indicate that PPO with a continuous return method as the reward mechanism converges effectively during training and achieves a target traversal rate of 71% in the testing environment. The study further proposes integrating the YOLOv7-tiny object detection (OD) system to assess the applicability of RL in real-world settings. With the state inputs of the simulated and OD environments unified, and the original simulated image inputs replaced by a maximum dual-target approach, the experimental simulation achieved a target traversal rate of 52%. In summary, this research formulates a logical framework for RL reward and punishment design, deployed together with real-time YOLO object detection, as an aid for related RL studies.
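The abstract does not specify the three reward and punishment designs, so the following is only a minimal sketch of the general distinction between a sparse reward and a continuous (dense) one. The function names, the distance-based progress term, and the `scale` constant are all hypothetical and are not taken from the paper.

```python
def sparse_reward(passed_target: bool, collided: bool) -> float:
    """Hypothetical sparse design: feedback only at terminal events."""
    if collided:
        return -1.0
    return 1.0 if passed_target else 0.0


def continuous_reward(prev_dist: float, dist: float,
                      passed_target: bool, collided: bool,
                      scale: float = 0.1) -> float:
    """Hypothetical continuous design: every step is rewarded in
    proportion to how much closer the drone moved to the target."""
    if collided:
        return -1.0
    if passed_target:
        return 1.0
    # Positive when the distance shrank, negative when it grew.
    return scale * (prev_dist - dist)
```

Under these assumptions, a step that moves the drone from 10 m to 8 m from the target yields a small positive reward in the continuous design and zero in the sparse one, which is one common reason dense feedback is expected to ease convergence.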
Pages: 25
Related papers
50 in total
  • [1] Hybrid Reinforcement Learning Control for a Micro Quadrotor Flight
    Yoo, Jaehyun
    Jang, Dohyun
    Kim, H. Jin
    Johansson, Karl H.
    IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (02) : 505 - 510
  • [2] Modular Reinforcement Learning for a Quadrotor UAV With Decoupled Yaw Control
    Yu, Beomyeol
    Lee, Taeyoung
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01) : 572 - 579
  • [3] Modular Reinforcement Learning for Autonomous UAV Flight Control
    Choi, Jongkwan
    Kim, Hyeon Min
    Hwang, Ha Jun
    Kim, Yong-Duk
    Kim, Chang Ouk
    DRONES, 2023, 7 (07)
  • [4] Aggressive Quadrotor Flight Using Curiosity-Driven Reinforcement Learning
    Sun, Qiyu
    Fang, Jinbao
    Zheng, Wei Xing
    Tang, Yang
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (12) : 13838 - 13848
  • [5] Event-triggered reinforcement learning control for the quadrotor UAV with actuator saturation
    Lin, Xiaobo
    Liu, Jian
    Yu, Yao
    Sun, Changyin
    NEUROCOMPUTING, 2020, 415 (415) : 135 - 145
  • [6] Research of Precision Flight Control for Quadrotor UAV
    Gao Qingji
    Yue Fengfa
    Hu Dandan
    2014 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2014 : 2369 - 2374
  • [7] Application of Reinforcement Learning in UAV Tasks: A Survey
    Fu, Jiahao
    Yang, Feng
    UNMANNED SYSTEMS, 2025
  • [8] Application of reinforcement learning in UAV cluster task scheduling
    Yang, Jun
    You, Xinghui
    Wu, Gaoxiang
    Hassan, Mohammad Mehedi
    Almogren, Ahmad
    Guna, Joze
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 95 : 140 - 148
  • [9] Quadrotor UAV Flight Control Using Backstepping Adaptive Controller
    Zhou, Laihong
    Zhang, Bao
    2020 IEEE 6TH INTERNATIONAL CONFERENCE ON CONTROL SCIENCE AND SYSTEMS ENGINEERING (ICCSSE), 2019 : 169 - 172
  • [10] Improved Reinforcement Learning Using Stability Augmentation With Application to Quadrotor Attitude Control
    Wu, Hangxing
    Ye, Hui
    Xue, Wentao
    Yang, Xiaofei
    IEEE ACCESS, 2022, 10 : 67590 - 67604