Reinforcement learning-based drone simulators: survey, practice, and challenge

被引:2
作者
Chan, Jun Hoong [1 ]
Liu, Kai [2 ]
Chen, Yu [3 ]
Sagar, A. S. M. Sharifuzzaman [4 ]
Kim, Yong-Guk [3 ]
机构
[1] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
[2] Wenzhou Univ, Coll Comp Sci & Artificial Intelligence, Wenzhou 325100, Peoples R China
[3] Sejong Univ, Dept Comp Engn, Neungdong Ro, Seoul, South Korea
[4] Sejong Univ, Dept AI & Robot, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Drone; Reinforcement learning; Drone simulator; UAVS;
D O I
10.1007/s10462-024-10933-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, machine learning has been very useful in solving diverse tasks with drones, such as autonomous navigation, visual surveillance, communication, disaster management, and agriculture. Among these machine learning, two representative paradigms have been widely utilized in such applications: supervised learning and reinforcement learning. Researchers prefer to use supervised learning, mostly based on convolutional neural networks, because of its robustness and ease of use but yet data labeling is laborious and time-consuming. On the other hand, when traditional reinforcement learning is combined with the deep neural network, it can be a very powerful tool to solve high-dimensional input problems such as image and video. Along with the fast development of reinforcement learning, many researchers utilize reinforcement learning in drone applications, and it often outperforms supervised learning. However, it usually requires the agent to explore the environment on a trial-and-error basis which is high cost and unrealistic in the real environment. Recent advances in simulated environments can allow an agent to learn by itself to overcome these drawbacks, although the gap between the real environment and the simulator has to be minimized in the end. In this sense, a realistic and reliable simulator is essential for reinforcement learning training. This paper investigates various drone simulators that work with diverse reinforcement learning architectures. The characteristics of the reinforcement learning-based drone simulators are analyzed and compared for the researchers who would like to employ them for their projects. Finally, we shed light on some challenges and potential directions for future drone simulators.
引用
收藏
页数:47
相关论文
共 82 条
  • [1] Afzal A, 2020, Arxiv, DOI arXiv:2004.07368
  • [2] UTSim: A framework and simulator for UAV air traffic integration, control, and communication
    Al-Mousa, Amjed
    Sababha, Belal H.
    Al-Madi, Nailah
    Barghouthi, Amro
    Younisse, Remah
    [J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2019, 16 (05)
  • [3] Altaweel M, 2022, The use of drones in human and physical geography
  • [4] [Anonymous], 2017, Openai baselines
  • [5] Anwar A, 2019, Arxiv, DOI [arXiv:1910.05547, 10.1109/ACCESS.2020.2971172]
  • [6] Babushkin A, 2022, jMAVSim
  • [7] Berndt J.S., 2004, P AIAA MOD SIM TECHN, DOI DOI 10.2514/6.2004-4923
  • [8] UAVs and simulation: an experience on MAVs
    Capello, Elisa
    Guglieri, Giorgio
    Quagliotti, Fulvia B.
    [J]. AIRCRAFT ENGINEERING AND AEROSPACE TECHNOLOGY, 2009, 81 (01) : 38 - 50
  • [9] USARSim:a robot simulator for research and education
    Carpin, Stefano
    Lewis, Mike
    Wang, Jijun
    Balakirsky, Stephen
    Scrapper, Chris
    [J]. PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-10, 2007, : 1400 - +
  • [10] Coumans E., 2016, PyBullet, a python module for physics simulation for games, robotics and machine learning