Q-Learning based system for Path Planning with Unmanned Aerial Vehicles swarms in obstacle environments

Cited by: 18
Authors
Puente-Castro, Alejandro [1]
Rivero, Daniel [1]
Pedrosa, Eurico [2]
Pereira, Artur [2]
Lau, Nuno [2]
Fernandez-Blanco, Enrique [1]
Affiliations
[1] Univ A Coruna, Fac Comp Sci, CITIC, La Coruna 15007, Spain
[2] Univ Aveiro, LASI, IEETA, DESI, Aveiro, Portugal
Keywords
UAV; Swarm; Obstacle; Path Planning; Reinforcement learning; Artificial Neural Network; ALGORITHMS; MODEL;
DOI
10.1016/j.eswa.2023.121240
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Path Planning methods for the autonomous control of Unmanned Aerial Vehicle (UAV) swarms are on the rise due to the numerous advantages they bring. A growing number of scenarios require autonomous control of multiple UAVs. Most of these scenarios involve a large number of obstacles, such as power lines or trees. Despite these challenges, there are also several advantages: if all UAVs can operate autonomously, personnel expenses can be reduced. Additionally, if their flight paths are optimized, energy consumption is reduced, leaving more battery time for other operations. In this paper, a Reinforcement Learning-based system is proposed to solve this problem in environments with obstacles by utilizing Q-Learning. This method allows a model, in this case an Artificial Neural Network, to self-adjust by learning from its mistakes and successes. Regardless of the map's size or the number of UAVs in the swarm, the goal of these paths is to ensure complete coverage of an area with fixed obstacles for tasks like field prospecting. Setting goals or having any prior information apart from the provided map is not required. During the experimentation phase, five maps of varying sizes were used, each with different obstacles and a varying number of UAVs. To evaluate the quality of the results, the number of actions taken by each UAV to complete the task in each experiment was considered. The results indicate that the system achieves solutions with fewer movements as the number of UAVs on a map increases. The results have been compared, and a statistical significance analysis has been conducted on the proposed model's outcomes, demonstrating its capabilities. Thus, it is shown that a two-layer Artificial Neural Network used to implement a Q-Learning algorithm is sufficient to operate on maps with obstacles.
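As a rough illustration of the Q-Learning update the abstract refers to, the following is a minimal sketch, not the authors' implementation. The paper approximates the action values with a two-layer Artificial Neural Network; this sketch swaps in a plain lookup table for brevity. The action set, the grid-cell state encoding, the reward of +1.0 for entering an unvisited free cell, and the values of `alpha` and `gamma` are all assumptions made for the example.

```python
from collections import defaultdict

# Assumed action set for one UAV on a grid map (hypothetical).
ACTIONS = ("up", "down", "left", "right")


def q_update(q, state, action, reward, next_state, alpha=0.5, gamma=0.9):
    """Apply one Q-Learning step and return the temporal-difference error.

    Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
    """
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    td_error = reward + gamma * best_next - q[(state, action)]
    q[(state, action)] += alpha * td_error
    return td_error


# Table of action values, defaulting to 0.0 for unseen (state, action) pairs.
q = defaultdict(float)

# Hypothetical transition: the UAV moves right from cell (0, 0) to the
# unvisited free cell (0, 1) and receives an assumed reward of +1.0.
err1 = q_update(q, state=(0, 0), action="right", reward=1.0, next_state=(0, 1))
err2 = q_update(q, state=(0, 0), action="right", reward=1.0, next_state=(0, 1))
```

Repeating the same transition shrinks the error, which is the "learning from its mistakes and successes" behaviour described above. In a network-based variant, the same error term would drive a gradient step on the network weights instead of a table entry.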
Pages: 14