Solving routing problems for multiple cooperative Unmanned Aerial Vehicles using Transformer networks

被引：14

作者：

Fuertes, Daniel ^{[1
]}

del-Blanco, Carlos R. ^{[1
]}

Jaureguizar, Fernando ^{[1
]}

Navarro, Juan Jose ^{[2
]}

Garcia, Narciso ^{[1
]}

机构：

[1] Univ Politecn Madrid, Informat Proc & Telecommun Ctr, Grp Tratamiento Imagenes GTI, ETSI Telecomunicac, Madrid 28040, Spain

[2] Airbus Def & Space, Madrid 28906, Spain

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023年 / 122卷

关键词：

Unmanned Aerial Vehicle; Orienteering problem; Deep reinforcement learning; Attention model; Shared regions; GUIDED LOCAL SEARCH; ORIENTEERING PROBLEM; ALGORITHM; GRASP;

D O I：

10.1016/j.engappai.2023.106085

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Missions involving Unmanned Aerial Vehicle usually consist of reaching a set of regions, performing some actions in each region, and returning to a determined depot after all the regions have been successfully visited or before the fuel/battery is totally consumed. Hence, planning a route becomes an important task for many applications, especially if a team of Unmanned Aerial Vehicles is considered. From this team, coordination and cooperation are expected to optimize results of the mission. In this paper, a system for managing multiple cooperative Unmanned Aerial Vehicles is presented. This system divides the routing problem into two stages: initial planning and routing solving. Initial planning is a first step where the regions to be visited are grouped in multiple clusters according to a distance criterion, with each cluster being assigned to each of the Unmanned Aerial Vehicles. Routing solving computes the best route for every agent considering the clusters of the initial planning and a variant of the Orienteering Problem. This variant introduces the concept of shared regions, allowing an Unmanned Aerial Vehicle to visit regions from other clusters and compensating for the suboptimal region clustering of the previous stage. The Orienteering Problem with shared regions is solved using the deep learning architecture Transformer along with a deep reinforcement learning framework. This architecture is able to obtain high-quality solutions much faster than conventional optimization approaches. Extensive results and comparison with other Combinatorial Optimization algorithms, including cooperative and non-cooperative scenarios, have been performed to show the benefits of the proposed solution.

引用

页数：10

共 48 条

[1] A NEW AND SIMPLER APPROXIMATION FOR ANOVA UNDER VARIANCE HETEROGENEITY [J].

ALEXANDER, RA ;

GOVERN, DM .

JOURNAL OF EDUCATIONAL STATISTICS, 1994, 19 (02) :91-101

[2]

Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473,1409.0473, DOI 10.48550/ARXIV.1409.0473,1409.0473]

[3] TRAVELING SALESMAN PROBLEM - A SURVEY [J].

BELLMORE, M ;

NEHAUSE.GL .

OPERATIONS RESEARCH, 1968, 16 (03) :538-&

[4]

Bello I., 2016, PREPRINT

[5]

Bradley P. S., 2000, MSRTR200065

[6] GRASP with path relinking for the orienteering problem [J].

Campos, Vicente ;

Marti, Rafael ;

Sanchez-Oro, Jesus ;

Duarte, Abraham .

JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2014, 65 (12) :1800-1813

[7] THE TRUCK DISPATCHING PROBLEM [J].

DANTZIG, GB ;

RAMSER, JH .

MANAGEMENT SCIENCE, 1959, 6 (01) :80-91

[8]

Festa P, 2014, INT C TRANS OPT NETW

[9]

Gama R, 2021, Arxiv, DOI arXiv:2011.03647

[10] Dynamical Variational Autoencoders: A Comprehensive Review [J].

Girin, Laurent ;

Leglaive, Simon ;

Bie, Xiaoyu ;

Diard, Julien ;

Hueber, Thomas ;

Alameda-Pineda, Xavier .

FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2021, 15 (1-2) :1-175

← 1 2 3 4 5 →