High-Level Path Planning for an Autonomous Sailboat Robot Using Q-Learning

被引：49

作者：

da Silva Junior, Andouglas Goncalves ^{[1
,2
]}

dos Santos, Davi Henrique ^{[1
]}

Fernandes de Negreiros, Alvaro Pinto ^{[1
]}

Boas de Souza Silva, Joao Moreno Vilas ^{[2
]}

Garcia Goncalves, Luiz Marcos ^{[1
]}

机构：

[1] Univ Fed Rio Grande do Norte, DCA CT UFRN, Campus Univ, BR-59078970 Natal, RN, Brazil

[2] Inst Fed Rio Grande Norte, Ave Sen Salgado Filho,1559 Tirol, BR-59015000 Natal, RN, Brazil

来源：

SENSORS | 2020年 / 20卷 / 06期

关键词：

Q-Learning; path planning; USV; ASV; autonomous sailboat; mobile robotics; green robotics; MOBILE ROBOT; NAVIGATION; ATTENTION;

D O I：

10.3390/s20061550

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Path planning for sailboat robots is a challenging task particularly due to the kinematics and dynamics modelling of such kinds of wind propelled boats. The problem is divided into two layers. The first one is global where a general trajectory composed of waypoints is planned, which can be done automatically based on some variables such as weather conditions or defined by hand using some human-robot interface (a ground-station). In the second local layer, at execution time, the global route should be followed by making the sailboat proceed between each pair of consecutive waypoints. Our proposal in this paper is an algorithm for the global, path generation layer, which has been developed for the N-Boat (The Sailboat Robot project), in order to compute feasible sailing routes between a start and a target point while avoiding dangerous situations such as obstacles and borders. A reinforcement learning approach (Q-Learning) is used based on a reward matrix and a set of actions that changes according to wind directions to account for the dead zone, which is the region against the wind where the sailboat can not gain velocity. Our algorithm generates straight and zigzag paths accounting for wind direction. The path generated also guarantees the sailboat safety and robustness, enabling it to sail for long periods of time, depending only on the start and target points defined for this global planning. The result is the development of a complete path planner algorithm that, together with the local planner solved in previous work, can be used to allow the final developments of an N-Boat making it a fully autonomous sailboat.

引用

页数：22

共 40 条

[1] Green Robotics: Concepts, Challenges, and Strategies [J].

Alves Filho, S. E. ;

Sa, S. T. de L. ;

Burlamaqui, A. M. F. ;

Aroca, R., V ;

Goncalves, L. M. G. .

IEEE LATIN AMERICA TRANSACTIONS, 2018, 16 (04) :1042-1050

[2]

Baker R., 2015, P WORLD ROB SAIL CHA

[3] Learning coordination in multi-agent systems using influence value reinforcement learning [J].

Barrios-Aranibar, Dennis ;

Garcia Goncalves, Luiz Marcos .

PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2007, :471-476

[4] A MARKOVIAN DECISION PROCESS [J].

BELLMAN, R .

JOURNAL OF MATHEMATICS AND MECHANICS, 1957, 6 (05) :679-684

[5]

Bradski G., THE OPENCV LIB

[6] A Versatile Method for Depth Data Error Estimation in RGB-D Sensors [J].

Cabrera, Elizabeth, V ;

Ortiz, Luis E. ;

da Silva, Bruno M. F. ;

Clua, Esteban W. G. ;

Goncalves, Luiz M. G. .

SENSORS, 2018, 18 (09)

[7]

Cabrera-Gamez J., 2012, P ROB SAIL 2012 CARD

[8] A knowledge-free path planning approach for smart ships based on reinforcement learning [J].

Chen, Chen ;

Chen, Xian-Qiao ;

Ma, Feng ;

Zeng, Xiao-Jun ;

Wang, Jin .

OCEAN ENGINEERING, 2019, 189

[9]

Choset H., 2005, Intelligent Robotics and Autonomous Agents series

[10] UAV Motion Strategies in Uncertain Dynamic Environments: A Path Planning Method Based on Q-Learning Strategy [J].

Cui, Jun-hui ;

Wei, Rui-xuan ;

Liu, Zong-cheng ;

Zhou, Kai .

APPLIED SCIENCES-BASEL, 2018, 8 (11)

← 1 2 3 4 →