Integral reinforcement learning-based approximate minimum time-energy path planning in an unknown environment

被引：11

作者：

He, Chenyuan ^{[1
]}

Wan, Yan ^{[2
,3
]}

Gu, Yixin ^{[3
]}

Lewis, Frank L. ^{[2
,3
]}

机构：

[1] Univ Texas Arlington, Dept Elect Engn, Arlington, TX 76019 USA

[2] Univ Texas Arlington, EE UTA, Ft Worth, TX USA

[3] Univ Texas Arlington, UTA Res Inst, Ft Worth, TX USA

来源：

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL | 2021年 / 31卷 / 06期

基金：

美国国家科学基金会;

关键词：

constrained optimal control; integral reinforcement learning; minimum time-energy path planning; NONLINEAR-SYSTEMS; VEHICLES; OPTIMIZATION; CURVATURE; DESIGN;

D O I：

10.1002/rnc.5122

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Path planning is a fundamental and critical task in many robotic applications. For energy-constrained robot platforms, path planning solutions are desired with minimum time arrivals and minimal energy consumption. Uncertain environments, such as wind conditions, pose challenges to the design of effective minimum time-energy path planning solutions. In this article, we develop a minimum time-energy path planning solution in continuous state and control input spaces using integral reinforcement learning (IRL). To provide a baseline solution for the performance evaluation of the proposed solution, we first develop a theoretical analysis for the minimum time-energy path planning problem in a known environment using the Pontryagin's minimum principle. We then provide an online adaptive solution in an unknown environment using IRL. This is done through transforming the minimum time-energy problem to an approximate minimum time-energy problem and then developing an IRL-based optimal control strategy. Convergence of the IRL-based optimal control strategy is proven. Simulation studies are developed to compare the theoretical analysis and the proposed IRL-based algorithm.

引用

页码：1905 / 1922

页数：18

共 44 条

[1] Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
Abu-Khalaf, M
Lewis, FL
[J]. AUTOMATICA, 2005, 41 (05) : 779 - 791
[2] Abu-Khalaf M., 2006, Nonlinear H2/H-Infinity Constrained Feedback Control: A Practical Design Approach Using Neural Networks
[3] Evolutionary path planning for autonomous underwater vehicles in a variable ocean
Alvarez, A
Caiti, A
Onken, R
[J]. IEEE JOURNAL OF OCEANIC ENGINEERING, 2004, 29 (02) : 418 - 429
[4] [Anonymous], 2019, AIAA SCITECH 2019 FO
[5] Hierarchical dynamic programming for robot path planning
Bakker, B
Zivkovic, Z
Kröse, B
[J]. 2005 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2005, : 3720 - 3725
[6] Bakolas E, 2010, P AMER CONTR CONF, P6163
[7] Ben-Ari M., 2018, Elements of Robotics, DOI [DOI 10.1007/978-3-319-62533-1, 10.1007/978-3-319-62533-1_1, DOI 10.1007/978-3-319-62533-1_1]
[8] SHORTEST PATHS OF BOUNDED CURVATURE IN THE PLANE
BOISSONNAT, JD
CEREZO, A
LEBLOND, J
[J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 1994, 11 (1-2) : 5 - 20
[9] BUI XN, 1994, IEEE INT CONF ROBOT, P2, DOI 10.1109/ROBOT.1994.351019
[10] Chakrabarty A, 2013, P AMER CONTR CONF, P2568

← 1 2 3 4 5 →