A hierarchical path planning approach based on A* and least-squares policy iteration for mobile robots

被引：68

作者：

Zuo, Lei ^{[1
]}

Guo, Qi ^{[1
]}

Xu, Xin ^{[1
]}

Fu, Hao ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Mech & Automat, Changsha 410073, Hunan, Peoples R China

来源：

NEUROCOMPUTING | 2015年 / 170卷

基金：

中国国家自然科学基金;

关键词：

Mobile robots; Hierarchical path planning; A* search; Reinforcement learning; Least squares policy iteration (LSPI); Optimality; GENERALIZED VORONOI DIAGRAMS; CONFIGURATION-SPACES; POTENTIAL FUNCTIONS; OBSTACLE AVOIDANCE; NAVIGATION; ENVIRONMENTS; ALGORITHMS; COSTMAPS; STRATEGY; ROADMAP;

D O I：

10.1016/j.neucom.2014.09.092

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a novel hierarchical path planning approach for mobile robot navigation in complex environments. The proposed approach has a two-level structure. In the first level, the A* algorithm based on grids is used to find a geometric path quickly and several path points are selected as subgoals for the next level. In the second level, an approximate policy iteration algorithm called least-squares policy iteration (LSPI) is used to learn a near-optimal local planning policy that can generate smooth trajectories under kinematic constraints of the robot. Using this near-optimal local planning policy, the mobile robot can find an optimized path by sequentially approaching the subgoals obtained in the first level. One advantage of the proposed approach is that the kinematic characteristics of the mobile robot can be incorporated into the LSPI-based path optimization procedure. The second advantage is that the LSPI-based local path optimizer uses an approximate policy iteration algorithm which has been proven to be data-efficient and stable. The training of the local path optimizer can use sample experiences collected randomly from any reasonable sampling distribution. Furthermore, the LSPI-based local path optimizer has the ability of dealing with uncertainties in the environment. For unknown obstacles, it just needs to replan the path in the second level rather than the whole planner. Simulations for path planning in various types of environments have been carried out and the results demonstrate the effectiveness of the proposed approach. (C) 2015 Elsevier B.V. All rights reserved.

引用

页码：257 / 266

页数：10

共 50 条

[1] Least-squares policy iteration
Lagoudakis, MG
Parr, R
JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (06) : 1107 - 1149
[2] Experience replay for least-squares policy iteration
Liu, Quan (quanliu@suda.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc. (01): : 274 - 281
[3] Finite-Sample Analysis of Least-Squares Policy Iteration
Lazaric, Alessandro
Ghavamzadeh, Mohammad
Munos, Remi
JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 3041 - 3074
[4] Online least-squares policy iteration for reinforcement learning control
Busoniu, Lucian
Ernst, Damien
De Schutter, Bart
Babuska, Robert
2010 AMERICAN CONTROL CONFERENCE, 2010, : 486 - 491
[5] Potential-Based Least-Squares Policy Iteration for a Parameterized Feedback Control System
Cheng, Kang
Zhang, Kanjian
Fei, Shumin
Wei, Haikun
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2016, 169 (02) : 692 - 704
[6] Hierarchical Path Planning for Mobile Robots Based on Hybrid Map
Wu X.
Yang J.
Tang K.
Zhai J.
Lou P.
Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2023, 34 (05): : 563 - 575
[7] Least-squares policy iteration algorithms for robotics: Online, continuous, and automatic
Friedrich, Stefan R.
Schreibauer, Michael
Buss, Martin
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 83 : 72 - 84
[8] A calibration method for odometry of mobile robots based on the least-squares technique: Theory and experimental validation
Antonelli, G
Chiaverini, S
Fusco, G
IEEE TRANSACTIONS ON ROBOTICS, 2005, 21 (05) : 994 - 1004
[9] A hierarchical reinforcement learning approach for optimal path tracking of wheeled mobile robots
Zuo, Lei
Xu, Xin
Liu, Chunming
Huang, Zhenhua
NEURAL COMPUTING & APPLICATIONS, 2013, 23 (7-8) : 1873 - 1883
[10] Adaptive Kernel-Width Selection for Kernel-Based Least-Squares Policy Iteration Algorithm
Wu, Jun
Xu, Xin
Zuo, Lei
Li, Zhaobin
Wang, Jian
ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT II, 2011, 6676 : 611 - 619

← 1 2 3 4 5 →