A hierarchical path planning approach based on A* and least-squares policy iteration for mobile robots

被引:68
|
作者
Zuo, Lei [1 ]
Guo, Qi [1 ]
Xu, Xin [1 ]
Fu, Hao [1 ]
机构
[1] Natl Univ Def Technol, Coll Mech & Automat, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Mobile robots; Hierarchical path planning; A* search; Reinforcement learning; Least squares policy iteration (LSPI); Optimality; GENERALIZED VORONOI DIAGRAMS; CONFIGURATION-SPACES; POTENTIAL FUNCTIONS; OBSTACLE AVOIDANCE; NAVIGATION; ENVIRONMENTS; ALGORITHMS; COSTMAPS; STRATEGY; ROADMAP;
D O I
10.1016/j.neucom.2014.09.092
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel hierarchical path planning approach for mobile robot navigation in complex environments. The proposed approach has a two-level structure. In the first level, the A* algorithm based on grids is used to find a geometric path quickly and several path points are selected as subgoals for the next level. In the second level, an approximate policy iteration algorithm called least-squares policy iteration (LSPI) is used to learn a near-optimal local planning policy that can generate smooth trajectories under kinematic constraints of the robot. Using this near-optimal local planning policy, the mobile robot can find an optimized path by sequentially approaching the subgoals obtained in the first level. One advantage of the proposed approach is that the kinematic characteristics of the mobile robot can be incorporated into the LSPI-based path optimization procedure. The second advantage is that the LSPI-based local path optimizer uses an approximate policy iteration algorithm which has been proven to be data-efficient and stable. The training of the local path optimizer can use sample experiences collected randomly from any reasonable sampling distribution. Furthermore, the LSPI-based local path optimizer has the ability of dealing with uncertainties in the environment. For unknown obstacles, it just needs to replan the path in the second level rather than the whole planner. Simulations for path planning in various types of environments have been carried out and the results demonstrate the effectiveness of the proposed approach. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:257 / 266
页数:10
相关论文
共 50 条
  • [1] Least-squares policy iteration
    Lagoudakis, MG
    Parr, R
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (06) : 1107 - 1149
  • [2] Experience replay for least-squares policy iteration
    Liu, Quan (quanliu@suda.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc. (01): : 274 - 281
  • [3] Finite-Sample Analysis of Least-Squares Policy Iteration
    Lazaric, Alessandro
    Ghavamzadeh, Mohammad
    Munos, Remi
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 3041 - 3074
  • [4] Online least-squares policy iteration for reinforcement learning control
    Busoniu, Lucian
    Ernst, Damien
    De Schutter, Bart
    Babuska, Robert
    2010 AMERICAN CONTROL CONFERENCE, 2010, : 486 - 491
  • [5] Potential-Based Least-Squares Policy Iteration for a Parameterized Feedback Control System
    Cheng, Kang
    Zhang, Kanjian
    Fei, Shumin
    Wei, Haikun
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2016, 169 (02) : 692 - 704
  • [6] Hierarchical Path Planning for Mobile Robots Based on Hybrid Map
    Wu X.
    Yang J.
    Tang K.
    Zhai J.
    Lou P.
    Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2023, 34 (05): : 563 - 575
  • [7] Least-squares policy iteration algorithms for robotics: Online, continuous, and automatic
    Friedrich, Stefan R.
    Schreibauer, Michael
    Buss, Martin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 83 : 72 - 84
  • [8] A calibration method for odometry of mobile robots based on the least-squares technique: Theory and experimental validation
    Antonelli, G
    Chiaverini, S
    Fusco, G
    IEEE TRANSACTIONS ON ROBOTICS, 2005, 21 (05) : 994 - 1004
  • [9] A hierarchical reinforcement learning approach for optimal path tracking of wheeled mobile robots
    Zuo, Lei
    Xu, Xin
    Liu, Chunming
    Huang, Zhenhua
    NEURAL COMPUTING & APPLICATIONS, 2013, 23 (7-8) : 1873 - 1883
  • [10] Adaptive Kernel-Width Selection for Kernel-Based Least-Squares Policy Iteration Algorithm
    Wu, Jun
    Xu, Xin
    Zuo, Lei
    Li, Zhaobin
    Wang, Jian
    ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT II, 2011, 6676 : 611 - 619