Learning to steer on winding tracks using semi-parametric control policies

被引:0
|
作者
Alton, K [1 ]
de Panne, MV [1 ]
机构
[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada
来源
2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4 | 2005年
关键词
nonholonomic systems; reinforcement learning; policy search; hybrid control; vehicle steering;
D O I
暂无
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We present a semi-parametric control policy representation and use it to solve a series or nonholonornic control problems with input state spaces or up to 7 dimensions. A nearest-neighbor control policy is represented by a set of nodes that induce a Voronoi partitioning of the input space. The Voronoi cells then define local control actions. Direct policy search is applied to optimize the node locations and actions. The selective addition of nodes allows for progressive refinement of the control representation. We demonstrate this approach on the challenging problem of learning to steer cars and trucks-with-trailers around winding tracks with sharp corners. We consider the steering of both forwards and backwards-moving vehicles with only local sensory information. The steering behaviors for these nonholonomic systems are shown to generalize well to tracks not seen in training.
引用
收藏
页码:4588 / 4593
页数:6
相关论文
共 6 条
  • [1] Reinforcement Learning Method Based on Semi-parametric Regression Model
    Cheng, Yuhu
    Wang, Xuesong
    Tian, Xilan
    2010 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-5, 2010, : 11 - 15
  • [2] Reinforcement Learning-Based and Parametric Production-Maintenance Control Policies for a Deteriorating Manufacturing System
    Xanthopoulos, A. S.
    Kiatipis, Athanasios
    Koulouriotis, D. E.
    Stieger, Sepp
    IEEE ACCESS, 2018, 6 : 576 - 588
  • [3] Learning First-to-Spike Policies for Neuromorphic Control Using Policy Gradients
    Rosenfeld, Bleema
    Simeone, Osvaldo
    Rajendran, Bipin
    2019 IEEE 20TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (SPAWC 2019), 2019,
  • [4] Training Drift Counteraction Optimal Control Policies Using Reinforcement Learning: An Adaptive Cruise Control Example
    Li, Zhaojian
    Chu, Tianshu
    Kolmanovsky, Ilya, V
    Yin, Xiang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (09) : 2903 - 2912
  • [5] Enhanced Hydraulic Excavator Control via Semi-automatic Grading Control Using Reinforcement Learning
    Kim, Youngbum
    Kim, Jinwhan
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2025, 23 (03) : 896 - 906
  • [6] Development of a Control Algorithm for a Semi-Active Mid-Story Isolation System Using Reinforcement Learning
    Kim, Hyun-Su
    Kim, Uksun
    APPLIED SCIENCES-BASEL, 2023, 13 (04):