Learning to steer on winding tracks using semi-parametric control policies

被引：0

作者：

Alton, K ^{[1
]}

de Panne, MV ^{[1
]}

机构：

[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada

来源：

2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4 | 2005年

关键词：

nonholonomic systems; reinforcement learning; policy search; hybrid control; vehicle steering;

D O I：

暂无

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

We present a semi-parametric control policy representation and use it to solve a series or nonholonornic control problems with input state spaces or up to 7 dimensions. A nearest-neighbor control policy is represented by a set of nodes that induce a Voronoi partitioning of the input space. The Voronoi cells then define local control actions. Direct policy search is applied to optimize the node locations and actions. The selective addition of nodes allows for progressive refinement of the control representation. We demonstrate this approach on the challenging problem of learning to steer cars and trucks-with-trailers around winding tracks with sharp corners. We consider the steering of both forwards and backwards-moving vehicles with only local sensory information. The steering behaviors for these nonholonomic systems are shown to generalize well to tracks not seen in training.

引用

页码：4588 / 4593

页数：6

共 6 条

[1] Reinforcement Learning Method Based on Semi-parametric Regression Model
Cheng, Yuhu
Wang, Xuesong
Tian, Xilan
2010 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-5, 2010, : 11 - 15
[2] Reinforcement Learning-Based and Parametric Production-Maintenance Control Policies for a Deteriorating Manufacturing System
Xanthopoulos, A. S.
Kiatipis, Athanasios
Koulouriotis, D. E.
Stieger, Sepp
IEEE ACCESS, 2018, 6 : 576 - 588
[3] Learning First-to-Spike Policies for Neuromorphic Control Using Policy Gradients
Rosenfeld, Bleema
Simeone, Osvaldo
Rajendran, Bipin
2019 IEEE 20TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (SPAWC 2019), 2019,
[4] Training Drift Counteraction Optimal Control Policies Using Reinforcement Learning: An Adaptive Cruise Control Example
Li, Zhaojian
Chu, Tianshu
Kolmanovsky, Ilya, V
Yin, Xiang
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (09) : 2903 - 2912
[5] Enhanced Hydraulic Excavator Control via Semi-automatic Grading Control Using Reinforcement Learning
Kim, Youngbum
Kim, Jinwhan
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2025, 23 (03) : 896 - 906
[6] Development of a Control Algorithm for a Semi-Active Mid-Story Isolation System Using Reinforcement Learning
Kim, Hyun-Su
Kim, Uksun
APPLIED SCIENCES-BASEL, 2023, 13 (04):

← 1 →