An Online Evolving Framework for Advancing Reinforcement-Learning based Automated Vehicle Control

被引：4

作者：

Han, Teawon ^{[1
]}

Nageshrao, Subramanya ^{[2
]}

Filev, Dimitar P. ^{[3
]}

Ozguner, Umit ^{[1
]}

机构：

[1] Ohio State Univ, Dept Elect & Comp Engn, Columbus, OH 43210 USA

[2] Ford Greenfield Labs, Palo Alto, CA 94304 USA

[3] Ford Motor Co, Dearborn, MI 48121 USA

来源：

IFAC PAPERSONLINE | 2020年 / 53卷 / 02期

关键词：

evolving controller; reinforcement learning; automated vehicle; machine learning;

D O I：

10.1016/j.ifacol.2020.12.2283

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, an online evolving framework is proposed to detect and revise a controller's imperfect decision-making in advance. The framework consists of three modules: the evolving Finite State Machine (e-FSM), action-reviser, and controller modules. The e-FSM module evolves a stochastic model (e.g., Discrete-Time Markov Chain) from scratch by determining new states and identifying transition probabilities repeatedly. With the latest stochastic model and given criteria, the action-reviser module checks validity of the controller's chosen action by predicting future states. Then, if the chosen action is not appropriate, another action is inspected and selected. In order to show the advantage of the proposed framework, the Deep Deterministic Policy Gradient (DDPG) w/ and w/o the online evolving framework are applied to control an ego-vehicle in the car-following scenario where control criteria are set by speed and safety. Experimental results show that inappropriate actions chosen by the DDPG controller are detected and revised appropriately through our proposed framework, resulting in no control failures after a few iterations. Copyright (C) 2020 The Authors.

引用

页码：8118 / 8123

页数：6

共 16 条

[1]

Al-Qizwini M, 2017, IEEE INT VEH SYM, P89, DOI 10.1109/IVS.2017.7995703

[2]

[Anonymous], 2015, ACS SYM SER

[3]

Bojarski Mariusz, 2016, arXiv

[4]

Filev DP, 2010, P AMER CONTR CONF, P4139

[5] A Framework for Estimating Driver Decisions Near Intersections [J].

Gadepally, Vijay ;

Krishnamurthy, Ashok ;

Oezguener, Uemit .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 15 (02) :637-646

[6]

Han T., 2019, ARXIV PREPRINT ARXIV

[7]

Han T, 2019, IEEE INT VEH SYM, P957, DOI 10.1109/IVS.2019.8813987

[8]

Kurt A., 2008, P IFAC 2008 WORLD C, P9540, DOI DOI 10.3182/20080706-5-KR-1001.01613

[9]

Lillicrap Timothy P, 2016, P 4 INT C LEARNING R

[10]

Liu A, 2000, DRIVER BEH RECOGNITI

← 1 2 →