Predictive cruise control of connected and autonomous vehicles via reinforcement learning

被引：19

作者：

Gao, Weinan ^{[1
]}

Odekunle, Adedapo ^{[1
]}

Chen, Yunfeng ^{[2
]}

Jiang, Zhong-Ping ^{[3
]}

机构：

[1] Georgia Southern Univ, Allen E Paulson Coll Engn & Comp, Dept Elect & Comp Engn, Statesboro, GA 30460 USA

[2] Purdue Univ, Purdue Polytech Inst, Dept Construct Management Technol, W Lafayette, IN 47907 USA

[3] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, Brooklyn, NY 11201 USA

来源：

IET CONTROL THEORY AND APPLICATIONS | 2019年 / 13卷 / 17期

关键词：

road vehicles; optimal control; adaptive control; learning (artificial intelligence); predictive control; velocity control; distributed control; road traffic control; mobile robots; acceleration control; reinforcement learning; autonomous vehicle; data-driven adaptive optimal control algorithm; vehicle safety; passenger comfort; predictive cruise control; connected vehicles; distributed optimal controllers; suboptimal control; velocity regulation; headway regulation; acceleration regulation; TIME MULTIAGENT SYSTEMS; OUTPUT REGULATION; CONSENSUS;

D O I：

10.1049/iet-cta.2018.6031

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Predictive cruise control concerns designing controllers for autonomous vehicles using the broadcasted information from the traffic lights such that the idle time around the intersection can be reduced. This study proposes a novel adaptive optimal control approach based on reinforcement learning to solve the predictive cruise control problem of a platoon of connected and autonomous vehicles. First, the reference velocity is determined for each autonomous vehicle in the platoon. Second, a data-driven adaptive optimal control algorithm is developed to estimate the gains of the desired distributed optimal controllers without the exact knowledge of system dynamics. The obtained controller is able to regulate the headway, velocity, and acceleration of each vehicle in a suboptimal sense. The goal of trip time reduction is achieved without compromising vehicle safety and passenger comfort. Numerical simulations are presented to validate the efficacy of the proposed methodology.

引用

页码：2849 / 2855

页数：7

共 32 条

[11] Sampled-Data Cooperative Adaptive Cruise Control of Vehicles With Sensor Failures [J].

Guo, Ge ;

Yue, Wei .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 15 (06) :2404-2418

[12]

Huang G. B., 2007, Tech. Rep.

[13]

Jiang Y., 2017, IEEE T IND INF, V14, P1974

[14]

Jiang Y., 2017, Robust adaptive dynamic programming

[15]

Kavurucu Yusuf, 2017, ELECT ELECT COMPUTER, P1

[16] ON AN ITERATIVE TECHNIQUE FOR RICCATI EQUATION COMPUTATIONS [J].

KLEINMAN, DL .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1968, AC13 (01) :114-+

[17] Development and Evaluation of a Cooperative Vehicle Intersection Control Algorithm Under the Connected Vehicles Environment [J].

Lee, Joyoung ;

Park, Byungkyu .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2012, 13 (01) :81-90

[18] Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data [J].

Lewis, F. L. ;

Vamvoudakis, Kyriakos G. .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2011, 41 (01) :14-25

[19] Model Predictive Multi-Objective Vehicular Adaptive Cruise Control [J].

Li, Shengbo ;

Li, Keqiang ;

Rajamani, Rajesh ;

Wang, Jianqiang .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2011, 19 (03) :556-566

[20] Coordinated output regulation of heterogeneous linear systems under switching topologies [J].

Meng, Ziyang ;

Yang, Tao ;

Dimarogonas, Dimos V. ;

Johansson, Karl Henrik .

AUTOMATICA, 2015, 53 :362-368

← 1 2 3 4 →