Reinforcement-Learning-Based Cooperative Adaptive Cruise Control of Buses in the Lincoln Tunnel Corridor with Time-Varying Topology

Cited by: 58

Authors:

Gao, Weinan [1]

Gao, Jingqin [2]

Ozbay, Kaan [2,3]

Jiang, Zhong-Ping [4]

Affiliations:

[1] Georgia Southern Univ, Allen E Paulson Coll Engn & Comp, Dept Elect & Comp Engn, Statesboro, GA 30460 USA

[2] NYU, Tandon Sch Engn, Dept Civil & Urban Engn, Brooklyn, NY 11201 USA

[3] NYU, Tandon Sch Engn, C2SMART Tier 1 Univ Transportat Ctr, Brooklyn, NY 11201 USA

[4] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, Control & Networks Lab, Brooklyn, NY 11201 USA

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2019 / Vol. 20 / No. 10

Funding:

U.S. National Science Foundation;

Keywords:

Reinforcement learning; connected and autonomous vehicles; cooperative adaptive cruise control; time-varying topology; SYSTEMS;

DOI:

10.1109/TITS.2019.2895285

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813 ;

Abstract:

The exclusive bus lane (XBL) is one of the most popular bus transit systems in the U.S. The Lincoln Tunnel utilizes an XBL through the tunnel in the AM peak period. This paper proposes a novel data-driven cooperative adaptive cruise control (CACC) algorithm that aims to minimize a cost function for connected and autonomous buses along the XBL. Unlike existing model-based CACC algorithms, the proposed approach employs the idea of reinforcement learning, which does not rely on accurate knowledge of bus dynamics. Considering a time-varying topology, where each autonomous vehicle can only receive information from preceding vehicles that are within its communication range, a distributed controller is learned in real time from online headway, velocity, and acceleration data collected along the system trajectories. The convergence of the proposed algorithm and the stability of the closed-loop system are rigorously analyzed. The effectiveness of the proposed approach is demonstrated using a well-calibrated Paramics microscopic traffic simulation model of the XBL corridor. The simulation results show that the travel time in the autonomous version of the XBL is close to the present-day travel time even when the bus volume is increased by 30%.


Pages: 3796-3805

Page count: 10
