Reinforcement-Learning-Based Cooperative Adaptive Cruise Control of Buses in the Lincoln Tunnel Corridor with Time-Varying Topology

被引:54
作者
Gao, Weinan [1 ]
Gao, Jingqin [2 ]
Ozbay, Kaan [2 ,3 ]
Jiang, Zhong-Ping [4 ]
机构
[1] Georgia Southern Univ, Allen E Paulson Coll Engn & Comp, Dept Elect & Comp Engn, Statesboro, GA 30460 USA
[2] NYU, Tandon Sch Engn, Dept Civil & Urban Engn, Brooklyn, NY 11201 USA
[3] NYU, Tandon Sch Engn, C2SMART Tier 1 Univ Transportat Ctr, Brooklyn, NY 11201 USA
[4] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, Control & Networks Lab, Brooklyn, NY 11201 USA
基金
美国国家科学基金会;
关键词
Reinforcement learning; connected and autonomous vehicles; cooperative adaptive cruise control; time-varying topology; SYSTEMS;
D O I
10.1109/TITS.2019.2895285
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The exclusive bus lane (XBL) is one of the most popular bus transit systems in the U.S. The Lincoln Tunnel utilizes an XBL through the tunnel in the AM peak period. This paper proposes a novel data-driven cooperative adaptive cruise control (CACC) algorithm that aims to minimize a cost function for connected and autonomous buses along the XBL. Different from existing model-based CACC algorithms, the proposed approach employs the idea of reinforcement learning, which does not rely on accurate knowledge of bus dynamics. Considering a time-varying topology, where each autonomous vehicle can only receive information from preceding vehicles that are within its communication range, a distributed controller is learned real-time by online headway, velocity, and acceleration data collected from the system trajectories. The convergence of the proposed algorithm and the stability of the closed-loop system are rigorously analyzed. The effectiveness of the proposed approach is demonstrated using a well-calibrated Paramics microscopic traffic simulation model of the XBL corridor. The simulation results show that the travel time in the autonomous version of the XBL are close to the present day travel time even when the bus volume is increased by 30%.
引用
收藏
页码:3796 / 3805
页数:10
相关论文
共 50 条
  • [21] Reinforcement learning based time-varying formation control for quadrotor unmanned aerial vehicles system with input saturation
    Chi Ma
    Yizhe Cao
    Dianbiao Dong
    Applied Intelligence, 2023, 53 : 28730 - 28744
  • [22] MAS-Based Distributed Cooperative Control for DC Microgrid Through Switching Topology Communication Network With Time-Varying Delays
    Dou, Chunxia
    Yue, Dong
    Zhang, Zhanqiang
    Ma, Kai
    IEEE SYSTEMS JOURNAL, 2019, 13 (01): : 615 - 624
  • [23] Online Reinforcement-Learning-Based Adaptive Terminal Sliding Mode Control for Disturbed Bicycle Robots on a Curved Pavement
    Zhu, Xianjin
    Deng, Yang
    Zheng, Xudong
    Zheng, Qingyuan
    Liang, Bin
    Liu, Yu
    ELECTRONICS, 2022, 11 (21)
  • [24] Cooperative output feedback tracking control for multi-agent consensus with time-varying delays and switching topology
    Jiang, Yulian
    Liu, Jianchang
    Wang, Shenquan
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2015, 37 (04) : 550 - 559
  • [25] Platooning Cooperative Adaptive Cruise Control for Dynamic Performance and Energy Saving: A Comparative Study of Linear Quadratic and Reinforcement Learning-Based Controllers
    Borneo, Angelo
    Zerbato, Luca
    Miretti, Federico
    Tota, Antonio
    Galvagno, Enrico
    Misul, Daniela Anna
    APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [26] Deep Reinforcement Learning for Control of Time-Varying Musculoskeletal Systems With High Fatigability: A Feasibility Study
    Abreu, Jessica
    Crowder, Douglas C.
    Kirsch, Robert F.
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2022, 30 : 2613 - 2622
  • [27] Flight Testing Reinforcement-Learning-Based Online Adaptive Flight Control Laws on CS-25-Class Aircraft
    Konatala, Ramesh
    Milz, Daniel
    Weiser, Christian
    Looye, Gertjan
    van Kampen, E.
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2024, 47 (11) : 2460 - 2467
  • [28] Model-Based Safe Reinforcement Learning With Time-Varying Constraints: Applications to Intelligent Vehicles
    Zhang, Xinglong
    Peng, Yaoqian
    Luo, Biao
    Pan, Wei
    Xu, Xin
    Xie, Haibin
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (10) : 12744 - 12753
  • [29] Reinforcement Learning-Based Underwater Acoustic Channel Tracking for Correlated Time-Varying Channels
    Wang, Yuhang
    Li, Wei
    Huang, Qihang
    OCEANS 2021: SAN DIEGO - PORTO, 2021,
  • [30] Adaptive learning control synchronization for unknown time-varying complex dynamical networks with prescribed performance
    Fan, Aili
    Li, Junmin
    SOFT COMPUTING, 2021, 25 (07) : 5093 - 5103