Combining Reinforcement Learning with Model Predictive Control for On-Ramp Merging

被引:27
|
作者
Lubars, Joseph [1 ,2 ]
Gupta, Harsh [1 ,2 ]
Chinchali, Sandeep [4 ]
Li, Liyun [3 ]
Raja, Adnan [3 ]
Srikant, R. [1 ,2 ]
Wu, Xinzhou [3 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Champaign, IL 61820 USA
[2] Univ Illinois, Coordinated Sci Lab, Champaign, IL 61820 USA
[3] Xmotorsai, Santa Clara, CA USA
[4] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
关键词
GAME; GO;
D O I
10.1109/ITSC48978.2021.9564954
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of designing an algorithm to allow a car to autonomously merge on to a highway from an on-ramp. Two broad classes of techniques have been proposed to solve motion planning problems in autonomous driving: Model Predictive Control (MPC) and Reinforcement Learning (RL). In this paper, we first establish the strengths and weaknesses of state-of-the-art MPC and RL-based techniques through simulations. We show that the performance of the RL agent is worse than that of the MPC solution from the perspective of safety and robustness to out-of-distribution traffic patterns, i.e., traffic patterns which were not seen by the RL agent during training. On the other hand, the performance of the RL agent is better than that of the MPC solution when it comes to efficiency and passenger comfort. We subsequently present an algorithm which blends the model-free RL agent with the MPC solution and show that it provides better tradeoffs between all metrics - passenger comfort, efficiency, crash rate and robustness.
引用
收藏
页码:942 / 947
页数:6
相关论文
共 50 条
  • [21] Safety benefit of cooperative control for heterogeneous traffic on-ramp merging
    Xiao Jing
    Xin Pei
    Song Yan
    Chunyang Han
    Selpi
    Eleonora Andreotti
    Jishiyu Ding
    Transportation Safety and Environment, 2022, (04) : 84 - 91
  • [22] Cooperative On-Ramp Merging Control Model for Mixed Traffic on Multi-Lane Freeways
    Hou, Kangning
    Zheng, Fangfang
    Liu, Xiaobo
    Guo, Ge
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (10) : 10774 - 10790
  • [23] Spatial-Dependent Robust Control Strategy for On-Ramp Merging
    Meng, Tianchuang
    Huang, Jin
    Hu, Ziniu
    Yang, Zeyu
    Chen, Ye-Hwa
    Yang, Diange
    Zhong, Zhihua
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (03) : 3191 - 3205
  • [24] Pseudospectral convex optimization for on-ramp merging control of connected vehicles
    Shi, Yang
    Wang, Zhenbo
    Wang, Chieh
    Shao, Yunli
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (15): : 10972 - 10999
  • [25] Double-Layer Optimal Control Method for On-Ramp Merging
    Xia, Sen
    Yan, Yanbin
    Fang, Yukun
    An, Ran
    Wu, Xia
    Min, Haigen
    Xu, Zhigang
    CICTP 2023: INNOVATION-EMPOWERED TECHNOLOGY FOR SUSTAINABLE, INTELLIGENT, DECARBONIZED, AND CONNECTED TRANSPORTATION, 2023, : 2230 - 2241
  • [26] Reinforcement Learning with Probabilistically Safe Control Barrier Functions for Ramp Merging
    Udatha, Soumith
    Lyu, Yiwei
    Dolan, John
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 5625 - 5630
  • [27] Learning When to Drive in Intersections by Combining Reinforcement Learning and Model Predictive Control
    Tram, Tommy
    Batkovic, Ivo
    Ali, Mohammad
    Sjoberg, Jonas
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 3263 - 3268
  • [28] On-ramp control
    Huang, D
    TRAFFIC AND GRANULAR FLOW'01, 2003, : 351 - 356
  • [29] Deep reinforcement learning algorithm based ramp merging decision model
    Chen, Zeyu
    Du, Yu
    Jiang, Anni
    Miao, Siqi
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2025, 239 (01) : 70 - 84
  • [30] Deep reinforcement learning algorithm based ramp merging decision model
    Chen, Zeyu
    Du, Yu
    Jiang, Anni
    Miao, Siqi
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2025, 239 (01) : 70 - 84