Image-based traffic signal control via world models

被引:29
作者
Dai, Xingyuan [1 ,2 ]
Zhao, Chen [1 ,2 ]
Wang, Xiao [3 ]
Lv, Yisheng [1 ,2 ]
Lin, Yilun [4 ]
Wang, Fei-Yue [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Anhui Univ, Sch Artificial Intelligence, Hefei 230039, Peoples R China
[4] Shanghai AI Lab, Shanghai 200232, Peoples R China
基金
中国国家自然科学基金;
关键词
Traffic signal control; Traffic prediction; Traffic world model; Reinforcement learning; U491; TP181; PARALLEL CONTROL; NETWORK;
D O I
10.1631/FITEE.2200323
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traffic signal control is shifting from passive control to proactive control, which enables the controller to direct current traffic flow to reach its expected destinations. To this end, an effective prediction model is needed for signal controllers. What to predict, how to predict, and how to leverage the prediction for control policy optimization are critical problems for proactive traffic signal control. In this paper, we use an image that contains vehicle positions to describe intersection traffic states. Then, inspired by a model-based reinforcement learning method, DreamerV2, we introduce a novel learning-based traffic world model. The traffic world model that describes traffic dynamics in image form is used as an abstract alternative to the traffic environment to generate multi-step planning data for control policy optimization. In the execution phase, the optimized traffic controller directly outputs actions in real time based on abstract representations of traffic states, and the world model can also predict the impact of different control behaviors on future traffic conditions. Experimental results indicate that the traffic world model enables the optimized real-time control policy to outperform common baselines, and the model achieves accurate image-based prediction, showing promising applications in futuristic traffic signal control.
引用
收藏
页码:1795 / 1813
页数:19
相关论文
共 39 条
  • [1] Hierarchical traffic signal optimization using reinforcement learning and traffic prediction with long-short term memory
    Abdoos, Monireh
    Bazzan, Ana L. C.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 171
  • [2] [Anonymous], 2000, MACHINE LEARNING P 1, P1151
  • [3] [Anonymous], 2015, NATURE
  • [4] Multiagent Reinforcement Learning: Rollout and Policy Iteration
    Bertsekas, Dimitri
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2021, 8 (02) : 249 - 272
  • [5] DeepTrend 2.0: A light-weighted multi-scale traffic prediction model using detrending
    Dai, Xingyuan
    Fu, Rui
    Zhao, Enmin
    Zhang, Zuo
    Lin, Yilun
    Wang, Fei-Yue
    Li, Li
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2019, 103 : 142 - 157
  • [6] Urban traffic signal control with connected and automated vehicles: A survey
    Guo, Qiangqiang
    Li, Li
    Ban, Xuegang
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2019, 101 : 313 - 334
  • [7] Hafner D, 2019, PR MACH LEARN RES, V97
  • [8] Hafner D, 2022, Arxiv, DOI [arXiv:2010.02193, DOI 10.48550/ARXIV.2010.02193]
  • [9] Model based urban traffic control, part I: Local model and local model predictive controllers
    Hao, Zhenzhen
    Boel, Rene
    Li, Zhiwu
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2018, 97 : 61 - 81
  • [10] An End-to-End Recommendation System for Urban Traffic Controls and Management Under a Parallel Learning Framework
    Jin, Junchen
    Guo, Haifeng
    Xu, Jia
    Wang, Xiao
    Wang, Fei-Yue
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (03) : 1616 - 1626