Image-based traffic signal control via world models

被引：31

作者：

Dai, Xingyuan ^{[1
,2
]}

Zhao, Chen ^{[1
,2
]}

Wang, Xiao ^{[3
]}

Lv, Yisheng ^{[1
,2
]}

Lin, Yilun ^{[4
]}

Wang, Fei-Yue ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

[3] Anhui Univ, Sch Artificial Intelligence, Hefei 230039, Peoples R China

[4] Shanghai AI Lab, Shanghai 200232, Peoples R China

来源：

FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING | 2022年 / 23卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Traffic signal control; Traffic prediction; Traffic world model; Reinforcement learning; U491; TP181; PARALLEL CONTROL; NETWORK;

D O I：

10.1631/FITEE.2200323

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Traffic signal control is shifting from passive control to proactive control, which enables the controller to direct current traffic flow to reach its expected destinations. To this end, an effective prediction model is needed for signal controllers. What to predict, how to predict, and how to leverage the prediction for control policy optimization are critical problems for proactive traffic signal control. In this paper, we use an image that contains vehicle positions to describe intersection traffic states. Then, inspired by a model-based reinforcement learning method, DreamerV2, we introduce a novel learning-based traffic world model. The traffic world model that describes traffic dynamics in image form is used as an abstract alternative to the traffic environment to generate multi-step planning data for control policy optimization. In the execution phase, the optimized traffic controller directly outputs actions in real time based on abstract representations of traffic states, and the world model can also predict the impact of different control behaviors on future traffic conditions. Experimental results indicate that the traffic world model enables the optimized real-time control policy to outperform common baselines, and the model achieves accurate image-based prediction, showing promising applications in futuristic traffic signal control.

引用

页码：1795 / 1813

页数：19

共 39 条

[1] Hierarchical traffic signal optimization using reinforcement learning and traffic prediction with long-short term memory [J].

Abdoos, Monireh ;

Bazzan, Ana L. C. .

EXPERT SYSTEMS WITH APPLICATIONS, 2021, 171

[2]

[Anonymous], 2000, P INT C MACH LEARN

[3]

[Anonymous], 2015, NATURE

[4] Multiagent Reinforcement Learning: Rollout and Policy Iteration [J].

Bertsekas, Dimitri .

IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2021, 8 (02) :249-272

[5] DeepTrend 2.0: A light-weighted multi-scale traffic prediction model using detrending [J].

Dai, Xingyuan ;

Fu, Rui ;

Zhao, Enmin ;

Zhang, Zuo ;

Lin, Yilun ;

Wang, Fei-Yue ;

Li, Li .

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2019, 103 :142-157

[6] Urban traffic signal control with connected and automated vehicles: A survey [J].

Guo, Qiangqiang ;

Li, Li ;

Ban, Xuegang .

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2019, 101 :313-334

[7]

Hafner D, 2019, PR MACH LEARN RES, V97

[8]

Hafner D, 2022, Arxiv, DOI arXiv:2010.02193

[9] Model based urban traffic control, part I: Local model and local model predictive controllers [J].

Hao, Zhenzhen ;

Boel, Rene ;

Li, Zhiwu .

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2018, 97 :61-81

[10] An End-to-End Recommendation System for Urban Traffic Controls and Management Under a Parallel Learning Framework [J].

Jin, Junchen ;

Guo, Haifeng ;

Xu, Jia ;

Wang, Xiao ;

Wang, Fei-Yue .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (03) :1616-1626

← 1 2 3 4 →