Traffic Signal Control Using Deep Reinforcement Learning with Multiple Resources of Rewards

被引：6

作者：

Zhong, Dunhao ^{[1
]}

Boukerche, Azzedine ^{[1
]}

机构：

[1] Univ Ottawa, EECS, PARADISE Res Lab, Ottawa, ON, Canada

来源：

PE-WASUN'19: PROCEEDINGS OF THE 16TH ACM INTERNATIONAL SYMPOSIUM ON PERFORMANCE EVALUATION OF WIRELESS AD HOC, SENSOR, & UBIQUITOUS NETWORKS | 2019年

基金：

加拿大自然科学与工程研究理事会;

关键词：

intelligent traffic signal control; deep reinforcement learning; multiple rewards;

D O I：

10.1145/3345860.3361522

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Intelligent traffic signal control is an effective way to solve the traffic congestion problem in the real world. One trend is to use Deep Reinforcement Learning (DRL) to control traffic signals based on the snapshots of traffic states. While most of the research used single numeric reward to frame multiple objectives, such as minimizing waiting time and waiting queue length, they overlooked that one reward for multiple objectives misleads agents taking wrong actions in certain states, which causes following traffic fluctuation. In this paper, we propose a DRL-based framework that uses multiple rewards for multiple objectives. Our framework aims to solve the difficulty of assessing behaviours by single numeric reward and control traffic flows more steadily. We evaluated our approach on both synthetic traffic scenarios and a real-world traffic dataset in Toronto. The results show that our approach outperformed single reward-based approaches.

引用

页码：23 / 28

页数：6

共 19 条

[1] Reinforcement learning-based multi-agent system for network traffic signal control
Arel, I.
Liu, C.
Urbanik, T.
Kohls, A. G.
[J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (02) : 128 - 135
[2] Distributed learning and multi-objectivity in traffic light control
Brys, Tim
Pham, Tong T.
Taylor, Matthew E.
[J]. CONNECTION SCIENCE, 2014, 26 (01) : 65 - 83
[3] Cookson G., 2018, INRIX GLOBAL TRAFFIC
[4] Multiobjective Reinforcement Learning for Traffic Signal Control Using Vehicular Ad Hoc Network
Duan Houli
Li Zhiheng
Zhang Yi
[J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
[5] El-Tantawy S., 2010, 2010 13th International IEEE Conference on Intelligent Transportation Systems (ITSC 2010), P665, DOI 10.1109/ITSC.2010.5625066
[6] Fukushima K., 1979, Transactions of the Institute of Electronics and Communication Engineers of Japan, Section E (English), VE62, P675
[7] Genders W., 2016, USING DEEP REINFORCE
[8] Adaptive multi-objective reinforcement learning with hybrid exploration for traffic signal control based on cooperative multi-agent framework
Khamis, Mohamed A.
Gomaa, Walid
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 29 : 134 - 151
[9] Khamis MA, 2012, IEEE INT C INTELL TR, P995, DOI 10.1109/ITSC.2012.6338853
[10] Liang X., 2018, ARXIV180311115

← 1 2 →