Traffic signal optimization control method based on adaptive weighted averaged double deep Q network

被引：5

作者：

Chen, Youqing ^{[1
]}

Zhang, Huizhen ^{[1
]}

Liu, Minglei ^{[1
]}

Ye, Ming ^{[1
]}

Xie, Hui ^{[1
]}

Pan, Yubiao ^{[1
]}

机构：

[1] Huaqiao Univ, Coll Comp Sci & Technol, Xiamen 361024, Fujian, Peoples R China

来源：

APPLIED INTELLIGENCE | 2023年 / 53卷 / 15期

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Deep learning; Double deep Q network; Intelligent transportation; Traffic signal control;

D O I：

10.1007/s10489-023-04469-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As a critical node and major bottleneck of the urban traffic networks, the control of traffic signals at road intersections has an essential impact on road traffic flow and congestion. Deep reinforcement learning algorithms have shown excellent control effects on traffic signal timing optimization. Still, the diversity of actual road control scenarios and real-time control requirements have put forward higher requirements on the adaptiveness of the algorithms. This paper proposes an Adaptive Weighted Averaged Double Deep Q Network (AWA-DDQN) based traffic signal optimal control method. Firstly, the formula is used to calculate the double estimator weight for updating the network model. Then, the mean value of the action evaluation is calculated by the network history parameters as the target value. Based on this, a certain number of adjacent action evaluation values are used to generate hyperparameters for weight calculation through the fully connected layer, and the number of action values for mean calculation is gradually reduced to enhance the stability of model training. Finally, simulation experiments were conducted using the traffic simulation software Vissim. The results show that the AWA-DDQN-based signal control method effectively reduces the average delay time, the average queue length and the average number of stops of vehicles compared with existing methods, and significantly improves traffic flow efficiency at intersections.

引用

页码：18333 / 18354

页数：22

共 44 条

[41] Zhang ZZ, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3455
[42] IPDALight: Intensity- and phase duration-aware traffic signal control based on Reinforcement Learning
Zhao, Wupan
Ye, Yutong
Ding, Jiepin
Wang, Ting
Wei, Tongquan
Chen, Mingsong
[J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 123
[43] Efficient Multiagent Policy Optimization Based on Weighted Estimators in Stochastic Cooperative Environments
Zheng, Yan
Hao, Jian-Ye
Zhang, Zong-Zhang
Meng, Zhao-Peng
Hao, Xiao-Tian
[J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2020, 35 (02) : 268 - 280
[44] Context-Aware Multiagent Broad Reinforcement Learning for Mixed Pedestrian-Vehicle Adaptive Traffic Light Control
Zhu, Ruijie
Wu, Shuning
Li, Lulu
Lv, Ping
Xu, Mingliang
[J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (20): : 19694 - 19705

← 1 2 3 4 5 →