Traffic signal optimization control method based on adaptive weighted averaged double deep Q network

被引:5
作者
Chen, Youqing [1 ]
Zhang, Huizhen [1 ]
Liu, Minglei [1 ]
Ye, Ming [1 ]
Xie, Hui [1 ]
Pan, Yubiao [1 ]
机构
[1] Huaqiao Univ, Coll Comp Sci & Technol, Xiamen 361024, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Deep learning; Double deep Q network; Intelligent transportation; Traffic signal control;
D O I
10.1007/s10489-023-04469-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a critical node and major bottleneck of the urban traffic networks, the control of traffic signals at road intersections has an essential impact on road traffic flow and congestion. Deep reinforcement learning algorithms have shown excellent control effects on traffic signal timing optimization. Still, the diversity of actual road control scenarios and real-time control requirements have put forward higher requirements on the adaptiveness of the algorithms. This paper proposes an Adaptive Weighted Averaged Double Deep Q Network (AWA-DDQN) based traffic signal optimal control method. Firstly, the formula is used to calculate the double estimator weight for updating the network model. Then, the mean value of the action evaluation is calculated by the network history parameters as the target value. Based on this, a certain number of adjacent action evaluation values are used to generate hyperparameters for weight calculation through the fully connected layer, and the number of action values for mean calculation is gradually reduced to enhance the stability of model training. Finally, simulation experiments were conducted using the traffic simulation software Vissim. The results show that the AWA-DDQN-based signal control method effectively reduces the average delay time, the average queue length and the average number of stops of vehicles compared with existing methods, and significantly improves traffic flow efficiency at intersections.
引用
收藏
页码:18333 / 18354
页数:22
相关论文
共 44 条
  • [41] Zhang ZZ, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3455
  • [42] IPDALight: Intensity- and phase duration-aware traffic signal control based on Reinforcement Learning
    Zhao, Wupan
    Ye, Yutong
    Ding, Jiepin
    Wang, Ting
    Wei, Tongquan
    Chen, Mingsong
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 123
  • [43] Efficient Multiagent Policy Optimization Based on Weighted Estimators in Stochastic Cooperative Environments
    Zheng, Yan
    Hao, Jian-Ye
    Zhang, Zong-Zhang
    Meng, Zhao-Peng
    Hao, Xiao-Tian
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2020, 35 (02) : 268 - 280
  • [44] Context-Aware Multiagent Broad Reinforcement Learning for Mixed Pedestrian-Vehicle Adaptive Traffic Light Control
    Zhu, Ruijie
    Wu, Shuning
    Li, Lulu
    Lv, Ping
    Xu, Mingliang
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (20): : 19694 - 19705