Traffic Signal Control Method Based on Modified Proximal Policy Optimization

被引:3
作者
An, Yaohui [1 ]
Zhang, Jing [1 ]
机构
[1] Tiangong Univ, Tianjin, Peoples R China
来源
2022 10TH INTERNATIONAL CONFERENCE ON TRAFFIC AND LOGISTIC ENGINEERING (ICTLE 2022) | 2022年
关键词
intelligent transportation; traffic signal control; proximal policy optimization; intersection; importance sampling;
D O I
10.1109/ICTLE55577.2022.9901894
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
For the traffic congestion problem at intersections, this paper proposed a modified proximal policy optimization algorithm, which can adaptively regulate traffic signals to alleviate intersection congestion. The algorithm made further theoretical corrections to the unbiasedness of the advantage function in it through the importance sampling method. The algorithm was also modified to better adapt to traffic signal control and thus increase the sensitivity of the algorithm to control actions. In order to fully extract road state information and reduce the complexity of the state space, the experiment used images as state inputs, which are real-time snapshots of road conditions at intersections. To verify the performance of the proposed algorithm, it was compared with other deep reinforcement learning algorithms in the traffic simulation software SUMO. The results show that the cumulative reward of the model constructed by the proposed algorithm is increased by 68.3% and the average waiting time is reduced by 65.3% compared to the baseline DQN model. Experimental results demonstrate that the algorithm can effectively regulate traffic congestion.
引用
收藏
页码:83 / 88
页数:6
相关论文
共 16 条
  • [1] [Anonymous], 2015, 2015 12 INT C EXPO E
  • [2] Behrisch M., 2011, Sumo-simulation of urban mobility: An overview, V2011
  • [3] Casas N, 2017, Arxiv, DOI [arXiv:1703.09035, DOI 10.48550/ARXIV.1703.09035]
  • [4] An Introduction to Deep Reinforcement Learning
    Francois-Lavet, Vincent
    Henderson, Peter
    Islam, Riashat
    Bellemare, Marc G.
    Pineau, Joelle
    [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2018, 11 (3-4): : 219 - 354
  • [5] Modeling and optimizing traffic light settings in road networks
    Goettlich, Simone
    Herty, Michael
    Ziegler, Ute
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2015, 55 : 36 - 51
  • [6] Traffic signal timing via deep reinforcement learning
    Li L.
    Lv Y.
    Wang F.-Y.
    [J]. Li, Li (li-li@tsinghua.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc. (03): : 247 - 254
  • [7] Mnih V, 2013, Arxiv, DOI [arXiv:1312.5602, 10.48550/ARXIV.1312.5602]
  • [8] Threshold Tuning Using Stochastic Optimization for Graded Signal Control
    Prashanth, L. A.
    Bhatnagar, Shalabh
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2012, 61 (09) : 3865 - 3880
  • [9] Schulman J., 2015, arXiv, DOI [arXiv:1502.05477, DOI 10.48550/ARXIV.1502.05477]
  • [10] Schulman J, 2017, Arxiv, DOI arXiv:1707.06347