An adaptive traffic signal control scheme with Proximal Policy Optimization based on deep reinforcement learning for a single intersection

被引:1
|
作者
Wang, Lijuan [1 ,2 ]
Zhang, Guoshan [1 ]
Yang, Qiaoli [2 ]
Han, Tianyang [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Lanzhou Jiaotong Univ, Sch Automat & Elect Engn, Lanzhou 730070, Gansu, Peoples R China
[3] Univ Tokyo, Grad Sch Engn, Dept Civil Engn, 7-3-1 Hongo,Bunkyo Ku, Tokyo 1138656, Japan
基金
中国国家自然科学基金;
关键词
Traffic signal control; Proximal policy optimization; Deep reinforcement learning; SYSTEM;
D O I
10.1016/j.engappai.2025.110440
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive traffic signal control (ATSC) is an important means to alleviate traffic congestion and improve the quality of road traffic. Although deep reinforcement learning (DRL) technology has shown great potential in solving traffic signal control problems, the state representation and reward design, as well as action interval time, still need to be further studied. The advantages of policy learning have not been fully applied in TSC. To address the aforementioned issues, we propose a DRL-based traffic signal control scheme with Poximal Policy Optimization (PPO-TSC). We use the waiting time of vehicles and the queue length of lanes represented the spatiotemporal characteristics of traffic flow to design the simplified traffic states feature vectors, and define the reward function that is consistent with the state. Additionally, we compare and analyze the performance indexes obtained by various methods using action intervals of 5s, 10s, and 15s. The algorithm is implemented based on the Actor-Critic architecture, using the advantage estimation and the clip mechanism to constrain the range of gradient updates. We validate the proposed scheme at a single intersection in Simulation of Urban MObility (SUMO) under two different traffic demand patterns of flat traffic and peak traffic. The experimental results show that the proposed method is significantly better than other compared methods. Specifically, PPOTSC demonstrates a reduction of 24% in average travel time (ATT), a decrease of 45% in the average time loss (ATL), and an increase of 16% in average speed (AS) compared with the existing methods under peak traffic condition.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Backdoor attacks against deep reinforcement learning based traffic signal control systems
    Heng Zhang
    Jun Gu
    Zhikun Zhang
    Linkang Du
    Yongmin Zhang
    Yan Ren
    Jian Zhang
    Hongran Li
    Peer-to-Peer Networking and Applications, 2023, 16 : 466 - 474
  • [32] Backdoor attacks against deep reinforcement learning based traffic signal control systems
    Zhang, Heng
    Gu, Jun
    Zhang, Zhikun
    Du, Linkang
    Zhang, Yongmin
    Ren, Yan
    Zhang, Jian
    Li, Hongran
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2023, 16 (01) : 466 - 474
  • [33] Robust Deep Reinforcement Learning for Traffic Signal Control
    Kai Liang Tan
    Anuj Sharma
    Soumik Sarkar
    Journal of Big Data Analytics in Transportation, 2020, 2 (3): : 263 - 274
  • [34] Model-Based Deep Reinforcement Learning with Traffic Inference for Traffic Signal Control
    Wang, Hao
    Zhu, Jinan
    Gu, Bao
    APPLIED SCIENCES-BASEL, 2023, 13 (06):
  • [35] Deep Reinforcement Learning Based on Proximal Policy Optimization for the Maintenance of a Wind Farm with Multiple Crews
    Pinciroli, Luca
    Baraldi, Piero
    Ballabio, Guido
    Compare, Michele
    Zio, Enrico
    ENERGIES, 2021, 14 (20)
  • [36] Adaptive energy management strategy for FCHEV based on improved proximal policy optimization in deep reinforcement learning algorithm
    Lu, Xueqin
    Qian, Shenchen
    Zhai, Xinrui
    Wang, Peiyinquan
    Wu, Tao
    ENERGY CONVERSION AND MANAGEMENT, 2024, 321
  • [37] A proximal policy optimization based deep reinforcement learning framework for tracking control of a flexible robotic manipulator
    Kumar, V. Joshi
    Elumalai, Vinodh Kumar
    RESULTS IN ENGINEERING, 2025, 25
  • [38] Proximal policy optimization learning based control of congested freeway traffic
    Mo, Shurong
    Wu, Nailong
    Qi, Jie
    Pan, Anqi
    Feng, Zhiguang
    Yan, Huaicheng
    Wang, Yueying
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2024, 45 (02) : 719 - 736
  • [39] Adaptive traffic signal control system using composite reward architecture based deep reinforcement learning
    Jamil, Abu Rafe Md
    Ganguly, Kishan Kumar
    Nower, Naushin
    IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (14) : 2030 - 2041
  • [40] Traffic Signal Control with State-Optimizing Deep Reinforcement Learning and Fuzzy Logic
    Meepokgit, Teerapun
    Wisayataksin, Sumek
    APPLIED SCIENCES-BASEL, 2024, 14 (17):