The impact of deep reinforcement learning-based traffic signal control on Emission reduction in urban Road networks empowered by cooperative vehicle-infrastructure systems

被引:0
作者
Shang, Wen-Long [1 ,2 ,3 ]
Song, Xuewang [1 ]
Xiang, Qiannian [1 ]
Chen, Haibo [2 ]
Elhajj, Mireille [3 ]
Bi, Huibo [1 ]
Wang, Kun [4 ]
Ochieng, Washington [3 ]
机构
[1] Beijing Univ Technol, Coll Metropolitan Transportat, Beijing, Peoples R China
[2] Univ Leeds, Inst Transport Studies, Leeds, England
[3] Imperial Coll London, Ctr Transport Studies, Dept Civil & Environm Engn, London, England
[4] Hong Kong Polytech Univ, Dept Ind & Syst Engn, Hong Kong, Peoples R China
基金
北京市自然科学基金;
关键词
Deep reinforcement learning; Day-to-day; Cooperative vehicle infrastructure system; Urban road network; Signal control; Emission reduction; DYNAMIC USER EQUILIBRIUM; HIGH-RESOLUTION; TRANSPORTATION; ASSIGNMENT; MODEL; INFORMATION; ALGORITHM; POLICIES; WAVES;
D O I
10.1016/j.apenergy.2025.125884
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
The critical challenges of climate change have made carbon emission reduction an urgent global priority, with urban transportation systems (UTS) being significant contributors. Rapid urbanization and increased traffic demand have intensified congestion and carbon emissions. However, existing traffic signal control methods often rely on large historical data and pre-set signal timings, making it struggle to adapt dynamically to reduce emissions effectively. This study proposes a novel traffic signal control method based on Deep Reinforcement Learning (DRL), integrated with Cooperative Vehicle-Infrastructure Systems (CVIS) and a doubly Day-to-Day (DTD) Dynamic Traffic Assignment model, aimed at improving traffic efficiency and reducing carbon emissions. The DTD model simulates the urban road network as the training environment, while CVIS provides realtime traffic flow data that informs the DRL model's optimization of signal timings. Furthermore, the DTD model adjusts traveler departure times and route choices for significant emission reductions. The proposed DRL-based method significantly improves signal control efficiency and carbon emission reduction under complex traffic conditions. A case study on the Sioux Falls network indicates that the DRL-based traffic signal strategy outperforms traditional fixed-time control strategies, achieving CO2 emission reductions of 21 % to 27 % in various scenarios, particularly excelling in the S-V scenario. Notably, emissions on high-traffic Link 28 (S-V) were reduced by up to 54.9 %. This study underscores the potential of DRL in low-carbon traffic management and provides practical insights for the sustainable development of future traffic systems, offering robust solutions for emission reduction and efficient traffic management in UTS.
引用
收藏
页数:19
相关论文
共 107 条
[41]   A Dueling Deep Q-Network method for low-carbon traffic signal control [J].
Kang, Leilei ;
Huang, Hao ;
Lu, Weike ;
Liu, Lan .
APPLIED SOFT COMPUTING, 2023, 141
[42]   Adaptive Traffic Light Control With Deep Reinforcement Learning: An Evaluation of Traffic Flow and Energy Consumption [J].
Koch, Lucas ;
Brinkmann, Tobias ;
Wegener, Marius ;
Badalian, Kevin ;
Andert, Jakob .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (12) :15066-15076
[43]  
Kouridis C, 2009, INFORM TECHNOLOGIES
[44]   Simulation of rerouting phenomena in Dynamic Traffic Assignment with the Information Comply Model [J].
Kucharski, Rafal ;
Gentile, Guido .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2019, 126 :414-441
[45]   A day-to-day dynamical model for the evolution of path flows under disequilibrium of traffic networks with fixed demand [J].
Kumar, Amit ;
Peeta, Srinivas .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2015, 80 :235-256
[46]   Accounting for traffic speed dynamics when calculating COPERT and PHEM pollutant emissions at the urban scale [J].
Lejri, Delphine ;
Can, Arnaud ;
Schiper, Nicole ;
Leclercq, Ludovic .
TRANSPORTATION RESEARCH PART D-TRANSPORT AND ENVIRONMENT, 2018, 63 :588-603
[47]   Development and experiment of an intelligent connected cooperative vehicle infrastructure system based on multiple V2I modes and BWM-IGR method [J].
Li, Chunjie ;
Xu, Chengcheng ;
Chen, Yusen ;
Li, Zhibin .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2024, 635
[48]   ON KINEMATIC WAVES .2. A THEORY OF TRAFFIC FLOW ON LONG CROWDED ROADS [J].
LIGHTHILL, MJ ;
WHITHAM, GB .
PROCEEDINGS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL AND PHYSICAL SCIENCES, 1955, 229 (1178) :317-345
[49]   Deep reinforcement learning for transit signal priority in a connected environment [J].
Long, Meng ;
Zou, Xiexin ;
Zhou, Yue ;
Chung, Edward .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2022, 142
[50]   Spatial-temporal evolution analysis of the impact of smart transportation policies on urban carbon emissions [J].
Lu, Hao ;
Xiao, Chengyou ;
Jiao, Liudan ;
Du, Xiaosen ;
Huang, Aqiong .
SUSTAINABLE CITIES AND SOCIETY, 2024, 101