Dual-Objective Reinforcement Learning-Based Adaptive Traffic Signal Control for Decarbonization and Efficiency Optimization
被引:5
作者:
Zhang, Gongquan
论文数: 0引用数: 0
h-index: 0
机构:
Cent South Univ, Sch Traff & Transportat Engn, Changsha 410075, Peoples R China
Harvard Univ, Harvard Med Sch, Boston, MA 02138 USACent South Univ, Sch Traff & Transportat Engn, Changsha 410075, Peoples R China
Zhang, Gongquan
[1
,2
]
Chang, Fangrong
论文数: 0引用数: 0
h-index: 0
机构:
Cent South Univ, Sch Resources & Safety Engn, Changsha 410083, Peoples R ChinaCent South Univ, Sch Traff & Transportat Engn, Changsha 410075, Peoples R China
Chang, Fangrong
[3
]
Huang, Helai
论文数: 0引用数: 0
h-index: 0
机构:
Cent South Univ, Sch Traff & Transportat Engn, Changsha 410075, Peoples R ChinaCent South Univ, Sch Traff & Transportat Engn, Changsha 410075, Peoples R China
Huang, Helai
[1
]
Zhou, Zilong
论文数: 0引用数: 0
h-index: 0
机构:
Cent South Univ, Sch Resources & Safety Engn, Changsha 410083, Peoples R ChinaCent South Univ, Sch Traff & Transportat Engn, Changsha 410075, Peoples R China
Zhou, Zilong
[3
]
机构:
[1] Cent South Univ, Sch Traff & Transportat Engn, Changsha 410075, Peoples R China
[2] Harvard Univ, Harvard Med Sch, Boston, MA 02138 USA
[3] Cent South Univ, Sch Resources & Safety Engn, Changsha 410083, Peoples R China
adaptive signal control system;
intersections;
carbon emissions;
deep reinforcement learning;
CONTROL-SYSTEM;
EMISSIONS;
INTERSECTIONS;
STRATEGY;
VEHICLES;
NETWORK;
SAFETY;
MODEL;
CONSUMPTION;
DESIGN;
D O I:
10.3390/math12132056
中图分类号:
O1 [数学];
学科分类号:
0701 ;
070101 ;
摘要:
To improve traffic efficiency, adaptive traffic signal control (ATSC) systems have been widely developed. However, few studies have proactively optimized the air environmental issues in the development of ATSC. To fill this research gap, this study proposes an optimized ATSC algorithm to take into consideration both traffic efficiency and decarbonization. The proposed algorithm is developed based on the deep reinforcement learning (DRL) framework with dual goals (DRL-DG) for traffic control system optimization. A novel network structure combining Convolutional Neural Networks and Long Short-Term Memory Networks is designed to map the intersection traffic state to a Q-value, accelerating the learning process. The reward mechanism involves a multi-objective optimization function, employing the entropy weight method to balance the weights among dual goals. Based on a representative intersection in Changsha, Hunan Province, China, a simulated intersection scenario is constructed to train and test the proposed algorithm. The result shows that the ATSC system optimized by the proposed DRL-DG results in a reduction of more than 71% in vehicle waiting time and 46% in carbon emissions compared to traditional traffic signal control systems. It converges faster and achieves a balanced dual-objective optimization compared to the prevailing DRL-based ATSC.
机构:
Imperial Coll London, Dept Comp, London, England
PROWLER Io, Cambridge, EnglandImperial Coll London, Dept Bioengn, London, England
Deisenroth, Marc Peter
Brundage, Miles
论文数: 0引用数: 0
h-index: 0
机构:
Arizona State Univ, Sci & Technol Dept, Human & Social Dimens, Tempe, AZ 85287 USA
Univ Oxford, Future Humanity Inst, Oxford, EnglandImperial Coll London, Dept Bioengn, London, England
机构:
Imperial Coll London, Dept Comp, London, England
PROWLER Io, Cambridge, EnglandImperial Coll London, Dept Bioengn, London, England
Deisenroth, Marc Peter
Brundage, Miles
论文数: 0引用数: 0
h-index: 0
机构:
Arizona State Univ, Sci & Technol Dept, Human & Social Dimens, Tempe, AZ 85287 USA
Univ Oxford, Future Humanity Inst, Oxford, EnglandImperial Coll London, Dept Bioengn, London, England