Traffic Signal Control Method Based on Modified Proximal Policy Optimization

被引：4

作者：

An, Yaohui ^{[1
]}

Zhang, Jing ^{[1
]}

机构：

[1] Tiangong Univ, Tianjin, Peoples R China

来源：

2022 10TH INTERNATIONAL CONFERENCE ON TRAFFIC AND LOGISTIC ENGINEERING (ICTLE 2022) | 2022年

关键词：

intelligent transportation; traffic signal control; proximal policy optimization; intersection; importance sampling;

D O I：

10.1109/ICTLE55577.2022.9901894

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

For the traffic congestion problem at intersections, this paper proposed a modified proximal policy optimization algorithm, which can adaptively regulate traffic signals to alleviate intersection congestion. The algorithm made further theoretical corrections to the unbiasedness of the advantage function in it through the importance sampling method. The algorithm was also modified to better adapt to traffic signal control and thus increase the sensitivity of the algorithm to control actions. In order to fully extract road state information and reduce the complexity of the state space, the experiment used images as state inputs, which are real-time snapshots of road conditions at intersections. To verify the performance of the proposed algorithm, it was compared with other deep reinforcement learning algorithms in the traffic simulation software SUMO. The results show that the cumulative reward of the model constructed by the proposed algorithm is increased by 68.3% and the average waiting time is reduced by 65.3% compared to the baseline DQN model. Experimental results demonstrate that the algorithm can effectively regulate traffic congestion.

引用

页码：83 / 88

页数：6

共 16 条

[1]

[Anonymous], 2015, 2015 12 INT C EXPO E

[2]

Behrisch M., 2011, SUMO SIMULATION URBA, V2011

[3]

Casas N, 2017, Arxiv, DOI arXiv:1703.09035

[4] An Introduction to Deep Reinforcement Learning [J].

Francois-Lavet, Vincent ;

Henderson, Peter ;

Islam, Riashat ;

Bellemare, Marc G. ;

Pineau, Joelle .

FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2018, 11 (3-4) :219-354

[5] Modeling and optimizing traffic light settings in road networks [J].

Goettlich, Simone ;

Herty, Michael ;

Ziegler, Ute .

COMPUTERS & OPERATIONS RESEARCH, 2015, 55 :36-51

[6] Traffic signal timing via deep reinforcement learning [J].

Li L. ;

Lv Y. ;

Wang F.-Y. .

IEEE/CAA Journal of Automatica Sinica, 2016, 3 (03) :247-254

[7]

Mnih V, 2013, Arxiv, DOI arXiv:1312.5602

[8] Threshold Tuning Using Stochastic Optimization for Graded Signal Control [J].

Prashanth, L. A. ;

Bhatnagar, Shalabh .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2012, 61 (09) :3865-3880

[9]

Schulman J., 2017, arXiv, DOI [arXiv:1502.05477, DOI 10.48550/ARXIV.1502.05477]

[10]

Schulman J, 2017, Arxiv, DOI arXiv:1707.06347

← 1 2 →