Hybrid Reinforcement Learning-Based Eco-Driving Strategy for Connected and Automated Vehicles at Signalized Intersections

被引：87

作者：

Bai, Zhengwei ^{[1
]}

Hao, Peng ^{[2
]}

Shangguan, Wei ^{[3
,4
]}

Cai, Baigen ^{[3
,4
]}

Barth, Matthew J. ^{[1
]}

机构：

[1] Univ Calif Riverside, Dept Elect & Comp Engn, Riverside, CA 92507 USA

[2] Univ Calif Riverside, Ctr Environm Res & Technol, Riverside, CA 92507 USA

[3] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Sch Elect & Informat Engn, Beijing 100044, Peoples R China

[4] Beijing Jiaotong Univ, Beijing Engn Res Ctr EMC & GNSS Technol Rail Tran, Sch Elect & Informat Engn, Beijing 100044, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2022年 / 23卷 / 09期

关键词：

Task analysis; Reinforcement learning; Numerical models; Vehicle dynamics; Energy consumption; Uncertainty; Predictive models; Hybrid reinforcement learning; connected and automated vehicle; eco-driving strategy; signalized intersections; TECHNOLOGY; NETWORKS;

D O I：

10.1109/TITS.2022.3145798

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Taking advantage of both vehicle-to-everything (V2X) communication and automated driving technology, connected and automated vehicles are quickly becoming one of the transformative solutions to many transportation problems. However, in a mixed traffic environment at signalized intersections, it is still a challenging task to improve overall throughput and energy efficiency considering the complexity and uncertainty in the traffic system. In this study, we proposed a hybrid reinforcement learning (HRL) framework which combines the rule-based strategy and the deep reinforcement learning (deep RL) to support connected eco-driving at signalized intersections in mixed traffic. Vision-perceptive methods are integrated with vehicle-to-infrastructure (V2I) communications to achieve higher mobility and energy efficiency in mixed connected traffic. The HRL framework has three components: a rule-based driving manager that operates the collaboration between the rule-based policies and the RL policy; a multi-stream neural network that extracts the hidden features of vision and V2I information; and a deep RL-based policy network that generate both longitudinal and lateral eco-driving actions. In order to evaluate our approach, we developed a Unity-based simulator and designed a mixed-traffic intersection scenario. Moreover, several baselines were implemented to compare with our new design, and numerical experiments were conducted to test the performance of the HRL model. The experiments show that our HRL method can reduce energy consumption by 12.70% and save 11.75% travel time when compared with a state-of-the-art model-based Eco-Driving approach.

引用

页码：15850 / 15863

页数：14

共 40 条

[1]

Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265

[2]

Albawi S, 2017, I C ENG TECHNOL

[3]

[Anonymous], 2013, NATL GREENHOUSE GAS

[4]

Bai ZW, 2019, CHIN CONTR CONF, P8600, DOI [10.23919/ChiCC.2019.8866005, 10.23919/chicc.2019.8866005]

[5]

Bertsekas DP, 1995, Dynamic Programming and Optimal Control, V1

[6]

Chen JY, 2018, IEEE INT VEH SYM, P1239, DOI 10.1109/IVS.2018.8500368

[7] Eco-driving in urban traffic networks using traffic signals information [J].

De Nunzio, Giovanni ;

de Wit, Carlos Canudas ;

Moulin, Philippe ;

Di Domenico, Domenico .

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2016, 26 (06) :1307-1324

[8] Cooperative Adaptive Cruise Control: A Reinforcement Learning Approach [J].

Desjardins, Charles ;

Chaib-draa, Brahim .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (04) :1248-1260

[9] An Intersection Game-Theory-Based Traffic Control Algorithm in a Connected Vehicle Environment [J].

Elhenawy, Mohammed ;

Elbery, Ahmed A. ;

Hassan, Abdallah A. ;

Rakha, Hesham A. .

2015 IEEE 18TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, :343-347

[10] Preparing a nation for autonomous vehicles: opportunities, barriers and policy recommendations [J].

Fagnant, Daniel J. ;

Kockelman, Kara .

TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 2015, 77 :167-181

← 1 2 3 4 →