Graph neural network and reinforcement learning for multi-agent cooperative control of connected autonomous vehicles

被引：160

作者：

Chen, Sikai ^{[1
,2
,3
]}

Dong, Jiqian ^{[1
,2
]}

Ha, Paul ^{[1
,2
]}

Li, Yujie ^{[1
,2
]}

Labi, Samuel ^{[1
,2
]}

机构：

[1] Purdue Univ, Ctr Connected & Automated Transportat CCAT, W Lafayette, IN 47907 USA

[2] Purdue Univ, Lyles Sch Civil Engn, W Lafayette, IN 47907 USA

[3] Carnegie Mellon Univ, Sch Comp Sci, Robot Inst, Pittsburgh, PA 15213 USA

来源：

COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING | 2021年 / 36卷 / 07期

关键词：

CRACK DETECTION; TRAJECTORY OPTIMIZATION; DYNAMIC CLASSIFICATION; INTERSECTION CONTROL; MODEL;

D O I：

10.1111/mice.12702

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

A connected autonomous vehicle (CAV) network can be defined as a set of connected vehicles including CAVs that operate on a specific spatial scope that may be a road network, corridor, or segment. The spatial scope constitutes an environment where traffic information is shared and instructions are issued for controlling the CAVs movements. Within such a spatial scope, high-level cooperation among CAVs fostered by joint planning and control of their movements can greatly enhance the safety and mobility performance of their operations. Unfortunately, the highly combinatory and volatile nature of CAV networks due to the dynamic number of agents (vehicles) and the fast-growing joint action space associated with multi-agent driving tasks pose difficultly in achieving cooperative control. The problem is NP-hard and cannot be efficiently resolved using rule-based control techniques. Also, there is a great deal of information in the literature regarding sensing technologies and control logic in CAV operations but relatively little information on the integration of information from collaborative sensing and connectivity sources. Therefore, we present a novel deep reinforcement learning-based algorithm that combines graphic convolution neural network with deep Q-network to form an innovative graphic convolution Q network that serves as the information fusion module and decision processor. In this study, the spatial scope we consider for the CAV network is a multi-lane road corridor. We demonstrate the proposed control algorithm using the application context of freeway lane-changing at the approaches to an exit ramp. For purposes of comparison, the proposed model is evaluated vis-a-vis traditional rule-based and long short-term memory-based fusion models. The results suggest that the proposed model is capable of aggregating information received from sensing and connectivity sources and prescribing efficient operative lane-change decisions for multiple CAVs, in a manner that enhances safety and mobility. That way, the operational intentions of individual CAVs can be fulfilled even in partially observed and highly dynamic mixed traffic streams. The paper presents experimental evidence to demonstrate that the proposed algorithm can significantly enhance CAV operations. The proposed algorithm can be deployed at roadside units or cloud platforms or other centralized control facilities.

引用

页码：838 / 857

页数：20

共 88 条

[1] Enhanced probabilistic neural network with local decision circles: A robust classifier [J].

Ahmadlou, Mehran ;

Adeli, Hojjat .

INTEGRATED COMPUTER-AIDED ENGINEERING, 2010, 17 (03) :197-210

[2] A dynamic ensemble learning algorithm for neural networks [J].

Alam, Kazi Md Rokibul ;

Siddique, Nazmul ;

Adeli, Hojjat .

NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12) :8675-8690

[3]

Alamaniotis M, 2014, 5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, P33, DOI 10.1109/IISA.2014.6878812

[4] APPLICATIONS OF GRAPH-THEORY IN CHEMISTRY [J].