Graph neural network and reinforcement learning for multi-agent cooperative control of connected autonomous vehicles

被引:160
作者
Chen, Sikai [1 ,2 ,3 ]
Dong, Jiqian [1 ,2 ]
Ha, Paul [1 ,2 ]
Li, Yujie [1 ,2 ]
Labi, Samuel [1 ,2 ]
机构
[1] Purdue Univ, Ctr Connected & Automated Transportat CCAT, W Lafayette, IN 47907 USA
[2] Purdue Univ, Lyles Sch Civil Engn, W Lafayette, IN 47907 USA
[3] Carnegie Mellon Univ, Sch Comp Sci, Robot Inst, Pittsburgh, PA 15213 USA
关键词
CRACK DETECTION; TRAJECTORY OPTIMIZATION; DYNAMIC CLASSIFICATION; INTERSECTION CONTROL; MODEL;
D O I
10.1111/mice.12702
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A connected autonomous vehicle (CAV) network can be defined as a set of connected vehicles including CAVs that operate on a specific spatial scope that may be a road network, corridor, or segment. The spatial scope constitutes an environment where traffic information is shared and instructions are issued for controlling the CAVs movements. Within such a spatial scope, high-level cooperation among CAVs fostered by joint planning and control of their movements can greatly enhance the safety and mobility performance of their operations. Unfortunately, the highly combinatory and volatile nature of CAV networks due to the dynamic number of agents (vehicles) and the fast-growing joint action space associated with multi-agent driving tasks pose difficultly in achieving cooperative control. The problem is NP-hard and cannot be efficiently resolved using rule-based control techniques. Also, there is a great deal of information in the literature regarding sensing technologies and control logic in CAV operations but relatively little information on the integration of information from collaborative sensing and connectivity sources. Therefore, we present a novel deep reinforcement learning-based algorithm that combines graphic convolution neural network with deep Q-network to form an innovative graphic convolution Q network that serves as the information fusion module and decision processor. In this study, the spatial scope we consider for the CAV network is a multi-lane road corridor. We demonstrate the proposed control algorithm using the application context of freeway lane-changing at the approaches to an exit ramp. For purposes of comparison, the proposed model is evaluated vis-a-vis traditional rule-based and long short-term memory-based fusion models. The results suggest that the proposed model is capable of aggregating information received from sensing and connectivity sources and prescribing efficient operative lane-change decisions for multiple CAVs, in a manner that enhances safety and mobility. That way, the operational intentions of individual CAVs can be fulfilled even in partially observed and highly dynamic mixed traffic streams. The paper presents experimental evidence to demonstrate that the proposed algorithm can significantly enhance CAV operations. The proposed algorithm can be deployed at roadside units or cloud platforms or other centralized control facilities.
引用
收藏
页码:838 / 857
页数:20
相关论文
共 88 条
[1]   Enhanced probabilistic neural network with local decision circles: A robust classifier [J].
Ahmadlou, Mehran ;
Adeli, Hojjat .
INTEGRATED COMPUTER-AIDED ENGINEERING, 2010, 17 (03) :197-210
[2]   A dynamic ensemble learning algorithm for neural networks [J].
Alam, Kazi Md Rokibul ;
Siddique, Nazmul ;
Adeli, Hojjat .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12) :8675-8690
[3]  
Alamaniotis M, 2014, 5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, P33, DOI 10.1109/IISA.2014.6878812
[4]   APPLICATIONS OF GRAPH-THEORY IN CHEMISTRY [J].
BALABAN, AT .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (03) :334-343
[5]   Encoder-decoder network for pixel-level road crack detection in black-box images [J].
Bang, Seongdeok ;
Park, Somin ;
Kim, Hongjo ;
Kim, Hyoungkwan .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2019, 34 (08) :713-727
[6]   GRAPH THEORY AND SOCIAL NETWORKS - TECHNICAL COMMENT ON CONNECTEDNESS AND CONNECTIVITY [J].
BARNES, JA .
SOCIOLOGY-THE JOURNAL OF THE BRITISH SOCIOLOGICAL ASSOCIATION, 1969, 3 :215-232
[7]   The complexity of decentralized control of Markov decision processes [J].
Bernstein, DS ;
Givan, R ;
Immerman, N ;
Zilberstein, S .
MATHEMATICS OF OPERATIONS RESEARCH, 2002, 27 (04) :819-840
[8]  
Bourbakis NG, 2017, IEEE TRANSP ELECT C, P767, DOI 10.1109/ITEC.2017.7993366
[9]  
Boutilier C, 1996, THEORETICAL ASPECTS OF RATIONALITY AND KNOWLEDGE, P195
[10]  
Chen, 2019, THESIS PURDUE U