RSG-GCN: Predicting Semantic Relationships in Urban Traffic Scene With Map Geometric Prior

被引:11
作者
Tian, Yafu [1 ,6 ]
Carballo, Alexander [2 ,3 ,4 ,5 ]
Li, Ruifeng [6 ]
Takeda, Kazuya [1 ,4 ,5 ]
机构
[1] Nagoya Univ, Grad Sch Informat, Nagoya 4648603, Japan
[2] Gifu Univ, Fac Engn, Gifu 5011193, Japan
[3] Gifu Univ, Grad Sch Engn, Gifu 5011193, Japan
[4] Nagoya Univ, Inst Innovat Future Soc, Nagoya 4648601, Japan
[5] Nagoya Univ, Tier IV Inc, Open Innovat Ctr, Nagoya 4506610, Japan
[6] Harbin Inst Technol, State Key Lab Robot & Intelligent Syst, Harbin 150001, Peoples R China
来源
IEEE OPEN JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS | 2023年 / 4卷
基金
中国国家自然科学基金; 日本科学技术振兴机构;
关键词
Graph neural networks; semantic relationship prediction; traffic scene understanding; GRAPH NEURAL-NETWORK;
D O I
10.1109/OJITS.2023.3260624
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated identification of the relationships between traffic actors and surrounding objects, in order to describe their behavior and predict their intentions, has become the focus of increasing attention in the field of autonomous driving. Therefore, in this work, we propose a Road Scene Graphs-Graph Convolutional Network (RSG-GCN) as a novel, graph-based model for predicting the topological graph structure of a given traffic scene. The status of the actors and HD map information are integrated as prior knowledge, allowing the edges linking the actor nodes to capture potential semantic relationships, such as "vehicle approaching pedestrian" and "pedestrian waiting at intersection". To train this model, we created our own RSG dataset, as well as a relational dataset and benchmark derived from nuScenes. Our extensive range of experiments demonstrate that our model can more accurately predict semantic relationships and behavior in a given traffic scene than other popular traffic scene prediction models. In particular, regarding the use of HD map prior knowledge, we found that the resulting increase in accuracy significantly outweighs performance loss caused by the increase in graph size. The downstream applications of RSG include traffic scene retrieval and synthetic traffic scene generation, which are briefly described.
引用
收藏
页码:244 / 260
页数:17
相关论文
共 83 条
[1]  
[Anonymous], OPENDRIVE 1 6
[2]   SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences [J].
Behley, Jens ;
Garbade, Martin ;
Milioto, Andres ;
Quenzel, Jan ;
Behnke, Sven ;
Stachniss, Cyrill ;
Gall, Juergen .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9296-9306
[3]   nuScenes: A multimodal dataset for autonomous driving [J].
Caesar, Holger ;
Bankiti, Varun ;
Lang, Alex H. ;
Vora, Sourabh ;
Liong, Venice Erin ;
Xu, Qiang ;
Krishnan, Anush ;
Pan, Yu ;
Baldan, Giancarlo ;
Beijbom, Oscar .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11618-11628
[4]  
Casas S, 2019, Arxiv, DOI arXiv:1910.08233
[5]   Graph neural network and reinforcement learning for multi-agent cooperative control of connected autonomous vehicles [J].
Chen, Sikai ;
Dong, Jiqian ;
Ha, Paul ;
Li, Yujie ;
Labi, Samuel .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2021, 36 (07) :838-857
[6]   Knowledge-Embedded Routing Network for Scene Graph Generation [J].
Chen, Tianshui ;
Yu, Weihao ;
Chen, Riquan ;
Lin, Liang .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6156-6164
[7]  
Cho KYHY, 2014, Arxiv, DOI arXiv:1409.1259
[8]  
Cho KYHY, 2014, Arxiv, DOI [arXiv:1406.1078, DOI 10.3115/V1/D14-1179, 10.48550/arXiv.1406.1078]
[9]  
Cong Yuren, 2021, ICCV, P16372
[10]   A (sub)graph isomorphism algorithm for matching large graphs [J].
Cordella, LP ;
Foggia, P ;
Sansone, C ;
Vento, M .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (10) :1367-1372