Context-Aware 3D Object Detection From a Single Image in Autonomous Driving

被引：8

作者：

Zhou, Dingfu ^{[1
,2
]}

Song, Xibin ^{[1
,2
]}

Fang, Jin ^{[1
,2
]}

Dai, Yuchao ^{[3
]}

Li, Hongdong ^{[4
]}

Zhang, Liangjun ^{[1
,2
]}

机构：

[1] Baidu Res, Robot & Autonomous Driving Lab, Beijing 100085, Peoples R China

[2] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100193, Peoples R China

[3] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710060, Peoples R China

[4] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT 0200, Australia

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2022年 / 23卷 / 10期

基金：

澳大利亚研究理事会; 中国国家自然科学基金;

关键词：

Three-dimensional displays; Object detection; Training; Feature extraction; Task analysis; Sensors; Detectors; Monocular 3D object detection; context-aware feature aggregation; self-attention; RECOGNITION; MODEL;

D O I：

10.1109/TITS.2022.3154022

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Camera sensors have been widely used in Driver-Assistance and Autonomous Driving Systems due to their rich texture information. Recently, with the development of deep learning techniques, many approaches have been proposed to detect objects in 3D from a single frame, however, there is still much room for improvement. In this paper, we generally review the recently proposed state-of-the-art monocular-based 3D object detection approaches first. Based on the analysis of the disadvantage of previous center-based frameworks, a novel feature aggregation strategy has been proposed to boost the 3D object detection by exploring the context information. Specifically, an Instance-Guided Spatial Attention (IGSA) module is proposed to collect the local instance information and the Channel-Wise Feature Attention (CWFA) module is employed for aggregating the global context information. In addition, an instance-guided object regression strategy is also proposed to alleviate the influence of center location prediction uncertainty in the inference process. Finally, the proposed approach has been verified on the public 3D object detection benchmark. The experimental results show that the proposed approach can significantly boost the performance of the baseline method on both 3D detection and 2D Bird's-Eye View among all three categories. Furthermore, our method outperforms all the monocular-based methods (even these trained with depth as auxiliary inputs) and achieves state-of-the-art performance on the KITTI benchmark.

引用

页码：18568 / 18580

页数：13

共 50 条

[1] Ground-Aware Monocular 3D Object Detection for Autonomous Driving
Liu, Yuxuan
Yixuan, Yuan
Liu, Ming
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 919 - 926
[2] Shape-Aware Monocular 3D Object Detection
Chen, Wei
Zhao, Jie
Zhao, Wan-Lei
Wu, Song-Yuan
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6416 - 6424
[3] Context-Aware Dynamic Feature Extraction for 3D Object Detection in Point Clouds
Tian, Yonglin
Huang, Lichao
Yu, Hui
Wu, Xiangbin
Li, Xuesong
Wang, Kunfeng
Wang, Zilei
Wang, Fei-Yue
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 10773 - 10785
[4] RI-Fusion: 3D Object Detection Using Enhanced Point Features With Range-Image Fusion for Autonomous Driving
Zhang, Xinyu
Wang, Li
Zhang, Guoxin
Lan, Tianwei
Zhang, Haoming
Zhao, Lijun
Li, Jun
Zhu, Lei
Liu, Huaping
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
[5] Density Aware 3D Object Single Stage Detector
Ning, Jingmei
Da, Feipeng
Gai, Shaoyan
IEEE SENSORS JOURNAL, 2021, 21 (20) : 23108 - 23117
[6] Exploring Diversity-Based Active Learning for 3D Object Detection in Autonomous Driving
Lin, Jinpeng
Liang, Zhihao
Deng, Shengheng
Cai, Lile
Jiang, Tao
Li, Tianrui
Jia, Kui
Xu, Xun
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 15454 - 15466
[7] Deep Learning-Based Image 3-D Object Detection for Autonomous Driving: Review
Alaba, Simegnew Yihunie
Ball, John E.
IEEE SENSORS JOURNAL, 2023, 23 (04) : 3378 - 3394
[8] Context-aware 3D object anchoring for mobile robots
Guenther, Martin
Ruiz-Sarmiento, J. R.
Galindo, Cipriano
Gonzalez-Jimenez, Javier
Hertzberg, Joachim
ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 110 : 12 - 32
[9] Bridging Multi-Scale Context-Aware Representation for Object Detection
Wang, Boying
Ji, Ruyi
Zhang, Libo
Wu, Yanjun
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) : 2317 - 2329
[10] Performance and Challenges of 3D Object Detection Methods in Complex Scenes for Autonomous Driving
Wang, Ke
Zhou, Tianqiang
Li, Xingcan
Ren, Fan
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (02): : 1699 - 1716

← 1 2 3 4 5 →