Improving 3D Object Detection with Context-Aware and Dimensional Interaction Attention

被引：0

作者：

Jing Zhou

Zixin Gong

Junchi Zhang

机构：

[1] Jianghan University,School of Artificial Intelligence

来源：

Neural Processing Letters | / 56卷

关键词：

3D object detection; Attention mechanism; Contextual information; Dimensional interaction attention;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Recently, 3D object detection technology based on point clouds has developed rapidly. However, too few points of distant and occluded objects are scanned by the sensor, and thus these objects suffer from too insufficient features to be detected. This case damages the detection accuracy. Therefore, we constitute a novel 3D object detection with Context-aware and dimensional Interaction Attention Network (CIANet) to explore vital geometric cues for enriching the feature representation of the object, thus boosting the overall detection performance. Specifically, in the first stage, we employ the 3D sparse convolution to extract voxel features, and then construct a Channel-Spatial Hybrid Attention (CSHA) module and a Contextual Self-Attention (CSA) module to enhance voxel features for generating proposals. The CSHA module aims to enhance the key information of the channel and spatial domains of 2D Bird’s Eye View (BEV) features, and the CSA module is applied to supplement contextual information to the enhanced BEV features, thus generating accurate proposals. In the second stage, we construct a Dimensional Interaction Attention (DIA) module to refine Region of Interest (RoI) features within the proposals. It enhances the interactions among the channel and spatial dimensions of RoI features to learn accurate boundaries of objects for proposal refinement. Extensive experiments on the KITTI and Waymo benchmarks show the superior detection performance of CIANet compared to recent methods, especially for objects such as pedestrians and cyclists.

引用

共 50 条

[1] Improving 3D Object Detection with Context-Aware and Dimensional Interaction Attention
Zhou, Jing
Gong, Zixin
Zhang, Junchi
NEURAL PROCESSING LETTERS, 2024, 56 (01)
[2] CONTEXT-AWARE DATA AUGMENTATION FOR LIDAR 3D OBJECT DETECTION
Hu, Xuzhong
Duan, Zaipeng
Huang, Xiao
Xu, Ziwen
Ming, Delie
Ma, Jie
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 11 - 15
[3] SA-Det3D: Self-Attention Based Context-Aware 3D Object Detection
Bhattacharyya, Prarthana
Huang, Chengjie
Czarnecki, Krzysztof
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3022 - 3031
[4] Context-Aware 3D Object Streaming for Mobile Games
Rahimi, Hesam
Shirehjini, Ali Asghar Nazari
Shirmohammadi, Shervin
2011 10TH ANNUAL WORKSHOP ON NETWORK AND SYSTEMS SUPPORT FOR GAMES (NETGAMES 2011), 2011,
[5] Context-aware 3D object anchoring for mobile robots
Guenther, Martin
Ruiz-Sarmiento, J. R.
Galindo, Cipriano
Gonzalez-Jimenez, Javier
Hertzberg, Joachim
ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 110 : 12 - 32
[6] Context-Aware Dynamic Feature Extraction for 3D Object Detection in Point Clouds
Tian, Yonglin
Huang, Lichao
Yu, Hui
Wu, Xiangbin
Li, Xuesong
Wang, Kunfeng
Wang, Zilei
Wang, Fei-Yue
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 10773 - 10785
[7] Context-Aware 3D Object Detection From a Single Image in Autonomous Driving
Zhou, Dingfu
Song, Xibin
Fang, Jin
Dai, Yuchao
Li, Hongdong
Zhang, Liangjun
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 18568 - 18580
[8] Context-Aware 3D Points of Interest Detection via Spatial Attention Mechanism
Shu, Zhenyu
Gao, Ling
Yi, Shun
Wu, Fangyu
Ding, Xin
Wan, Ting
Xin, Shiqing
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
[9] MCGNET: MULTI-LEVEL CONTEXT-AWARE AND GEOMETRIC-AWARE NETWORK FOR 3D OBJECT DETECTION
Chen, Keng
Zhou, Feng
Dai, Ju
Shen, Pei
Cai, Xingquan
Zhang, Fengquan
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1846 - 1850
[10] Global Context-Aware Attention LSTM Networks for 3D Action Recognition
Liu, Jun
Wang, Gang
Hu, Ping
Duan, Ling-Yu
Kot, Alex C.
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3671 - 3680

← 1 2 3 4 5 →