Improving 3D Object Detection with Context-Aware and Dimensional Interaction Attention

Cited by: 0
Authors
Jing Zhou
Zixin Gong
Junchi Zhang
Affiliation
[1] Jianghan University, School of Artificial Intelligence
Source
Neural Processing Letters | Vol. 56
Keywords
3D object detection; Attention mechanism; Contextual information; Dimensional interaction attention;
DOI
Not available
Abstract
Recently, 3D object detection based on point clouds has developed rapidly. However, the sensor captures too few points on distant and occluded objects, so these objects yield too few features to be detected reliably, which lowers detection accuracy. Therefore, we propose a novel 3D object detection network, the Context-aware and dimensional Interaction Attention Network (CIANet), which explores vital geometric cues to enrich the feature representation of objects and thereby boosts overall detection performance. Specifically, in the first stage, we employ 3D sparse convolution to extract voxel features, and then construct a Channel-Spatial Hybrid Attention (CSHA) module and a Contextual Self-Attention (CSA) module to enhance these features for proposal generation. The CSHA module enhances the key information in the channel and spatial domains of 2D Bird’s Eye View (BEV) features, and the CSA module supplements the enhanced BEV features with contextual information, thus generating accurate proposals. In the second stage, we construct a Dimensional Interaction Attention (DIA) module to refine Region of Interest (RoI) features within the proposals. It strengthens the interactions among the channel and spatial dimensions of RoI features to learn accurate object boundaries for proposal refinement. Extensive experiments on the KITTI and Waymo benchmarks show that CIANet outperforms recent methods, especially for objects such as pedestrians and cyclists.
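The abstract does not specify how the CSHA module is built, so the following is only a minimal sketch of the general idea of gating a BEV feature map first along channels and then along spatial cells. It is a generic channel-then-spatial gate and not the authors' implementation. The function name channel_spatial_gate, the weight shapes, and the choice of mean and max pooling are all assumptions made for illustration.

```python
import numpy as np


def sigmoid(x):
    """Elementwise logistic function, used to produce gates in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def channel_spatial_gate(bev, w_channel, w_spatial):
    """Apply a channel gate, then a spatial gate, to a BEV feature map.

    bev:       array of shape (C, H, W), the BEV features
    w_channel: array of shape (C, C), hypothetical channel-mixing weights
    w_spatial: array of shape (2,), hypothetical weights for the mean/max maps
    """
    # Channel gate: global average pool over H and W, linear map, sigmoid.
    pooled = bev.mean(axis=(1, 2))                 # (C,)
    ch_gate = sigmoid(w_channel @ pooled)          # (C,)
    x = bev * ch_gate[:, None, None]               # broadcast over H, W

    # Spatial gate: per-cell mean and max over channels, mixed by two weights.
    stats = np.stack([x.mean(axis=0), x.max(axis=0)])          # (2, H, W)
    sp_gate = sigmoid(np.tensordot(w_spatial, stats, axes=1))  # (H, W)
    return x * sp_gate[None, :, :]                 # broadcast over channels


# Example with random inputs (assumed sizes: 4 channels on a 6x6 grid).
rng = np.random.default_rng(0)
bev = rng.standard_normal((4, 6, 6))
out = channel_spatial_gate(bev, rng.standard_normal((4, 4)), rng.standard_normal(2))
print(out.shape)  # (4, 6, 6)
```

Because both gates lie in (0, 1), the output keeps the input shape and only rescales each feature; a learned version would train w_channel and w_spatial together with the rest of the detector.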
Related papers
50 in total
  • [1] Improving 3D Object Detection with Context-Aware and Dimensional Interaction Attention
    Zhou, Jing
    Gong, Zixin
    Zhang, Junchi
    NEURAL PROCESSING LETTERS, 2024, 56 (01)
  • [2] CONTEXT-AWARE DATA AUGMENTATION FOR LIDAR 3D OBJECT DETECTION
    Hu, Xuzhong
    Duan, Zaipeng
    Huang, Xiao
    Xu, Ziwen
    Ming, Delie
    Ma, Jie
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 11 - 15
  • [3] SA-Det3D: Self-Attention Based Context-Aware 3D Object Detection
    Bhattacharyya, Prarthana
    Huang, Chengjie
    Czarnecki, Krzysztof
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3022 - 3031
  • [4] Context-Aware 3D Object Streaming for Mobile Games
    Rahimi, Hesam
    Shirehjini, Ali Asghar Nazari
    Shirmohammadi, Shervin
    2011 10TH ANNUAL WORKSHOP ON NETWORK AND SYSTEMS SUPPORT FOR GAMES (NETGAMES 2011), 2011,
  • [5] Context-aware 3D object anchoring for mobile robots
    Guenther, Martin
    Ruiz-Sarmiento, J. R.
    Galindo, Cipriano
    Gonzalez-Jimenez, Javier
    Hertzberg, Joachim
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 110 : 12 - 32
  • [6] Context-Aware Dynamic Feature Extraction for 3D Object Detection in Point Clouds
    Tian, Yonglin
    Huang, Lichao
    Yu, Hui
    Wu, Xiangbin
    Li, Xuesong
    Wang, Kunfeng
    Wang, Zilei
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 10773 - 10785
  • [7] Context-Aware 3D Object Detection From a Single Image in Autonomous Driving
    Zhou, Dingfu
    Song, Xibin
    Fang, Jin
    Dai, Yuchao
    Li, Hongdong
    Zhang, Liangjun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 18568 - 18580
  • [8] Context-Aware 3D Points of Interest Detection via Spatial Attention Mechanism
    Shu, Zhenyu
    Gao, Ling
    Yi, Shun
    Wu, Fangyu
    Ding, Xin
    Wan, Ting
    Xin, Shiqing
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [9] MCGNET: MULTI-LEVEL CONTEXT-AWARE AND GEOMETRIC-AWARE NETWORK FOR 3D OBJECT DETECTION
    Chen, Keng
    Zhou, Feng
    Dai, Ju
    Shen, Pei
    Cai, Xingquan
    Zhang, Fengquan
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1846 - 1850
  • [10] Global Context-Aware Attention LSTM Networks for 3D Action Recognition
    Liu, Jun
    Wang, Gang
    Hu, Ping
    Duan, Ling-Yu
    Kot, Alex C.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3671 - 3680