Group-Free 3D Object Detection via Transformers

Cited by: 139
Authors
Liu, Ze [1, 2, 3]
Zhang, Zheng [2 ]
Cao, Yue [2 ]
Hu, Han [2 ]
Tong, Xin [2 ]
Affiliations
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
[3] MSRA, Beijing, Peoples R China
DOI
10.1109/ICCV48922.2021.00294
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently, directly detecting 3D objects from 3D point clouds has received increasing attention. To extract object representation from an irregular point cloud, existing methods usually take a point grouping step to assign the points to an object candidate so that a PointNet-like network could be used to derive object features from the grouped points. However, the inaccurate point assignments caused by the hand-crafted grouping scheme decrease the performance of 3D object detection. In this paper, we present a simple yet effective method for directly detecting 3D objects from the 3D point cloud. Instead of grouping local points to each object candidate, our method computes the feature of an object from all the points in the point cloud with the help of an attention mechanism in the Transformers [42], where the contribution of each point is automatically learned in the network training. With an improved attention stacking scheme, our method fuses object features in different stages and generates more accurate object detection results. With few bells and whistles, the proposed method achieves state-of-the-art 3D object detection performance on two widely used benchmarks, ScanNet V2 and SUN RGB-D.
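The central idea in the abstract, pooling an object's feature from every point with learned attention weights instead of a hand-crafted grouping step, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single attention head, uses the raw features directly as queries, keys, and values (no learned projections), and leaves out the stacked decoder stages. The function names and the toy data are hypothetical.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend_to_all_points(candidate, point_features):
    """Single-head scaled dot-product attention (illustrative assumption).

    candidate: feature vector of one object candidate (the query).
    point_features: one feature vector per point (used as keys and values).
    Returns the per-point weights and the pooled object feature.
    """
    dim = len(candidate)
    scale = math.sqrt(dim)
    # Score every point in the cloud against the candidate; no grouping radius.
    scores = [sum(q * k for q, k in zip(candidate, point)) / scale
              for point in point_features]
    weights = softmax(scores)
    # The object feature is the weighted sum over all points.
    pooled = [sum(w * point[i] for w, point in zip(weights, point_features))
              for i in range(dim)]
    return weights, pooled

# Toy data: three points with 2-D features and one object candidate.
points = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
weights, pooled = attend_to_all_points([2.0, 0.0], points)
```

In this toy case the points whose features align with the candidate receive the largest weights, yet every point still contributes to the pooled feature, which is the contrast with assigning a fixed group of points to each candidate.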
Pages: 2929-2938
Page count: 10
Related papers
50 in total
  • [1] GFENet: Group-Free Enhancement Network for Indoor Scene 3D Object Detection
    Zhou, Feng
    Dai, Ju
    Pan, Junjun
    Zhu, Mengxiao
    Cai, Xingquan
    Huang, Bin
    Wang, Chen
    ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT III, 2024, 14497 : 119 - 136
  • [2] Real-Time Multimodal 3D Object Detection with Transformers
    Liu, Hengsong
    Duan, Tongle
    WORLD ELECTRIC VEHICLE JOURNAL, 2024, 15 (07):
  • [3] DAFormer: Depth-aware 3D Object Detection Guided by Camera Model via Transformers
    Gao, Junbin
    Ruan, Hao
    Xu, Bingrong
    Zeng, Zhigang
    2022 IEEE INTERNATIONAL CONFERENCE ON CYBORG AND BIONIC SYSTEMS, CBS, 2022, : 170 - 175
  • [4] AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers
    Li, Zechuan
    Yu, Hongshan
    Yang, Zhengeng
    Chen, Tongjia
    Akhtar, Naveed
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1012 - 1021
  • [5] DeepInteraction: 3D Object Detection via Modality Interaction
    Yang, Zeyu
    Chen, Jiaqi
    Miao, Zhenwei
    Li, Wei
    Zhu, Xiatian
    Zhang, Li
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [6] TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers
    Bai, Xuyang
    Hu, Zeyu
    Zhu, Xinge
    Huang, Qingqiu
    Chen, Yilun
    Fu, Hangbo
    Tai, Chiew-Lan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1080 - 1089
  • [7] MuTrans: Multiple Transformers for Fusing Feature Pyramid on 2D and 3D Object Detection
    Xie, Bangquan
    Yang, Liang
    Wei, Ailin
    Weng, Xiaoxiong
    Li, Bing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4407 - 4415
  • [8] Monocular 3D Object Detection: An Extrinsic Parameter Free Approach
    Zhou, Yunsong
    He, Yuan
    Zhu, Hongzi
    Wang, Cheng
    Li, Hongyang
    Jiang, Qinhong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7552 - 7562
  • [9] MonoEF: Extrinsic Parameter Free Monocular 3D Object Detection
    Zhou, Yunsong
    He, Yuan
    Zhu, Hongzi
    Wang, Cheng
    Li, Hongyang
    Jiang, Qinhong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 10114 - 10128
  • [10] Improving 3D Object Detection via Joint Attribute-oriented 3D Loss
    Ye, Zhen
    Xue, Jianru
    Dou, Jian
    Pan, Yuxin
    Fang, Jianwu
    Wang, Di
    Zheng, Nanning
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 951 - 956