CenterFormer: Center-Based Transformer for 3D Object Detection

Cited by: 61
Authors
Zhou, Zixiang [1, 2]
Zhao, Xiangchen [1]
Wang, Yu [1]
Wang, Panqu [1]
Foroosh, Hassan [2]
Affiliations
[1] TuSimple, San Diego, CA 92122 USA
[2] Univ Cent Florida, Computat Imaging Lab, Orlando, FL 32816 USA
Source
COMPUTER VISION, ECCV 2022, PT XXXVIII | 2022 / Vol. 13698
Keywords
LiDAR point cloud; 3D object detection; Transformer; Multi-frame fusion;
DOI
10.1007/978-3-031-19839-7_29
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Query-based transformers have shown great potential in constructing long-range attention in many image-domain tasks, but have rarely been considered in LiDAR-based 3D object detection due to the overwhelming size of the point cloud data. In this paper, we propose CenterFormer, a center-based transformer network for 3D object detection. CenterFormer first uses a center heatmap to select center candidates on top of a standard voxel-based point cloud encoder. It then uses the feature of each center candidate as the query embedding in the transformer. To further aggregate features from multiple frames, we design an approach to fuse features through cross-attention. Lastly, regression heads are added to predict the bounding box on the output center feature representation. Our design reduces the convergence difficulty and computational complexity of the transformer structure. The results show significant improvements over the strong baseline of anchor-free object detection networks. CenterFormer achieves state-of-the-art performance for a single model on the Waymo Open Dataset, with 73.7% mAPH on the validation set and 75.6% mAPH on the test set, significantly outperforming all previously published CNN and transformer-based methods. Our code is publicly available at https://github.com/TuSimple/centerformer
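The first two steps in the abstract (selecting center candidates from a heatmap, then using the feature at each candidate as a transformer query) can be illustrated with a minimal sketch. This is not the authors' implementation: the top-k selection rule, the function name `select_center_queries`, and the nested-list layout of the heatmap and feature map are assumptions made for illustration only. The abstract does not state how candidates are chosen.

```python
import heapq


def select_center_queries(heatmap, features, k):
    """Pick the k highest-scoring cells and return their feature vectors.

    heatmap:  H x W nested list of center scores (hypothetical layout)
    features: H x W nested list of per-cell feature vectors
    Top-k selection is an assumption; the abstract only says that a
    center heatmap is used to select center candidates.
    """
    cells = [
        (score, row, col)
        for row, scores in enumerate(heatmap)
        for col, score in enumerate(scores)
    ]
    # nlargest returns the k best cells ordered from highest score down.
    top = heapq.nlargest(k, cells)
    centers = [(row, col) for _, row, col in top]
    # The feature at each selected cell plays the role of a query embedding.
    queries = [features[row][col] for row, col in centers]
    return centers, queries


# Toy 2 x 3 heatmap with a 2-dimensional feature per cell.
heatmap = [
    [0.1, 0.9, 0.2],
    [0.3, 0.4, 0.8],
]
features = [
    [[1.0, 0.0], [2.0, 0.0], [3.0, 0.0]],
    [[4.0, 0.0], [5.0, 0.0], [6.0, 0.0]],
]
centers, queries = select_center_queries(heatmap, features, k=2)
```

In this toy input the two selected cells are the ones scoring 0.9 and 0.8, so the number of queries is fixed by `k` rather than by the size of the feature map, which is consistent with the abstract's claim of reduced computational complexity.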
Pages: 496-513
Page count: 18
Related papers
50 in total
  • [21] DS-Trans: A 3D Object Detection Method Based on a Deformable Spatiotemporal Transformer for Autonomous Vehicles
    Zhu, Yuan
    Xu, Ruidong
    Tao, Chongben
    An, Hao
    Wang, Huaide
    Sun, Zhipeng
    Lu, Ke
    REMOTE SENSING, 2024, 16 (09)
  • [22] Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection for Autonomous Driving
    Yuan, Zhenxun
    Song, Xiao
    Bai, Lei
    Wang, Zhe
    Ouyang, Wanli
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2068 - 2078
  • [23] Voxel Transformer with Density-Aware Deformable Attention for 3D Object Detection
    Kim, Taeho
    Kim, Joohee
    SENSORS, 2023, 23 (16)
  • [24] DAFDeTr: Deformable Attention Fusion Based 3D Detection Transformer
    Erabati, Gopi Krishna
    Araujo, Helder
    ROBOTICS, COMPUTER VISION AND INTELLIGENT SYSTEMS, ROBOVIS 2024, 2024, 2077 : 293 - 315
  • [25] DVST: Deformable Voxel Set Transformer for 3D Object Detection from Point Clouds
    Ning, Yaqian
    Cao, Jie
    Bao, Chun
    Hao, Qun
    REMOTE SENSING, 2023, 15 (23)
  • [26] AFTR: A Robustness Multi-Sensor Fusion Model for 3D Object Detection Based on Adaptive Fusion Transformer
    Zhang, Yan
    Liu, Kang
    Bao, Hong
    Qian, Xu
    Wang, Zihan
    Ye, Shiqing
    Wang, Weicen
    SENSORS, 2023, 23 (20)
  • [27] Long-Short Range Adaptive Transformer With Dynamic Sampling for 3D Object Detection
    Wang, Chuxin
    Deng, Jiacheng
    He, Jianfeng
    Zhang, Tianzhu
    Zhang, Zhe
    Zhang, Yongdong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7616 - 7629
  • [28] RVT: Robotic View Transformer for 3D Object Manipulation
    Goyal, Ankit
    Xu, Jie
    Guo, Yijie
    Blukis, Valts
    Chao, Yu-Wei
    Fox, Dieter
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [29] TBFNT3D: Two-Branch Fusion Network With Transformer for Multimodal Indoor 3D Object Detection
    Cheng, Jun
    Zhang, Sheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6523 - 6530
  • [30] 3D object detection network based on symmetric shape generation
    Tu X.
    Zheng S.
    Yu S.
    Li W.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2023, 44 (06) : 252 - 263