Query-based Temporal Fusion with Explicit Motion for 3D Object Detection

被引:0
作者
Hou, Jinghua [1 ]
Liu, Zhe [1 ]
Liang, Dingkang [1 ]
Zou, Zhikang [2 ]
Ye, Xiaoqing [2 ]
Bai, Xiang [1 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan, Hubei, Peoples R China
[2] Baidu Inc, Beijing, Peoples R China
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effectively utilizing temporal information to improve 3D detection performance is vital for autonomous driving vehicles. Existing methods either conduct temporal fusion based on the dense BEV features or sparse 3D proposal features. However, the former does not pay more attention to foreground objects, leading to more computation costs and sub-optimal performance. The latter implements time-consuming operations to generate sparse 3D proposal features, and the performance is limited by the quality of 3D proposals. In this paper, we propose a simple and effective Query-based Temporal Fusion Network (QTNet). The main idea is to exploit the object queries in previous frames to enhance the representation of current object queries by the proposed Motion-guided Temporal Modeling (MTM) module, which utilizes the spatial position information of object queries along the temporal dimension to construct their relevance between adjacent frames reliably. Experimental results show our proposed QTNet outperforms BEV-based or proposal-based manners on the nuScenes dataset. Besides, the MTM is a plug-and-play module, which can be integrated into some advanced LiDAR-only or multi-modality 3D detectors and even brings new SOTA performance with negligible computation cost and latency on the nuScenes dataset. These experiments powerfully illustrate the superiority and generalization of our method. The code is available at https://github.com/AlmoonYsl/QTNet.
引用
收藏
页数:16
相关论文
共 58 条
  • [1] Ba Jimmy Lei, 2016, ARXIV
  • [2] Bai Xuyang, 2022, IEEE C COMP VIS PATT
  • [3] Caesar Holger, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P11618, DOI 10.1109/CVPR42600.2020.01164
  • [4] Cryogenic-Temperature Thermodynamically Suppressed and Strongly Confined CsPbBr3 Quantum Dots for Deeply Blue Light-Emitting Diodes
    Cao, Jingjing
    Yan, Cheng
    Luo, Chao
    Li, Wen
    Zeng, Xiankan
    Xu, Zhong
    Fu, Xuehai
    Wang, Qing
    Chu, Xiang
    Huang, Haichao
    Zhao, Xiaoyun
    Lu, Jun
    Yang, Weiqing
    [J]. ADVANCED OPTICAL MATERIALS, 2021, 9 (17)
  • [5] Carion N., 2020, ECCV, P213
  • [6] Chen Xuesong, 2022, EUR C COMP VIS
  • [7] Chen Yukang, 2023, IEEE C COMP VIS PATT
  • [8] Chen Yukang, 2023, IEEE C COMP VIS PATT
  • [9] Deng Jiajun, 2021, AAAI C ART INT
  • [10] A Ka-Band Iris-Loaded Waveguide Slot Antenna With Enhanced Out-of-Band Suppression
    Deng, Shuai
    Li, Jin
    Yuan, Tao
    [J]. 2022 IEEE RADIO AND WIRELESS SYMPOSIUM (RWS), 2022, : 5 - 8