TFEdet: Efficient Multi-Frame 3D Object Detector via Proposal-Centric Temporal Feature Extraction

Cited by: 0
Authors
Kim, Jongho [1]
Sagong, Sungpyo [1]
Yi, Kyongsu [1]
Affiliations
[1] Seoul Natl Univ, Dept Mech Engn, Seoul 08826, South Korea
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Proposals; Feature extraction; Point cloud compression; Detectors; Three-dimensional displays; Autonomous vehicles; Transformers; Convolution; Object detection; Laser radar; 3D object detection; multi-frame detection; autonomous driving; LiDAR point cloud; gated recurrent unit;
DOI
10.1109/ACCESS.2024.3482093
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper proposes the Temporal Feature Extraction Detector (TFEdet), a novel deep learning-based 3D multi-frame object detector that efficiently utilizes temporal features from consecutive point clouds. To leverage previously processed frames, inter-frame bipartite matching is performed between current detections from a pre-trained single-frame detector and predicted prior detections, while accounting for ego-motion. Subsequently, based on the inter-frame association, two types of proposed temporal features are accumulated: temporal proposal features, which are aggregated single-frame features of proposals, and inter-frame proposal features, which contain explicit information between frames. These collected temporal features are then temporally encoded in a Gated Recurrent Unit (GRU)-based temporal feature extraction head and added as residuals to the current-frame proposals, leading to the final detection. In performance evaluations on the nuScenes dataset, the proposed TFEdet, which processes a relatively small number of point clouds, handles more than twice as many frames per second as previous multi-frame detectors and still demonstrates competitive detection performance through effective utilization of temporal proposal features.
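The inter-frame association step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record gives no details on the cost function, motion model, or matching solver, so everything below is an assumption made for illustration. The sketch assumes bird's-eye-view box centers, a constant-velocity prediction of prior detections, a 2D ego-motion given as `(dx, dy, dyaw)` (pose of the current ego frame expressed in the previous one), Euclidean center distance as the matching cost, and a brute-force optimal assignment standing in for a Hungarian-style solver. The names `predict_prior`, `to_current_frame`, `match_detections`, and `max_dist` are hypothetical.

```python
import math
from itertools import permutations


def predict_prior(center, velocity, dt):
    """Constant-velocity prediction of a prior detection (assumed motion model)."""
    return (center[0] + velocity[0] * dt, center[1] + velocity[1] * dt)


def to_current_frame(xy, ego_delta):
    """Map a BEV point from the previous ego frame into the current ego frame.

    ego_delta = (dx, dy, dyaw) is the current ego pose expressed in the
    previous ego frame (assumed convention): p_cur = R(dyaw)^T (p_prev - t).
    """
    dx, dy, dyaw = ego_delta
    x, y = xy[0] - dx, xy[1] - dy
    c, s = math.cos(dyaw), math.sin(dyaw)
    return (c * x + s * y, -s * x + c * y)


def match_detections(current, predicted, max_dist=2.0):
    """One-to-one matching that minimizes the total center distance.

    Brute force over assignments, so only suitable for tiny examples.
    Pairs farther apart than max_dist (hypothetical gate) are dropped
    after the assignment. Returns sorted (current_idx, prior_idx) pairs.
    """
    n_cur, n_prev = len(current), len(predicted)
    if n_cur == 0 or n_prev == 0:
        return []
    cost = [[math.dist(c, p) for p in predicted] for c in current]
    best_total, best_pairs = float("inf"), []
    if n_cur <= n_prev:
        candidates = (list(zip(range(n_cur), perm))
                      for perm in permutations(range(n_prev), n_cur))
    else:
        candidates = (list(zip(perm, range(n_prev)))
                      for perm in permutations(range(n_cur), n_prev))
    for pairs in candidates:
        total = sum(cost[i][j] for i, j in pairs)
        if total < best_total:
            best_total, best_pairs = total, pairs
    return sorted((i, j) for i, j in best_pairs if cost[i][j] <= max_dist)


# Toy example: the ego vehicle moved 1 m forward between frames, dt = 0.5 s.
ego_delta = (1.0, 0.0, 0.0)
dt = 0.5
prior = [((10.0, 0.0), (2.0, 0.0)),   # (center, velocity) in the previous ego frame
         ((5.0, 3.0), (0.0, 0.0))]
predicted = [to_current_frame(predict_prior(c, v, dt), ego_delta) for c, v in prior]
current = [(4.1, 3.0), (10.2, 0.1), (30.0, -5.0)]

matches = match_detections(current, predicted)
print(matches)  # [(0, 1), (1, 0)]; current detection 2 stays unmatched
```

In this toy setup, matched pairs would be the ones whose features are accumulated over time, while an unmatched current detection would start a new history. How TFEdet actually handles unmatched proposals is not stated in this record.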
Pages: 154526-154534
Page count: 9