TFEdet: Efficient Multi-Frame 3D Object Detector via Proposal-Centric Temporal Feature Extraction

Cited by: 0
Authors
Kim, Jongho [1]
Sagong, Sungpyo [1]
Yi, Kyongsu [1]
Affiliations
[1] Seoul Natl Univ, Dept Mech Engn, Seoul 08826, South Korea
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Proposals; Feature extraction; Point cloud compression; Detectors; Three-dimensional displays; Autonomous vehicles; Transformers; Convolution; Object detection; Laser radar; 3D object detection; multi-frame detection; autonomous driving; LiDAR point cloud; gated recurrent unit;
DOI
10.1109/ACCESS.2024.3482093
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
This paper proposes the Temporal Feature Extraction Detector (TFEdet), a novel deep learning-based multi-frame 3D object detector that efficiently uses temporal features from consecutive point clouds. To leverage previously processed frames, inter-frame bipartite matching is performed between the current detections from a pre-trained single-frame detector and the predicted prior detections, taking ego-motion into account. Based on this inter-frame association, two types of proposed temporal features are accumulated: temporal proposal features, which are aggregated single-frame features of proposals, and inter-frame proposal features, which contain explicit information between frames. The collected temporal features are then temporally encoded in a Gated Recurrent Unit (GRU)-based temporal feature extraction head and added as residuals to the current-frame proposals, yielding the final detection. In performance evaluations on the nuScenes dataset, the proposed TFEdet, which processes relatively few point clouds, handles more than twice as many frames per second as previous multi-frame detectors while still demonstrating competitive detection performance through effective use of temporal proposal features.
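The inter-frame association step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes planar ego-motion, a constant-velocity prediction of prior detections, and a center-distance cost, none of which the abstract specifies. All names (`to_current_frame`, `predict_prior`, `match`, `max_dist`) are hypothetical, and the assignment is found by brute force, which is only practical for a handful of boxes; a real system would more likely use the Hungarian algorithm.

```python
import math
from itertools import permutations


def to_current_frame(center, ego_dx, ego_dy, ego_dyaw):
    """Express a prior-frame (x, y) center in the current ego frame.

    (ego_dx, ego_dy, ego_dyaw) is the pose of the current ego frame
    expressed in the prior ego frame (assumption: planar motion).
    """
    x, y = center[0] - ego_dx, center[1] - ego_dy
    c, s = math.cos(ego_dyaw), math.sin(ego_dyaw)
    # Apply the inverse rotation (by -ego_dyaw) after removing the translation.
    return (c * x + s * y, -s * x + c * y)


def predict_prior(center, velocity, dt, ego_motion):
    """Constant-velocity prediction of a prior detection, then ego-motion compensation."""
    moved = (center[0] + velocity[0] * dt, center[1] + velocity[1] * dt)
    return to_current_frame(moved, *ego_motion)


def match(current, prior, max_dist=2.0):
    """Minimum-total-distance one-to-one matching between two sets of centers.

    Returns sorted (current_index, prior_index) pairs. Pairs farther apart
    than max_dist (a hypothetical gate, in meters) are treated as unmatched.
    """
    if not current or not prior:
        return []
    # Enumerate assignments of the smaller set into the larger one.
    swap = len(current) > len(prior)
    small, large = (prior, current) if swap else (current, prior)
    best_cost, best_perm = math.inf, ()
    for perm in permutations(range(len(large)), len(small)):
        cost = sum(math.dist(small[i], large[j]) for i, j in enumerate(perm))
        if cost < best_cost:
            best_cost, best_perm = cost, perm
    pairs = []
    for i, j in enumerate(best_perm):
        if math.dist(small[i], large[j]) <= max_dist:
            pairs.append((j, i) if swap else (i, j))
    return sorted(pairs)


# Toy example: the ego vehicle moved 1 m forward over 0.5 s without turning.
ego_motion = (1.0, 0.0, 0.0)
prior_boxes = [((10.0, 0.0), (0.0, 0.0)),   # static object
               ((20.0, 5.0), (2.0, 0.0))]   # object moving at 2 m/s along x
predicted = [predict_prior(c, v, 0.5, ego_motion) for c, v in prior_boxes]
current = [(20.1, 5.0), (9.2, 0.1), (50.0, 50.0)]
pairs = match(current, predicted)
```

In this toy setup the third current detection has no nearby prior detection and stays unmatched, which in a multi-frame detector would correspond to a proposal with no accumulated temporal features yet.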
Pages: 154526 - 154534
Page count: 9