TFEdet: Efficient Multi-Frame 3D Object Detector via Proposal-Centric Temporal Feature Extraction

被引：0

作者：

Kim, Jongho ^{[1
]}

Sagong, Sungpyo ^{[1
]}

Yi, Kyongsu ^{[1
]}

机构：

[1] Seoul Natl Univ, Dept Mech Engn, Seoul 08826, South Korea

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Proposals; Feature extraction; Point cloud compression; Detectors; Three-dimensional displays; Autonomous vehicles; Transformers; Convolution; Object detection; Laser radar; 3D object detection; multi-frame detection; autonomous driving; LiDAR point cloud; gated recurrent unit;

D O I：

10.1109/ACCESS.2024.3482093

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper proposes the Temporal Feature Extraction Detector (TFEdet), a novel deep learning-based 3D multi-frame object detector efficiently utilizing temporal features from consecutive point clouds. To leverage previously processed frames, inter-frame bipartite matching is performed between current detections from a pre-trained single-frame detector and predicted prior detections, while considering the ego-motion. Subsequently, based on inter-frame association, two types of proposed temporal features are accumulated: temporal proposal features, which are aggregated single-frame features of proposals, and inter-frame proposal features, which containing explicit information between frames. These collected temporal features are then temporally encoded in a Gated Recurrent Unit (GRU)-based temporal feature extraction head and added as residuals to the current frame proposals, leading to the final detection. In performance evaluations on the nuScenes dataset, the proposed TFEdet, which processes a relatively smaller number of point clouds, handles more than twice the frames per second compared to previous multi-frame detectors and still demonstrates competitive detection performance through effective utilization of temporal proposal features.

引用

页码：154526 / 154534

页数：9

共 47 条

[41] Mental Disease Feature Extraction with MRI by 3D Convolutional Neural Network with Multi-Channel Input
Cao, Lijun
Liu, Zhi
Cao, Yankun
Li, Kening
He, Xiaofu
2016 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2016, : 224 - 227
[42] A novel multi-model 3D object detection framework with adaptive voxel-image feature fusion
Liu, Zhao
Fu, Zhongliang
Li, Gang
Zhang, Shengyuan
IET COMPUTER VISION, 2024, 18 (05) : 640 - 651
[43] Real-time 3D reconstruction system using multi-task feature extraction network and surfel
Li, Guangqiang
Hou, Junyi
Chen, Zhong
Yu, Lei
Fei, Shumin
OPTICAL ENGINEERING, 2021, 60 (08)
[44] CenterCoop: Center-Based Feature Aggregation for Communication-Efficient Vehicle-Infrastructure Cooperative 3D Object Detection
Zhou, Linyi
Gan, Zhongxue
Fan, Jiayuan
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (04) : 3570 - 3577
[45] Feature Line Extraction from 3D Model of Oblique Photogrammetry Based on Multi-Objective Weighted Shortest Path
Zhu Q.
Shang Q.
Hu H.
Yu H.
Zhong R.
Ding Y.
Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2021, 56 (01): : 116 - 122
[46] Voxel-FPN: Multi-Scale Voxel Feature Aggregation for 3D Object Detection from LIDAR Point Clouds
Kuang, Hongwu
Wang, Bei
An, Jianping
Zhang, Ming
Zhang, Zehan
SENSORS, 2020, 20 (03)
[47] SMS-Net: Sparse multi-scale voxel feature aggregation network for LiDAR-based 3D object detection
Liu, Sheng
Huang, Wenhao
Cao, Yifeng
Li, Dingda
Chen, Shengyong
NEUROCOMPUTING, 2022, 501 : 555 - 565

← 1 2 3 4 5 →