PillarVTP: vehicle trajectory prediction method based on local point cloud aggregation and receptive field expansion

Cited by: 0

Authors:

Liao, Zhuhua [1]
Yang, Jiyuan [1]
Zhao, Yijiang [1]
Liu, Yizhi [1]
Zhang, Hui [1]

Affiliations:

[1] Hunan Univ Sci Technol, Dept Comp Sci & Engn, Xiangtan 411201, Hunan, Peoples R China

Source:

MULTIMEDIA SYSTEMS | 2024 / Vol. 30 / Issue 06

Keywords:

Trajectory prediction; Object detection; Receptive field; Point cloud

DOI:

10.1007/s00530-024-01521-7

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Vehicle trajectory prediction plays a crucial role in the control and safety warning of autonomous vehicles. Existing methods often depend on costly high-definition (HD) maps to generate trajectories that fit their scenarios, or aggregate local point clouds into voxels inefficiently. We therefore propose PillarVTP, an end-to-end vehicle trajectory prediction method based on local point cloud aggregation and receptive field expansion. First, we construct a novel pillar-based object detection network that introduces SPPCSPC, which applies max pooling layers with multiple kernel sizes to a single feature level, as the neck for extracting multi-scale features, and that improves ResNet-18 by adding a depth stage to expand the receptive field at multiple levels. Next, we upsample the features before predicting vehicle positions to improve performance, and use a shallow convolutional network as the future feature learning network, which learns future features from previous features to predict vehicle positions in future frames. The vehicle positions are then matched greedily from the future frames to the current frame, and the matched future trajectories are associated with the vehicles detected in the current frame. Finally, we evaluate PillarVTP on the nuScenes and Argoverse 1 datasets. Experimental results show that PillarVTP outperforms FutureDet, a recent end-to-end prediction method based on point cloud data, by 3.4% and surpasses Trajectron++, a traditional multi-stage method, by 13.7%. PillarVTP also shows good robustness under various weather conditions.
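The greedy matching step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `greedy_match`, the `(x, y)` position format, and the `max_dist` threshold are hypothetical, and the sketch assumes that each future-frame position has already been expressed in the current frame's coordinates, which the abstract does not specify.

```python
import math


def greedy_match(current, future, max_dist=2.0):
    """Greedily pair future positions with current detections.

    current, future: lists of (x, y) positions (hypothetical format).
    max_dist: assumed gating distance; pairs farther apart are ignored.
    Returns a dict mapping future index -> current index.
    """
    # Collect every candidate pair within the gate, closest first.
    pairs = sorted(
        (math.dist(f, c), fi, ci)
        for fi, f in enumerate(future)
        for ci, c in enumerate(current)
        if math.dist(f, c) <= max_dist
    )
    matches, used_current = {}, set()
    for _, fi, ci in pairs:
        # Skip pairs whose future or current detection is already taken.
        if fi in matches or ci in used_current:
            continue
        matches[fi] = ci
        used_current.add(ci)
    return matches


current = [(0.0, 0.0), (10.0, 0.0)]
future = [(9.5, 0.5), (0.4, 0.0), (30.0, 30.0)]
print(greedy_match(current, future))
```

In this toy input the third future position lies outside the gate, so it stays unmatched. A greedy pass like this is simple and fast but is not guaranteed to find the globally optimal assignment.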

Pages: 10
