PillarVTP: vehicle trajectory prediction method based on local point cloud aggregation and receptive field expansion

被引:0
作者
Liao, Zhuhua [1 ]
Yang, Jiyuan [1 ]
Zhao, Yijiang [1 ]
Liu, Yizhi [1 ]
Zhang, Hui [1 ]
机构
[1] Hunan Univ Sci Technol, Dept Comp Sci & Engn, Xiangtan 411201, Hunan, Peoples R China
关键词
Trajectory prediction; Object detection; Receptive field; Point cloud;
D O I
10.1007/s00530-024-01521-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Vehicle trajectory prediction plays a crucial role in the control and safety warning of autonomous vehicles. Existing methods often depend on costly high definition (HD) maps for generating trajectories to fit their scenarios, or involve inefficient aggregation of local point clouds into voxels. Therefore, an end-to-end vehicle trajectory prediction method (PillarVTP) is proposed based on local point cloud aggregation and receptive field expansion. Firstly, we construct a novel pillar-based object detection network, introducing SPPCSPC which uses max pooling layers with multiple kernel sizes on a single feature level as the neck for extracting multi-scale features, and improving ResNet-18 by adding a depth stage to expand the receptive field at multiple levels. Then, we present performing feature upsampling to improve performance before predicting vehicle positions. And a shallow convolutional network is utilized to implement the future feature learning network, which learns future features from the previous features for predicting vehicle positions in future frames. Subsequently, the positions of vehicles are matched greedily from future frames to the current frame, and the matched future trajectories are associated with the vehicles detected in the current frame. Finally, the proposed PillarVTP is evaluated on the nuScenes and Argoverse 1 datasets. Experimental results demonstrate that PillarVTP outperforms recent end-to-end prediction method based on point cloud data, FutureDet, by 3.4% and surpasses traditional multi-stage method, Trajectron + + , by 13.7%. Furthermore, PillarVTP shows good robustness under various weather conditions.
引用
收藏
页数:10
相关论文
共 36 条
  • [1] Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving
    Ben Agro
    Sykora, Quinlan
    Casas, Sergio
    Urtasun, Raquel
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1379 - 1388
  • [2] Caesar H, 2020, PROC CVPR IEEE, P11618, DOI 10.1109/CVPR42600.2020.01164
  • [3] Environment-Attention Network for Vehicle Trajectory Prediction
    Cai, Yingfeng
    Wang, Zihao
    Wang, Hai
    Chen, Long
    Li, Yicheng
    Sotelo, Miguel Angel
    Li, Zhixiong
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (11) : 11216 - 11227
  • [4] Argoverse: 3D Tracking and Forecasting with Rich Maps
    Chang, Ming-Fang
    Lambert, John
    Sangkloy, Patsorn
    Singh, Jagjeet
    Bak, Slawomir
    Hartnett, Andrew
    Wang, De
    Carr, Peter
    Lucey, Simon
    Ramanan, Deva
    Hays, James
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8740 - 8749
  • [5] Convolutional Social Pooling for Vehicle Trajectory Prediction
    Deo, Nachiket
    Trivedi, Mohan M.
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 1549 - 1557
  • [6] ST-SIGMA: Spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting
    Fang, Yang
    Luo, Bei
    Zhao, Ting
    He, Dong
    Jiang, Bingbing
    Liu, Qilie
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2022, 7 (04) : 744 - 757
  • [7] Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks
    Gupta, Agrim
    Johnson, Justin
    Li Fei-Fei
    Savarese, Silvio
    Alahi, Alexandre
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2255 - 2264
  • [8] Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) : 1904 - 1916
  • [9] PointPillars: Fast Encoders for Object Detection from Point Clouds
    Lang, Alex H.
    Vora, Sourabh
    Caesar, Holger
    Zhou, Lubing
    Yang, Jiong
    Beijbom, Oscar
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12689 - 12697
  • [10] CornerNet: Detecting Objects as Paired Keypoints
    Law, Hei
    Deng, Jia
    [J]. COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 765 - 781