BiTraP: Bi-Directional Pedestrian Trajectory Prediction With Multi-Modal Goal Estimation

被引:101
作者
Yao, Yu [1 ]
Atkins, Ella [2 ]
Johnson-Roberson, Matthew [3 ]
Vasudevan, Ram [4 ]
Du, Xiaoxiao [3 ]
机构
[1] Univ Michigan, Inst Robot, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Aerosp Engn, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Dept Naval Architecture & Marine Engn, Ann Arbor, MI 48109 USA
[4] Univ Michigan, Dept Mech Engn, Ann Arbor, MI 48109 USA
关键词
Computer vision for automation; human and humanoid motion analysis and synthesis; deep learning methods; multi-modal trajectory prediction; goal-conditioned prediction;
D O I
10.1109/LRA.2021.3056339
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Pedestrian trajectory prediction is an essential task in robotic applications such as autonomous driving and robot navigation. State-of-the-art trajectory predictors use a conditional variational autoencoder (CVAE) with recurrent neural networks (RNNs) to encode observed trajectories and decode multi-modal future trajectories. This process can suffer from accumulated errors over long prediction horizons (>= 2 seconds). This letter presents BiTraP, a goal-conditioned hi-directional multi-modal trajectory prediction method based on the CVAE. BiTraP estimates the goal (end-point) of trajectories and introduces a novel bidirectional decoder to improve longer-term trajectory prediction accuracy. Extensive experiments show that BiTraP generalizes to both first-person view (FPV) and bird's-eye view (BEV) scenarios and outperforms state-of-the-art results by similar to 10-50%. We also show that different choices of non-parametric versus parametric target models in the CVAE directly influence the predicted multi-modal trajectory distributions. These results provide guidance on trajectory predictor design for robotic applications such as collision avoidance and navigation systems. Our code is available at: bups://github.com/untautobots/bidireaction-trajectory-prediction.
引用
收藏
页码:1463 / 1470
页数:8
相关论文
共 45 条
[1]   Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].
Alahi, Alexandre ;
Goel, Kratarth ;
Ramanathan, Vignesh ;
Robicquet, Alexandre ;
Li Fei-Fei ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :961-971
[2]  
Anderson C, 2019, IEEE INT C INT ROBOT, P4236, DOI [10.1109/IROS40897.2019.8967857, 10.1109/iros40897.2019.8967857]
[3]  
[Anonymous], 2019, ARXIV191005449
[4]  
[Anonymous], 2019, ARXIV190800024
[5]   Accurate and Diverse Sampling of Sequences based on a "Best of Many" Sample Objective [J].
Bhattacharyya, Apratim ;
Schiele, Bernt ;
Fritz, Mario .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8485-8493
[6]   Long-Term On-Board Prediction of People in Traffic Scenes under Uncertainty [J].
Bhattacharyya, Apratim ;
Fritz, Mario ;
Schiele, Bernt .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4194-4202
[7]  
Bütepage J, 2018, IEEE INT CONF ROBOT, P4563, DOI 10.1109/ICRA.2018.8460651
[8]  
Deo N., 2020, ARXIV200100735
[9]  
Deo N, 2018, IEEE INT VEH SYM, P1179, DOI 10.1109/IVS.2018.8500493
[10]   Bio-LSTM: A Biomechanically Inspired Recurrent Neural Network for 3-D Pedestrian Pose and Gait Prediction [J].
Du, Xiaoxiao ;
Vasudevan, Ram ;
Johnson-Roberson, Matthew .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02) :1501-1508