BiTraP: Bi-Directional Pedestrian Trajectory Prediction With Multi-Modal Goal Estimation

Citations: 123

Authors:

Yao, Yu [1]

Atkins, Ella [2]

Johnson-Roberson, Matthew [3]

Vasudevan, Ram [4]

Du, Xiaoxiao [3]

Affiliations:

[1] Univ Michigan, Inst Robot, Ann Arbor, MI 48109 USA

[2] Univ Michigan, Dept Aerosp Engn, Ann Arbor, MI 48109 USA

[3] Univ Michigan, Dept Naval Architecture & Marine Engn, Ann Arbor, MI 48109 USA

[4] Univ Michigan, Dept Mech Engn, Ann Arbor, MI 48109 USA

Source:

IEEE ROBOTICS AND AUTOMATION LETTERS | 2021 / Vol. 6 / No. 2

Keywords:

Computer vision for automation; human and humanoid motion analysis and synthesis; deep learning methods; multi-modal trajectory prediction; goal-conditioned prediction;

DOI:

10.1109/LRA.2021.3056339

Chinese Library Classification (CLC):

TP24 [Robotics];

Discipline classification codes:

080202 ; 1405 ;

Abstract:

Pedestrian trajectory prediction is an essential task in robotic applications such as autonomous driving and robot navigation. State-of-the-art trajectory predictors use a conditional variational autoencoder (CVAE) with recurrent neural networks (RNNs) to encode observed trajectories and decode multi-modal future trajectories. This process can suffer from accumulated errors over long prediction horizons (>= 2 seconds). This letter presents BiTraP, a goal-conditioned bi-directional multi-modal trajectory prediction method based on the CVAE. BiTraP estimates the goal (end-point) of trajectories and introduces a novel bi-directional decoder to improve longer-term trajectory prediction accuracy. Extensive experiments show that BiTraP generalizes to both first-person view (FPV) and bird's-eye view (BEV) scenarios and outperforms state-of-the-art results by ~10-50%. We also show that different choices of non-parametric versus parametric target models in the CVAE directly influence the predicted multi-modal trajectory distributions. These results provide guidance on trajectory predictor design for robotic applications such as collision avoidance and navigation systems. Our code is available at: https://github.com/umautobots/bidireaction-trajectory-prediction.

Pages: 1463-1470

Page count: 8

