A reinforcement learning intelligent deductive model with pre-trained sequence information

被引：0

作者：

Han, Xinyu ^{[1
]}

Xu, Huosheng ^{[1
,2
]}

Yu, Hao ^{[1
]}

Li, Sizhao ^{[1
]}

机构：

[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China

[2] Wuhan Digital Engn Inst, Wuhan, Hubei, Peoples R China

来源：

INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION | 2023年 / 22卷 / 04期

基金：

国家重点研发计划;

关键词：

reinforcement learning; trajectory prediction; intelligent deduction; neural networks; PREDICTION;

D O I：

10.1504/IJBIC.2023.136098

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Agent trajectory prediction is an increasingly popular topic in computer vision and autonomous driving. With the help of deep learning and big data, it is possible to understand the interaction model between agents hidden in complex environments. Existing methods usually pay more attention to the average trajectory offset of the agent while ignoring the distribution differences of the target. This issue results inevitable performance decrease. To address this issue, we propose a novel reinforcement learning intelligent deduction model (RLDM). It achieves joint reasoning of goals and paths in a unified framework, and accurately predicts trajectories in a short period of time with fewer datasets. Specifically, an end-to-end time-series pre-training module is proposed to explore the agent's training state reward and goal reward. Moreover, a prediction module based on the combination of kinematics and environmental background is proposed to explore the agent motion characteristics. By this way, acting in a purely reactive manner is better relieved. Practical trajectory prediction experiments are designed, and the experimental results verify the superior performance of our proposed model. The model experiment results are improved by 2% and 11% on the ADE and FDE metric on average.

引用

页码：195 / 205

页数：12

共 24 条

[1] Social LSTM: Human Trajectory Prediction in Crowded Spaces
Alahi, Alexandre
Goel, Kratarth
Ramanathan, Vignesh
Robicquet, Alexandre
Li Fei-Fei
Savarese, Silvio
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 961 - 971
[2] Capobianco S., 2022, arXiv
[3] Regularising neural networks for future trajectory prediction via inverse reinforcement learning framework
Choi, Dooseop
Min, Kyoungwook
Choi, Jeongdan
[J]. IET COMPUTER VISION, 2020, 14 (05) : 192 - 200
[4] DSA-GAN: Driving Style Attention Generative Adversarial Network for Vehicle Trajectory Prediction
Choi, Seungwon
Kweon, Nahyun
Yang, Chanuk
Kim, Dongchan
Shon, Hyukju
Choi, Jaewoong
Huh, Kunsoo
[J]. 2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1515 - 1520
[5] A Two-Block RNN-Based Trajectory Prediction From Incomplete Trajectory
Fujii, Ryo
Vongkulbhisal, Jayakorn
Hachiuma, Ryo
Saito, Hideo
[J]. IEEE ACCESS, 2021, 9 : 56140 - 56151
[6] Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks
Gupta, Agrim
Johnson, Justin
Li Fei-Fei
Savarese, Silvio
Alahi, Alexandre
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2255 - 2264
[7] Multimodal Deep Generative Models for Trajectory Prediction: A Conditional Variational Autoencoder Approach
Ivanovic, Boris
Leung, Karen
Schmerling, Edward
Pavone, Marco
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 295 - 302
[8] Kuefler A, 2017, IEEE INT VEH SYM, P204, DOI 10.1109/IVS.2017.7995721
[9] Conditional Generative Neural System for Probabilistic Trajectory Prediction
Li, Jiachen
Ma, Hengbo
Tomizuka, Masayoshi
[J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 6150 - 6156
[10] A Recurrent Attention and Interaction Model for Pedestrian Trajectory Prediction
Li, Xuesong
Liu, Yating
Wang, Kunfeng
Wang, Fei-Yue
[J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 7 (05) : 1361 - 1370

← 1 2 3 →