A reinforcement learning intelligent deductive model with pre-trained sequence information

被引:0
作者
Han, Xinyu [1 ]
Xu, Huosheng [1 ,2 ]
Yu, Hao [1 ]
Li, Sizhao [1 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
[2] Wuhan Digital Engn Inst, Wuhan, Hubei, Peoples R China
基金
国家重点研发计划;
关键词
reinforcement learning; trajectory prediction; intelligent deduction; neural networks; PREDICTION;
D O I
10.1504/IJBIC.2023.136098
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Agent trajectory prediction is an increasingly popular topic in computer vision and autonomous driving. With the help of deep learning and big data, it is possible to understand the interaction model between agents hidden in complex environments. Existing methods usually pay more attention to the average trajectory offset of the agent while ignoring the distribution differences of the target. This issue results inevitable performance decrease. To address this issue, we propose a novel reinforcement learning intelligent deduction model (RLDM). It achieves joint reasoning of goals and paths in a unified framework, and accurately predicts trajectories in a short period of time with fewer datasets. Specifically, an end-to-end time-series pre-training module is proposed to explore the agent's training state reward and goal reward. Moreover, a prediction module based on the combination of kinematics and environmental background is proposed to explore the agent motion characteristics. By this way, acting in a purely reactive manner is better relieved. Practical trajectory prediction experiments are designed, and the experimental results verify the superior performance of our proposed model. The model experiment results are improved by 2% and 11% on the ADE and FDE metric on average.
引用
收藏
页码:195 / 205
页数:12
相关论文
共 24 条
  • [1] Social LSTM: Human Trajectory Prediction in Crowded Spaces
    Alahi, Alexandre
    Goel, Kratarth
    Ramanathan, Vignesh
    Robicquet, Alexandre
    Li Fei-Fei
    Savarese, Silvio
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 961 - 971
  • [2] Capobianco S., 2022, arXiv
  • [3] Regularising neural networks for future trajectory prediction via inverse reinforcement learning framework
    Choi, Dooseop
    Min, Kyoungwook
    Choi, Jeongdan
    [J]. IET COMPUTER VISION, 2020, 14 (05) : 192 - 200
  • [4] DSA-GAN: Driving Style Attention Generative Adversarial Network for Vehicle Trajectory Prediction
    Choi, Seungwon
    Kweon, Nahyun
    Yang, Chanuk
    Kim, Dongchan
    Shon, Hyukju
    Choi, Jaewoong
    Huh, Kunsoo
    [J]. 2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1515 - 1520
  • [5] A Two-Block RNN-Based Trajectory Prediction From Incomplete Trajectory
    Fujii, Ryo
    Vongkulbhisal, Jayakorn
    Hachiuma, Ryo
    Saito, Hideo
    [J]. IEEE ACCESS, 2021, 9 : 56140 - 56151
  • [6] Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks
    Gupta, Agrim
    Johnson, Justin
    Li Fei-Fei
    Savarese, Silvio
    Alahi, Alexandre
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2255 - 2264
  • [7] Multimodal Deep Generative Models for Trajectory Prediction: A Conditional Variational Autoencoder Approach
    Ivanovic, Boris
    Leung, Karen
    Schmerling, Edward
    Pavone, Marco
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 295 - 302
  • [8] Kuefler A, 2017, IEEE INT VEH SYM, P204, DOI 10.1109/IVS.2017.7995721
  • [9] Conditional Generative Neural System for Probabilistic Trajectory Prediction
    Li, Jiachen
    Ma, Hengbo
    Tomizuka, Masayoshi
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 6150 - 6156
  • [10] A Recurrent Attention and Interaction Model for Pedestrian Trajectory Prediction
    Li, Xuesong
    Liu, Yating
    Wang, Kunfeng
    Wang, Fei-Yue
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 7 (05) : 1361 - 1370