Enhancing Autonomous Driving With Spatial Memory and Attention in Reinforcement Learning

Cited by: 0
Authors
Gerasyov, Matvey [1]
Savchenko, Andrey V. [2,3]
Makarov, Ilya [4,5,6]
Affiliations
[1] HSE Univ, Sch Data Anal & Artificial Intelligence, Moscow 101000, Russia
[2] Sber AI Lab, Moscow 117312, Russia
[3] HSE Univ, Lab Algorithms & Technol Network Anal, Nizhnii Novgorod 603155, Russia
[4] AIRI, Moscow 105064, Russia
[5] ISP RAS, Moscow 109004, Russia
[6] Natl Res Nucl Univ MEPhI, Artificial Intelligence Res Ctr, Moscow 115409, Russia
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Long short term memory; Vectors; Visualization; Transformers; Head; Autonomous vehicles; Trajectory; Benchmark testing; Training; Tensors; Attention; deep reinforcement learning; partially observable Markov decision process;
DOI
10.1109/ACCESS.2024.3486602
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Reinforcement learning in environments with visual observations presents challenges due to incomplete individual observations. The lack of complete information leads to increased uncertainty in decision-making, which requires agents to be supplemented with a memory module to retain information about previous observations. Our paper proposes a novel spatial memory mechanism with a flexible access system based on the multihead attention mechanism. Through experiments in the Atari benchmark and multiple autonomous driving environments, our approach outperforms agents using classical convolutional and recurrent neural networks. Further analysis reveals repeated interpretive patterns in attention distribution among trained agents. This study highlights the effectiveness of spatial memory and attention mechanisms in improving the efficiency of deep reinforcement learning in partially observable environments.
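The abstract names a spatial memory that is read through multihead attention but gives no implementation details. The following is only a minimal sketch of that general idea, not the authors' architecture: it assumes the memory is a flattened grid of feature vectors, omits the learned query/key/value projections a real model would have, and uses a hypothetical function name (`multihead_read`) chosen for illustration.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def multihead_read(query, memory, num_heads):
    """Read from a flattened spatial memory with scaled dot-product attention.

    query:  list of d floats (assumed: features of the current observation)
    memory: list of cells, each a list of d floats (assumed: an H x W grid, flattened)
    Each head attends using its own slice of the feature dimension.
    Returns (read_vector, per_head_attention_weights).
    """
    d = len(query)
    assert d % num_heads == 0, "feature size must divide evenly across heads"
    dh = d // num_heads
    read, weights = [], []
    for h in range(num_heads):
        lo, hi = h * dh, (h + 1) * dh
        q = query[lo:hi]
        # Similarity between this head's query slice and every memory cell.
        scores = [
            sum(a * b for a, b in zip(q, cell[lo:hi])) / math.sqrt(dh)
            for cell in memory
        ]
        w = softmax(scores)
        weights.append(w)
        # Weighted sum of the memory cells for this head's feature slice.
        for j in range(lo, hi):
            read.append(sum(wi * cell[j] for wi, cell in zip(w, memory)))
    return read, weights

# Hypothetical 2 x 2 memory grid (4 cells), 4 features per cell, 2 heads.
memory = [
    [1.0, 0.0, 0.0, 1.0],
    [0.0, 1.0, 1.0, 0.0],
    [0.5, 0.5, 0.5, 0.5],
    [0.0, 0.0, 0.0, 0.0],
]
query = [1.0, 0.0, 0.0, 1.0]
read, weights = multihead_read(query, memory, num_heads=2)
```

Each row of `weights` is one head's attention distribution over the memory cells; per-head distributions of this kind are what an analysis of attention patterns would inspect.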
Pages: 173316-173324
Number of pages: 9
Related papers
50 in total
  • [41] Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network
    Wu, Yuanqing
    Liao, Siqin
    Liu, Xiang
    Li, Zhihang
    Lu, Renquan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3680 - 3690
  • [42] Vision-Based Autonomous Driving: A Hierarchical Reinforcement Learning Approach
    Wang, Jiao
    Sun, Haoyi
    Zhu, Can
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) : 11213 - 11226
  • [43] EPNet: An Efficient Postprocessing Network for Enhancing Semantic Segmentation in Autonomous Driving
    Sun, Libo
    Xia, Jiatong
    Xie, Hui
    Sun, Changming
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [44] Maximum Entropy Inverse Reinforcement Learning Using Monte Carlo Tree Search for Autonomous Driving
    da Silva, Junior Anderson Rodrigues
    Grassi Jr, Valdir
    Wolf, Denis Fernando
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (09) : 11552 - 11562
  • [45] Unsupervised Reinforcement Learning for Multi-Task Autonomous Driving: Expanding Skills and Cultivating Curiosity
    Ma, Zhenyu
    Liu, Xinyi
    Huang, Yanjun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 14209 - 14219
  • [46] Adaptive Path-Tracking Controller Embedded With Reinforcement Learning and Preview Model for Autonomous Driving
    Xia, Qi
    Chen, Peng
    Xu, Guoyan
    Sun, Haodong
    Li, Liang
    Yu, Guizhen
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (03) : 3736 - 3750
  • [47] Pre-training with asynchronous supervised learning for reinforcement learning based autonomous driving
    Wang, Yunpeng
    Zheng, Kunxian
    Tian, Daxin
    Duan, Xuting
    Zhou, Jianshan
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2021, 22 (05) : 673 - 686
  • [48] Enhancing BVR Air Combat Agent Development With Attention-Driven Reinforcement Learning
    Kuroswiski, Andre R.
    Wu, Annie S.
    Passaro, Angelo
    IEEE ACCESS, 2025, 13 : 70446 - 70463
  • [49] Conditional Predictive Behavior Planning With Inverse Reinforcement Learning for Human-Like Autonomous Driving
    Huang, Zhiyu
    Liu, Haochen
    Wu, Jingda
    Lv, Chen
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (07) : 7244 - 7258
  • [50] Spatial-temporal recurrent reinforcement learning for autonomous ships
    Waltz, Martin
    Okhrin, Ostap
    NEURAL NETWORKS, 2023, 165 : 634 - 653