Temporal prediction model with context-aware data augmentation for robust visual reinforcement learning

Authors
Yue, Xinkai [1]
Ge, Hongwei [1]
He, Xin [1]
Hou, Yaqing [1]
Affiliations
[1] College of Computer Science and Technology, Dalian University of Technology, Dalian, China
Funding
National Natural Science Foundation of China
Keywords
Benchmarking - Forecasting - Learning systems - Pixels - Robotics
DOI
10.1007/s00521-024-10251-w
Abstract
While reinforcement learning has shown promise in solving continuous control tasks from visual inputs, it remains challenging to learn robust representations from high-dimensional observations and to generalize to unseen environments with distracting elements. Recently, strong data augmentation has been applied to increase the diversity of the training data, but it may damage task-relevant pixels and thus hinder the optimization of reinforcement learning. To address this, the paper proposes the temporal prediction model with context-aware data augmentation (TPMC), a framework that incorporates context-aware strong augmentation into the dynamics model to learn robust policies. Specifically, TPMC uses a gradient-based saliency map to identify and preserve task-relevant pixels during strong augmentation, generating reliable augmented images for stable training. Moreover, temporal prediction consistency between strongly and weakly augmented views is enforced to construct a contrastive objective for learning shared task-relevant representations. Extensive experiments evaluate performance on the DMControl-GB benchmarks and several robotic manipulation tasks. Experimental results demonstrate that TPMC achieves better data efficiency and generalization than other state-of-the-art methods. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
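The saliency-guided augmentation step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the quantile threshold, the blend weight `alpha`, the random-overlay choice of strong augmentation, and the toy linear critic used to obtain a gradient are all assumptions made for illustration.

```python
import numpy as np


def saliency_mask(saliency, keep_quantile=0.9):
    """Binary (H, W) mask: 1 for pixels at or above the given saliency quantile.

    `keep_quantile` is a hypothetical knob, not a value from the paper.
    """
    threshold = np.quantile(saliency, keep_quantile)
    return (saliency >= threshold).astype(np.float32)


def context_aware_overlay(obs, distractor, mask, alpha=0.5):
    """Blend a distractor image into `obs`, leaving masked pixels untouched.

    The overlay blend stands in for "strong augmentation" here (an assumption).
    """
    strong = alpha * obs + (1.0 - alpha) * distractor
    m = mask[..., None]  # broadcast the (H, W) mask over the channel axis
    return m * obs + (1.0 - m) * strong


rng = np.random.default_rng(0)
obs = rng.random((8, 8, 3)).astype(np.float32)         # toy observation
distractor = rng.random((8, 8, 3)).astype(np.float32)  # toy distracting image

# Toy linear critic Q(obs) = sum(w * obs); its gradient w.r.t. obs is simply w.
# A real agent would obtain this gradient from its critic via autodiff.
w = rng.normal(size=(8, 8, 3)).astype(np.float32)
saliency = np.abs(w).max(axis=-1)  # per-pixel gradient magnitude

mask = saliency_mask(saliency, keep_quantile=0.9)
aug = context_aware_overlay(obs, distractor, mask)
```

In this sketch, pixels the toy critic is most sensitive to are copied through unchanged, while every other pixel receives the overlay. How TPMC actually thresholds the saliency map and which strong augmentations it applies are not specified in the abstract.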
Pages: 19337-19352
Number of pages: 15