Uncertainty-Aware Data Augmentation for Offline Reinforcement Learning

被引:0
|
作者
Su, Yunjie [1 ]
Kong, Yilun [1 ]
Wang, Xueqian [1 ]
机构
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China
来源
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年
关键词
Data augmentation; Uncertainty estimation; Out of distribution; Offline reinforcement learning;
D O I
10.1109/IJCNN54540.2023.10191211
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the key challenges in Offline Reinforcement Learning is that it cannot conduct further environment exploration and performs poorly in terms of out-of-distribution generalizations. Data augmentation is commonly used to solve the issue of limited coverage of the full state-action space in static offline dataset. However, the existing data augmentation methods for proprioceptive observation suffer from the dilemma where the data coverage is often limited by tight constraints, while aggressive methods may exacerbate the performance. At the heart of this phenomenon are the diverged action distribution and the high uncertainty of the value function. In this paper, we propose to extend the static offline datasets during training by adding gradient-based perturbation to the state and utilizing the estimated uncertainty of the value function to constrain the range of the gradient. The estimated uncertainty of the value function works as a guidance to adjust the range of augmentation automatically, ensuring the adaptability and reliability of the state perturbation. The proposed algorithm Uncertainty-Aware Data Augmentation(UADA), is plugged into various standard offline RL algorithms and evaluated on several offline reinforcement learning tasks. The empirical results confirm that UADA substantially improves the performance and achieves better model stability compared with the original algorithms.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Uncertainty-Aware COVID-19 Detection from Imbalanced Sound Data
    Xia, Tong
    Han, Jing
    Qendro, Lorena
    Dang, Ting
    Mascolo, Cecilia
    INTERSPEECH 2021, 2021, : 2951 - 2955
  • [22] Model-Based Offline Reinforcement Learning with Uncertainty Estimation and Policy Constraint
    Zhu J.
    Du C.
    Dullerud G.E.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (12): : 1 - 13
  • [23] Uncertainty-aware point cloud segmentation for infrastructure projects using Bayesian deep learning
    Vassilev, Hristo
    Laska, Marius
    Blankenbach, Joerg
    AUTOMATION IN CONSTRUCTION, 2024, 164
  • [24] False Correlation Reduction for Offline Reinforcement Learning
    Deng, Zhihong
    Fu, Zuyue
    Wang, Lingxiao
    Yang, Zhuoran
    Bai, Chenjia
    Zhou, Tianyi
    Wang, Zhaoran
    Jiang, Jing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (02) : 1199 - 1211
  • [25] Significance extraction based on data augmentation for reinforcement learning
    Han, Yuxi
    Li, Dequan
    Yang, Yang
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2025, : 385 - 399
  • [26] Benchmarking Offline Reinforcement Learning
    Tittaferrante, Andrew
    Yassine, Abdulsalam
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 259 - 263
  • [27] Robust Tracking via Uncertainty-Aware Semantic Consistency
    Ma, Jie
    Lan, Xiangyuan
    Zhong, Bineng
    Li, Guorong
    Tang, Zhenjun
    Li, Xianxian
    Ji, Rongrong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1740 - 1751
  • [28] Uncertainty-aware image inpainting with adaptive feedback network
    Ma, Xin
    Zhou, Xiaoqiang
    Huang, Huaibo
    Jia, Gengyun
    Wang, Yaohui
    Chen, Xinyuan
    Chen, Cunjian
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [29] Uncertainty-Aware Semantic Guidance and Estimation for Image Inpainting
    Liao, Liang
    Xiao, Jing
    Wang, Zheng
    Lin, Chia-Wen
    Satoh, Shin'ichi
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) : 310 - 323
  • [30] μDARTS: Model Uncertainty-Aware Differentiable Architecture Search
    Chakraborty, Biswadeep
    Mukhopadhyay, Saibal
    IEEE ACCESS, 2022, 10 : 98670 - 98682