Uncertainty-Aware Data Augmentation for Offline Reinforcement Learning

被引:0
|
作者
Su, Yunjie [1 ]
Kong, Yilun [1 ]
Wang, Xueqian [1 ]
机构
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China
来源
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年
关键词
Data augmentation; Uncertainty estimation; Out of distribution; Offline reinforcement learning;
D O I
10.1109/IJCNN54540.2023.10191211
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the key challenges in Offline Reinforcement Learning is that it cannot conduct further environment exploration and performs poorly in terms of out-of-distribution generalizations. Data augmentation is commonly used to solve the issue of limited coverage of the full state-action space in static offline dataset. However, the existing data augmentation methods for proprioceptive observation suffer from the dilemma where the data coverage is often limited by tight constraints, while aggressive methods may exacerbate the performance. At the heart of this phenomenon are the diverged action distribution and the high uncertainty of the value function. In this paper, we propose to extend the static offline datasets during training by adding gradient-based perturbation to the state and utilizing the estimated uncertainty of the value function to constrain the range of the gradient. The estimated uncertainty of the value function works as a guidance to adjust the range of augmentation automatically, ensuring the adaptability and reliability of the state perturbation. The proposed algorithm Uncertainty-Aware Data Augmentation(UADA), is plugged into various standard offline RL algorithms and evaluated on several offline reinforcement learning tasks. The empirical results confirm that UADA substantially improves the performance and achieves better model stability compared with the original algorithms.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Conservative network for offline reinforcement learning
    Peng, Zhiyong
    Liu, Yadong
    Chen, Haoqiang
    Zhou, Zongtan
    KNOWLEDGE-BASED SYSTEMS, 2023, 282
  • [42] Federated Uncertainty-Aware Aggregation for Fundus Diabetic Retinopathy Staging
    Wang, Meng
    Wang, Lianyu
    Xu, Xinxing
    Zou, Ke
    Qian, Yiming
    Goh, Rick Siow Mong
    Liu, Yong
    Fu, Huazhu
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 : 222 - 232
  • [43] Uncertainty-Aware Denoising Network for Artifact Removal in EEG Signals
    Jin, Xiyuan
    Wang, Jing
    Liu, Lei
    Lin, Youfang
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 4470 - 4480
  • [44] Offline reinforcement learning with representations for actions
    Lou, Xingzhou
    Yin, Qiyue
    Zhang, Junge
    Yu, Chao
    He, Zhaofeng
    Cheng, Nengjie
    Huang, Kaiqi
    INFORMATION SCIENCES, 2022, 610 : 746 - 758
  • [45] Hyperparameter Tuning in Offline Reinforcement Learning
    Tittaferrante, Andrew
    Yassine, Abdulsalam
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 585 - 590
  • [46] Offline Reinforcement Learning at Multiple Frequencies
    Burns, Kaylee
    Yu, Tianhe
    Finn, Chelsea
    Hausman, Karol
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 2041 - 2051
  • [47] Deep Adaptive Pansharpening via Uncertainty-Aware Image Fusion
    Zheng, Kaiwen
    Huang, Jie
    Zhou, Man
    Hong, Danfeng
    Zhao, Feng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [48] Uncertainty-Aware Blind Image Quality Assessment in the Laboratory and Wild
    Zhang, Weixia
    Ma, Kede
    Zhai, Guangtao
    Yang, Xiaokang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3474 - 3486
  • [49] A Hybrid Framework for Uncertainty-Aware Depth Prediction in the Underwater Environment
    Marques, Filipe
    Castro, Filipa
    Parente, Manuel
    Costa, Pedro
    2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2020), 2020, : 102 - 107
  • [50] Uncertainty-aware semi-supervised few shot segmentation
    Kim, Soopil
    Chikontwe, Philip
    An, Sion
    Park, Sang Hyun
    PATTERN RECOGNITION, 2023, 137