Generalization Enhancement of Visual Reinforcement Learning through Internal States

Authors
Yang, Hanlin [1]
Zhu, William [1]
Zhu, Xianchao [2]
Affiliations
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu 611731, Peoples R China
[2] Henan Univ Technol, Sch Artificial Intelligence & Big Data, Zhengzhou 450001, Peoples R China
Keywords
visual reinforcement learning; transfer learning; generalization
DOI
10.3390/s24144513
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Visual reinforcement learning is important in practical applications such as video games, robotic manipulation, and autonomous navigation. A major challenge, however, is generalization to unseen environments, that is, how agents cope with environments whose backgrounds they have not seen before. The problem stems mainly from the high unpredictability inherent in the high-dimensional observation space. Techniques such as domain randomization and data augmentation have been explored to address it, but they still do not yield satisfactory results. This paper proposes a new method, Internal States Simulation Auxiliary (ISSA), which uses internal states to improve generalization in visual reinforcement learning tasks. Our method involves two agents: a teacher agent, which can directly access the environment's internal states and is used to facilitate the student's training, and a student agent, which receives initial guidance from the teacher and then continues to learn independently. Viewed another way, the method consists of two phases: a transfer learning phase and a traditional visual reinforcement learning phase. In the first phase, the teacher agent interacts with the environments and imparts knowledge to the vision-based student agent. With this guidance, the student discovers more effective visual representations that address the high unpredictability of the high-dimensional observation space. In the second phase, the student agent learns autonomously from the visual information in the environment and ultimately becomes a vision-based reinforcement learning agent with enhanced generalization. The effectiveness of our method is evaluated on the DMControl Generalization Benchmark and on DrawerWorld with texture distortions. Preliminary results indicate that our method significantly improves generalization ability and performance in complex continuous control tasks.
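The first (transfer learning) phase described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the loss or the network design, so the example assumes a plain action-regression (behavior-cloning style) objective, a one-dimensional internal state, and linear policies. The names `teacher_action`, `render`, and `train_student` are hypothetical.

```python
import random


def teacher_action(state):
    # Hypothetical teacher: a fixed linear policy that reads the internal state.
    return -0.5 * state


def render(state, rng):
    # Hypothetical "visual" observation: a scaled view of the state plus
    # a small amount of distractor noise. The student never sees `state`.
    return 2.0 * state + rng.gauss(0.0, 0.01)


def train_student(transfer_steps=2000, lr=0.05, seed=0):
    # Phase 1 only (assumed objective): regress the student's action,
    # computed from the observation, onto the teacher's action.
    rng = random.Random(seed)
    w = 0.0  # student policy: action = w * observation
    for _ in range(transfer_steps):
        state = rng.uniform(-1.0, 1.0)
        obs = render(state, rng)
        err = w * obs - teacher_action(state)
        w -= lr * err * obs  # SGD step on 0.5 * err**2
    return w


if __name__ == "__main__":
    print(train_student())
```

In this toy setting the student's weight approaches -0.25, the value at which its observation-based action matches the teacher's state-based action. The second phase, in which the student continues with ordinary visual reinforcement learning on its own, is not shown.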
Pages: 12