Generalization Enhancement of Visual Reinforcement Learning through Internal States

被引:0
作者
Yang, Hanlin [1 ]
Zhu, William [1 ]
Zhu, Xianchao [2 ]
机构
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu 611731, Peoples R China
[2] Henan Univ Technol, Sch Artificial Intelligence & Big Data, Zhengzhou 450001, Peoples R China
关键词
visual reinforcement learning; transfer learning; generalization;
D O I
10.3390/s24144513
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Visual reinforcement learning is important in various practical applications, such as video games, robotic manipulation, and autonomous navigation. However, a major challenge in visual reinforcement learning is the generalization to unseen environments, that is, how agents manage environments with previously unseen backgrounds. This issue is triggered mainly by the high unpredictability inherent in high-dimensional observation space. To deal with this problem, techniques including domain randomization and data augmentation have been explored; nevertheless, these methods still cannot attain a satisfactory result. This paper proposes a new method named Internal States Simulation Auxiliary (ISSA), which uses internal states to improve generalization in visual reinforcement learning tasks. Our method contains two agents, a teacher agent and a student agent: the teacher agent has the ability to directly access the environment's internal states and is used to facilitate the student agent's training; the student agent receives initial guidance from the teacher agent and subsequently continues to learn independently. From another perspective, our method can be divided into two phases, the transfer learning phase and traditional visual reinforcement learning phase. In the first phase, the teacher agent interacts with environments and imparts knowledge to the vision-based student agent. With the guidance of the teacher agent, the student agent is able to discover more effective visual representations that address the high unpredictability of high-dimensional observation space. In the next phase, the student agent autonomously learns from the visual information in the environment, and ultimately, it becomes a vision-based reinforcement learning agent with enhanced generalization. The effectiveness of our method is evaluated using the DMControl Generalization Benchmark and the DrawerWorld with texture distortions. Preliminary results indicate that our method significantly improves generalization ability and performance in complex continuous control tasks.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] An Optimal Transfer of Knowledge in Reinforcement Learning through Greedy Approach
    Kumari, Deepika
    Chaudhary, Mahima
    Mishra, Ashish Kumar
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [32] Specificity and generalization of visual perceptual learning in humans; an event-related potential study
    Ding, YL
    Song, Y
    Fan, S
    Qu, Z
    Chen, L
    NEUROREPORT, 2003, 14 (04) : 587 - 590
  • [33] Improving Batch Reinforcement Learning Performance through Transfer of Samples
    Lazaric, Alessandro
    Restelli, Marcello
    Bonarini, Andrea
    STAIRS 2008, 2008, 179 : 106 - 117
  • [34] Structural Generalization in Autonomous Cyber Incident Response with Message-Passing Neural Networks and Reinforcement Learning
    Nyberg, Jakob
    Johnson, Pontus
    2024 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2024, : 282 - 289
  • [35] A Model of External Memory for Navigation in Partially Observable Visual Reinforcement Learning Tasks
    Smith, Robert J.
    Heywood, Malcolm I.
    GENETIC PROGRAMMING, EUROGP 2019, 2019, 11451 : 162 - 177
  • [36] Enhancing fog load balancing through lifelong transfer learning of reinforcement learning agents
    Ebrahim, Maad
    Hafid, Abdelhakim
    Abid, Mohamed Riduan
    COMPUTER COMMUNICATIONS, 2025, 231
  • [37] Image Quality Assessment in Visual Reinforcement Learning for Fast-moving Targets
    Ryoo, Sanghyun
    Jeong, Jiseok
    Han, Soohee
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 22 (11) : 3303 - 3313
  • [38] Training an Agent to Find and Reach an Object in Different Environments using Visual Reinforcement Learning and Transfer Learning
    Santos Batista, Evelyn Conceicao
    Caarls, Wouter
    Forero, Leonardo A.
    Pacheco, Marco Aurelio C.
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 732 - 741
  • [39] Enhancing HVAC control systems through transfer learning with deep reinforcement learning agents
    Kadamala, Kevlyn
    Chambers, Des
    Barrett, Enda
    SMART ENERGY, 2024, 13
  • [40] Assessing the Generalization Gap of Learning-Based Speech Enhancement Systems in Noisy and Reverberant Environments
    Gonzalez P.
    Alstrom T.S.
    May T.
    IEEE/ACM Transactions on Audio Speech and Language Processing, 2023, 31 : 3390 - 3403