Generalization Enhancement of Visual Reinforcement Learning through Internal States

Authors
Yang, Hanlin [1]
Zhu, William [1]
Zhu, Xianchao [2]
Affiliations
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu 611731, Peoples R China
[2] Henan Univ Technol, Sch Artificial Intelligence & Big Data, Zhengzhou 450001, Peoples R China
Keywords
visual reinforcement learning; transfer learning; generalization
DOI
10.3390/s24144513
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
Visual reinforcement learning is important in various practical applications, such as video games, robotic manipulation, and autonomous navigation. However, a major challenge in visual reinforcement learning is generalization to unseen environments, that is, how agents handle environments with previously unseen backgrounds. This issue is caused mainly by the high unpredictability inherent in high-dimensional observation spaces. Techniques such as domain randomization and data augmentation have been explored to address this problem, but they still do not achieve satisfactory results. This paper proposes a new method named Internal States Simulation Auxiliary (ISSA), which uses internal states to improve generalization in visual reinforcement learning tasks. The method contains two agents, a teacher agent and a student agent. The teacher agent can directly access the environment's internal states and is used to facilitate the student agent's training; the student agent receives initial guidance from the teacher agent and subsequently continues to learn independently. From another perspective, the method can be divided into two phases: a transfer learning phase and a traditional visual reinforcement learning phase. In the first phase, the teacher agent interacts with the environments and imparts knowledge to the vision-based student agent. With the teacher agent's guidance, the student agent is able to discover more effective visual representations that address the high unpredictability of the high-dimensional observation space. In the second phase, the student agent learns autonomously from the visual information in the environment and ultimately becomes a vision-based reinforcement learning agent with enhanced generalization. The effectiveness of the method is evaluated on the DMControl Generalization Benchmark and on DrawerWorld with texture distortions. Preliminary results indicate that the method significantly improves generalization ability and performance in complex continuous control tasks.
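The two-phase structure described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how the teacher's guidance enters the student's objective, so the mean-squared imitation term, the additive combination with the RL loss, and all names (`PhaseSchedule`, `transfer_steps`, `imitation_loss`, `student_loss`, `imitation_weight`) are assumptions made here for illustration only.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class PhaseSchedule:
    """Hypothetical schedule: teacher guidance for the first `transfer_steps` steps."""
    transfer_steps: int

    def phase(self, step: int) -> str:
        # Phase 1: transfer learning (teacher guides the student).
        # Phase 2: the student learns from visual input on its own.
        return "transfer" if step < self.transfer_steps else "visual_rl"


def imitation_loss(student_action: Sequence[float],
                   teacher_action: Sequence[float]) -> float:
    """Assumed guidance signal: mean squared error between the two actions."""
    diffs = [(s - t) ** 2 for s, t in zip(student_action, teacher_action)]
    return sum(diffs) / len(diffs)


def student_loss(step: int,
                 schedule: PhaseSchedule,
                 rl_loss: float,
                 student_action: Sequence[float],
                 teacher_action: Sequence[float],
                 imitation_weight: float = 1.0) -> float:
    """Add the (assumed) imitation term only during the transfer phase."""
    if schedule.phase(step) == "transfer":
        guidance = imitation_loss(student_action, teacher_action)
        return rl_loss + imitation_weight * guidance
    # After the transfer phase the teacher is no longer consulted.
    return rl_loss


schedule = PhaseSchedule(transfer_steps=100)
early = student_loss(10, schedule, 2.0, [1.0, 0.0], [0.0, 0.0])
late = student_loss(500, schedule, 2.0, [1.0, 0.0], [0.0, 0.0])
```

In this sketch the teacher's action influences the student's loss only while `step < transfer_steps`; afterwards the student optimizes its reinforcement learning loss alone, mirroring the "initial guidance, then independent learning" description.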
Pages: 12