STATE REPRESENTATION LEARNING FOR EFFECTIVE DEEP REINFORCEMENT LEARNING

被引:0
|
作者
Zhao, Jian [1 ]
Zhou, Wengang [1 ]
Zhao, Tianyu [1 ]
Zhou, Yun [1 ]
Li, Houqiang [1 ]
机构
[1] Univ Sci & Technol China, EEIS Dept, CAS Key Lab GIPAS, Beijing, Peoples R China
关键词
Representation learning; reinforcement learning;
D O I
10.1109/icme46284.2020.9102924
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recent years have witnessed the great success of deep reinforcement learning (DRL) on a variety of vision games. Although DNN has demonstrated strong power in representation learning, such capacity is under-explored in most DRL works whose focus is usually on optimization solvers. In fact, we discover that the state feature learning is the main obstacle for further improvement of DRL algorithms. To address this issue, we propose a new state representation learning scheme with our Adjacent State Consistency Loss (ASC Loss). The loss is defined based on the hypothesis that there are fewer changes between adjacent states than that of far apart ones, since scenes in videos generally evolve smoothly. In this paper, we exploit ASC loss as an assistant of RL loss in the training phase to boost the state feature learning. We conduct evaluation on Atari games and MuJoCo continuous control tasks, which demonstrates that our method is superior to OpenAI baselines.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Bootstrap State Representation Using Style Transfer for Better Generalization in Deep Reinforcement Learning
    Rahman, Md Masudur
    Xue, Yexiang
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 100 - 115
  • [22] An Experimental Study on State Representation Extraction for Vision-Based Deep Reinforcement Learning
    Ren, JunkaiY
    Zeng, Yujun
    Zhou, Sihang
    Zhang, Yichuan
    APPLIED SCIENCES-BASEL, 2021, 11 (21):
  • [23] Deep representation learning and reinforcement learning for workpiece setup optimization in CNC milling
    Vladimir Samsonov
    Enslin Chrismarie
    Hans-Georg Köpken
    Schirin Bär
    Daniel Lütticke
    Tobias Meisen
    Production Engineering, 2023, 17 : 847 - 859
  • [24] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Morales, Eduardo F.
    Murrieta-Cid, Rafael
    Becerra, Israel
    Esquivel-Basaldua, Marco A.
    INTELLIGENT SERVICE ROBOTICS, 2021, 14 (05) : 773 - 805
  • [25] Video Representation Learning for Decoupled Deep Reinforcement Learning Applied to Autonomous Driving
    Mohammed, Shawan Taha
    Kastouri, Mohamed
    Niederfahrenhorst, Artur
    Ascheid, Gerd
    2023 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION, SII, 2023,
  • [26] Deep representation learning and reinforcement learning for workpiece setup optimization in CNC milling
    Samsonov, Vladimir
    Chrismarie, Enslin
    Koepken, Hans-Georg
    Baer, Schirin
    Luetticke, Daniel
    Meisen, Tobias
    PRODUCTION ENGINEERING-RESEARCH AND DEVELOPMENT, 2023, 17 (06): : 847 - 859
  • [27] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Eduardo F. Morales
    Rafael Murrieta-Cid
    Israel Becerra
    Marco A. Esquivel-Basaldua
    Intelligent Service Robotics, 2021, 14 : 773 - 805
  • [28] The State of Sparse Training in Deep Reinforcement Learning
    Graesser, Laura
    Evci, Utku
    Elsen, Erich
    Castro, Pablo Samuel
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [29] Domain adaptive state representation alignment for reinforcement learning
    Li, Dongfen
    Meng, Lichao
    Li, Jingjing
    Lu, Ke
    Yang, Yang
    INFORMATION SCIENCES, 2022, 609 : 1353 - 1368
  • [30] NEURAL DISTILLATION AS A STATE REPRESENTATION BOTTLENECK IN REINFORCEMENT LEARNING
    Guillet, Valentin
    Wilson, Dennis
    Aguilar-Melchor, Carlos
    Rachelson, Emmanuel
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199