Guided Reinforcement Learning via Sequence Learning

被引:0
作者
Ramamurthy, Rajkumar [1 ]
Sifa, Rafet [1 ]
Luebbering, Max [1 ]
Bauckhage, Christian [1 ]
机构
[1] Fraunhofer IAIS, St Augustin, Germany
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II | 2020年 / 12397卷
关键词
Reinforcement Learning; Exploration; Novelty Search; Representation learning; Sequence learning;
D O I
10.1007/978-3-030-61616-8_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Applications of Reinforcement Learning (RL) suffer from high sample complexity due to sparse reward signals and inadequate exploration. Novelty Search (NS) guides as an auxiliary task, in this regard to encourage exploration towards unseen behaviors. However, NS suffers from critical drawbacks concerning scalability and generalizability since they are based off instance learning. Addressing these challenges, we previously proposed a generic approach using unsupervised learning to learn representations of agent behaviors and use reconstruction losses as novelty scores. However, it considered only fixed-length sequences and did not utilize sequential information of behaviors. Therefore, we here extend this approach by using sequential auto-encoders to incorporate sequential dependencies. Experimental results on benchmark tasks show that this sequence learning aids exploration outperforming previous novelty search methods.
引用
收藏
页码:335 / 345
页数:11
相关论文
共 50 条
[31]   Methodologies for Imitation Learning via Inverse Reinforcement Learning: A Review [J].
Zhang K. ;
Yu Y. .
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (02) :254-261
[32]   Bayesian Deep Reinforcement Learning via Deep Kernel Learning [J].
Xuan, Junyu ;
Lu, Jie ;
Yan, Zheng ;
Zhang, Guangquan .
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (01) :164-171
[33]   State-novelty guided action persistence in deep reinforcement learning [J].
Hu, Jianshu ;
Weng, Paul ;
Ban, Yutong .
MACHINE LEARNING, 2025, 114 (02)
[34]   Vulnerability Mining of Deep Learning Framework for Model Generation Guided by Reinforcement Learning [J].
Pan L. ;
Liu L. ;
Luo S. ;
Zhang Z. .
Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2024, 44 (05) :521-529
[35]   Nondominated Policy-Guided Learning in Multi-Objective Reinforcement Learning [J].
Kim, Man-Je ;
Park, Hyunsoo ;
Ahn, Chang Wook .
ELECTRONICS, 2022, 11 (07)
[36]   Learning to calibrate: Reinforcement learning for guided calibration of visual-inertial rigs [J].
Nobre, Fernando ;
Heckman, Christoffer .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2019, 38 (12-13) :1388-1402
[37]   Online Reinforcement Learning for Designing Automotive Hybrid Assembly Sequence: A Task Clustering-Guided Approach [J].
Elhoud, Anass ;
Piranda, Benoit ;
De Matos, Raphael ;
Bourgeois, Julien .
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT II, AIAI 2024, 2024, 712 :115-128
[38]   Analytically Guided Reinforcement Learning for Green It and Fluent Traffic [J].
Korecki, Marcin ;
Helbing, Dirk .
IEEE ACCESS, 2022, 10 :96348-96358
[39]   A Hardware Accelerator for Language-Guided Reinforcement Learning [J].
Shiri, Aidin ;
Mazumder, Arnab Neelim ;
Prakash, Bharat ;
Homayoun, Houman ;
Waytowich, Nicholas R. ;
Mohsenin, Tinoosh .
IEEE DESIGN & TEST, 2022, 39 (03) :37-44
[40]   Security Reinforcement Learning Guided by Finite Deterministic Automata [J].
Shi, YiMing ;
Wang, QingLong .
2024 9TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS, ICCCS 2024, 2024, :1330-1334